1 |
B. K. Sun, "A Study of Main Contents Extraction from Web News Pages based on XPath Analysis", Journal of The Korea Society of Computer and Information, Vol. 20, No. 7, pp. 1-7, July 2015.
|
2 |
J. Si, W. Wang, "A Template-based forum posts content extraction method", International Conference on ICECE, pp. 38-41, 2011.
|
3 |
R. Gunasundari, S. Karthikeyan, "A Study of content extraction from web pages based on links", International Journal of Data Mining & Knowledge Management Process (IJDKP), Vol. 2, No. 3, May 2012.
|
4 |
B. Zhou, C. Wang, Q. Su, "Chinese web page content extraction based on page content analysis", Journal of Computational Information Systems, Vol. 5, No. 6, pp. 1861-1871, Dec 2009.
|
5 |
S. Pretzsch, K. Muthmann, A. Schill, "FODEX-Towards generic data extraction from web forums", 26th International Conference on Advanced Information Networking and Applications Workshops, pp. 821-826, 2012.
|
6 |
Clearly, https://chrome.google.com/webstore/detail/clearly/iooicodkiihhpojmmeghjclgihfjdjhj
|
7 |
Readability, https://www.readability.com/
|
8 |
S. Gupta, G. Kaiser, D. Neistadt, and P. Grimm, "DOM-based content extraction of HTML documents", in WWW '03: Proceedings of the 12th International Conference on WWW, ACM, pp. 207-214, 2003.
|
9 |
N. Negm, P. Elkafrawy, A. B. Salem, "A Survey of Web Information Extraction Tools", International Journal of Computer Applications, Vol. 43, No. 7, pp. 19-27, April 2012.
|
10 |
H. Mohammadzadeh, T. Gottron, F. Schweiggert, G. Nakhaeizadeh, "A Fast and accurate approach for main content extraction based on character encoding", 22nd International Workshop on Database and Expert Systems Applications, pp. 167-171, 2011.
|
11 |
S. Y. Oh, "X2RD: Storing and Querying XML Data Using XPath to Relational Database", Journal of The Korea Society of Computer and Information, Vol. 14, No. 3, pp. 57-64, March 2009.
|
12 |
XPath, http://www.w3.org/TR/xpath/
|