http://dx.doi.org/10.9728/dcs.2017.18.5.949

HTML specification and semantics analysis of Korean news sites

Lee, Byoung-Hak (Department of Design, Hankyong University)
Publication Information
Journal of Digital Contents Society / v.18, no.5, 2017, pp. 949-956
Abstract
The visual interfaces of news sites look similar, but their HTML varies widely in specification and quality. As the HTML5 specification notes, describing HTML semantically is increasingly important so that any computer can understand the content being shared. In this study, I analysed the HTML code of 110 Korean news sites and compared it with that of 8 global news sites. The results show that 68% of the news sites are still written to the HTML4 specification, and only 10 of the 110 use the HTML5 specification with quality and semantics comparable to those of the global news sites. This suggests that the platforms of most Korean news sites have not changed since they were developed in the mid-2000s, and that they need to be upgraded now that language translation technologies are making it possible to share Korean digital content with the rest of the world.
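The abstract does not say how the specification and semantics of each page were determined. As a rough illustration only, the following Python sketch classifies a page by its DOCTYPE declaration and counts HTML5 semantic elements. The names `SpecProbe`, `classify_spec`, and `probe`, the chosen set of semantic tags, and the classification rules are all assumptions made for this example, not the procedure used in the study.

```python
from html.parser import HTMLParser

# HTML5 semantic elements to count; this particular selection is an assumption.
SEMANTIC_TAGS = {"article", "section", "nav", "header", "footer",
                 "aside", "main", "time", "figure"}


class SpecProbe(HTMLParser):
    """Records the first DOCTYPE declaration and counts semantic start tags."""

    def __init__(self):
        super().__init__()
        self.doctype = None
        self.semantic_counts = {}

    def handle_decl(self, decl):
        # Called for <!DOCTYPE ...>; decl excludes the leading "<!" and trailing ">".
        if self.doctype is None and decl.lower().startswith("doctype"):
            self.doctype = " ".join(decl.split())

    def handle_starttag(self, tag, attrs):
        # Tag names arrive lowercased from HTMLParser.
        if tag in SEMANTIC_TAGS:
            self.semantic_counts[tag] = self.semantic_counts.get(tag, 0) + 1


def classify_spec(doctype):
    """Maps a DOCTYPE string to a coarse specification label (heuristic)."""
    if doctype is None:
        return "unknown"
    d = doctype.lower()
    if d == "doctype html":
        return "HTML5"
    if "xhtml" in d:
        return "XHTML"
    if "html 4" in d:
        return "HTML4"
    return "other"


def probe(html_text):
    """Returns (specification label, semantic tag counts) for one page."""
    parser = SpecProbe()
    parser.feed(html_text)
    parser.close()
    return classify_spec(parser.doctype), parser.semantic_counts
```

Calling `probe(page_source)` on the saved source of an article page returns a label such as `"HTML5"` or `"HTML4"` together with a dictionary of semantic element counts. A DOCTYPE alone says nothing about markup quality, so a fuller analysis would also need to look at how those elements are actually used.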
Keywords
news sites; HTML specification; HTML5; semantics; semantic Web;
Citations & Related Records
1 W3C (World Wide Web Consortium). What is HTML? [internet]. Available: https://www.w3.org/TR/1999/REC-html401-19991224/intro/intro.html#h-2.2.
2 Berners-Lee, Tim (2001, May). The Semantic Web. Scientific American [internet]. Available: https://www.scientificamerican.com/article/the-semantic-web/.
3 W3C. HTML 5 [internet]. Available: https://www.w3.org/TR/2014/REC-html5-20141028/.
4 National Election Commission. The definition of internet journals [internet]. Available: http://www.nec.go.kr/portal/knowLaw/quanDetailView.do?contId=201202150112&contSid=0001&quanId=201203038058.
5 Hyun-Gee Jeon and Chan Koh, "Text Extraction Algorithm using the HTML Logical Structure Analysis", The Journal of Digital Contents Society, Vol. 16, No. 3, pp. 445-455, June 2015.
6 Daum News. HTML source [internet]. Available: http://v.media.daum.net/v/20170514094213264
7 Jeff P., Dan R., "Extracting Article Text from the Web with Maximum Subsequence Segmentation," The 18th international conference on World wide web, pp.971-980, 2009.
8 W3C. HTML 4 [internet]. Available: https://www.w3.org/TR/1999/REC-html401-19991224/intro/intro.html#h-2.3.2
9 Joongang-Il-Bo. HTML source [internet]. Available: http://news.joins.com/article/21557874?cloc=joongang|home|newslist1
10 Yonhap News. HTML source [internet]. Available: http://www.yonhapnews.co.kr/politics/2017/05/10/0501000000AKR20170510072400001.HTML?template=2085
11 Berners-Lee, Tim. Hypertext Markup Language - 2.0 [internet]. Available: https://www.w3.org/MarkUp/html-spec/html-spec_toc.html
12 Raggett, Dave. HTML 3.2 Reference Specification [internet]. Available: https://www.w3.org/TR/REC-html32-19970114
13 New York Times. HTML source [internet]. Available: https://www.nytimes.com/2017/05/09/opinion/an-agenda-for-south-koreas-new-leader.html?action=click&pgtype=Homepage&clickSource=story-heading&module=opinion-c-col-right-region&region=opinion-c-col-right-region&WT.nav=opinion-c-col-right-region&_r=0
14 The Guardian. HTML source [internet]. Available: https://www.theguardian.com/world/2017/may/09/moon-jae-in-the-south-korean-pragmatist-who-would-be-presidentc
15 Byoung Hak, Lee. Analysis of Korean internet journalism HTML specification and quality of semantics: research result [internet]. Available: https://docs.google.com/spreadsheets/d/1BE7ZMnzVoLkkDF82MrVOxqj7fbFiGTm2HeiUGgF8gOs/edit#gid=0