Browse > Article
http://dx.doi.org/10.5391/JKIIS.2015.25.2.111

An Effect of Semantic Relatedness on Entity Disambiguation: Using Korean Wikipedia  

Kang, In-Su (School of Computer Science & Engineering, College of Engineering, Kyungsung University)
Publication Information
Journal of the Korean Institute of Intelligent Systems / v.25, no.2, 2015 , pp. 111-118 More about this Journal
Abstract
Entity linking is to link entity's name mentions occurring in text to corresponding entities within knowledge bases. Since the same entity mention may refer to different entities according to their context, entity linking needs to deal with entity disambiguation. Most recent works on entity disambiguation focus on semantic relatedness between entities and attempt to integrate semantic relatedness with entity prior probabilities and term co-occurrence. To the best of my knowledge, however, it is hard to find studies that analyze and present the pure effects of semantic relatedness on entity disambiguation. From the experimentation on Korean Wikipedia data set, this article empirically evaluates entity disambiguation approaches using semantic relatedness in terms of the following aspects: (1) the difference among semantic relatedness measures such as NGD, PMI, Jaccard, Dice, Simpson, (2) the influence of ambiguities in co-occurring entity mentions' set, and (3) the difference between individual and collective disambiguation approaches.
Keywords
Entity Linking; Entity Disambiguation; Semantic Relatedness; Wikipedia;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 J. Gracia, R. Trillo, M. Espinoza, E. Mena, "Querying the web: a multiontology disambiguation method," Proceedings of the 6th International Conference on Web Engineering, 2006.
2 K. W. Church, P. Hanks, "Word association norms, mutual information, and lexicography," Computational Linguistics, vol. 16, no. 1, pp. 22-29, 1990.
3 P. Jaccard, "Nouvelles recherches sur la distribution florale," Bull. Soc. Vaud. Sci. Nat., vol. 44, pp. 223-270, 1908.
4 G. G. Simpson, "Notes on the measurement of faunal resemblance," American Journal of Science, vol. 258a, pp. 300-311, 1960.
5 L. R. Dice, "Measures of the amount of ecologic association between species," Ecology, vol. 26, pp. 297-302, 1945.   DOI
6 S. Brin, L. Page, "The anatomy of a large-scale hypertextual Web search engine," Computer Networks, vol. 30, pp. 107-117, 1998.
7 R. Navigli, "Word sense disambiguation: a survey," ACM Computing Surveys, vol. 41, no. 2, 2009.
8 X. Han, L. Sun, J. Zhao, "Collective entity linking in web text: a graph-based method," Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011
9 O. Medelyan, I. H. Witten, D. Milne, "Topic indexing with Wikipedia," Proceedings of the Wikipedia and AI workshop at AAAI-08, 2008.
10 D. N. Milne, I. H. Witten, "Learning to link with Wikipedia," Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008.
11 P. Ferragina, U. Scaiella, "TAGME: on-the-fly annotation of short text fragments (by Wikipedia entities)," Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010.
12 S. Kulkarni, A. Singh, G. Ramakrishnan, S. Chakrabarti, "Collective annotation of Wikipedia entities in web text," Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2009.
13 J. Hoffart, M. A. Yosef, I. Bordino, H. Furstenau, M. Pinkal, M. Spaniol, B. Taneva, S. Thater, G. Weikum, "Robust disambiguation of named entities in text," Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011.
14 A. Islam, E. E. Milios, V. Keselj, "Comparing word relatedness measures based on Google n-grams," Proceedings of COLING 2012: Posters, 2012.
15 L. Ratinov, D. Roth, D. Downey, M. Anderson, "Local and global algorithms for disambiguation to Wikipedia," Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011.
16 R. Mihalcea, A. Csomai, "Wikify!: linking documents to encyclopedic knowledge," Proceedings of the 16th ACM Conference on Information and Knowledge Management, 2007.
17 D. Bollegala, Y. Matsuo, M. Ishizuka, "Measuring semantic similarity between words using web search engines," Proceedings of the 16th International Conference on World Wide Web, 2007.
18 C. Li, A. Sun, A. Datta, "A generalized method for word sense disambiguation based on Wikipedia," Proceedings of the 33rd European Conference on IR Research, 2011.
19 I. Kang, S. Kang, "A single-step machine learning approach to link detection in Wikipedia: NTCIR Crosslink-2 Experiments at KSLP," Proceedings of the 10th NTCIR Conference, 2013.
20 S. Kang, "English-Korean cross-lingual link discovery using link probability and named entity recognition", Journal of The Korean Institute of Intelligent Systems, vol. 23, no. 3, pp. 191-195, 2013.   DOI   ScienceOn
21 S. Hassan, R. Mihalcea, "Semantic relatedness using salient semantic analysis," Proceedings of the 25th AAAI Conference on Artificial Intelligence, 2011.
22 R. Cilibrasi, P. M. B. Vitányi, "The Google similarity distance", Available: http://arxiv.org/pdf/cs/0412098.pdf, 2004, [Accessed: October 29, 2014]