Browse > Article
http://dx.doi.org/10.3745/KIPSTB.2009.16-B.3.225

WordNet-Based Category Utility Approach for Author Name Disambiguation  

Kim, Je-Min (숭실대학교 컴퓨터학과)
Park, Young-Tack (숭실대학교 컴퓨터학과)
Abstract
Author name disambiguation is essential for improving performance of document indexing, retrieval, and web search. Author name disambiguation resolves the conflict when multiple authors share the same name label. This paper introduces a novel approach which exploits ontologies and WordNet-based category utility for author name disambiguation. Our method utilizes author knowledge in the form of populated ontology that uses various types of properties: titles, abstracts and co-authors of papers and authors' affiliation. Author ontology has been constructed in the artificial intelligence and semantic web areas semi-automatically using OWL API and heuristics. Author name disambiguation determines the correct author from various candidate authors in the populated author ontology. Candidate authors are evaluated using proposed WordNet-based category utility to resolve disambiguation. Category utility is a tradeoff between intra-class similarity and inter-class dissimilarity of author instances, where author instances are described in terms of attribute-value pairs. WordNet-based category utility has been proposed to exploit concept information in WordNet for semantic analysis for disambiguation. Experiments using the WordNet-based category utility increase the number of disambiguation by about 10% compared with that of category utility, and increase the overall amount of accuracy by around 98%.
Keywords
Ontology; Metadata; Category Utility; Author Name Disambiguation;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Borislav Popov, Atanas Kiryakov, Angel Kirilov, Dimitar Manov, Damyan Ognyanoff, Miroslav Goranov, 'KIM . Semantic Annotation Platform', Proceeding of the 2nd International Semantic Web Conference, Sanibel Island, Florida, 2003
2 Douglas H. Fisher, 'Knowledge Acquisition Via Incremental Conceptual Clustering', Machine Learning, Vol.2, pp.139-172, 1987   DOI
3 Norberto Fernandez Garcia, Jose Maria Blazquez del Toro, Luis Sanchez Fernandez and Ansgar Bernardi, 'IdentityRank: Named Entity Disambiguation in the Context of the NEWS Project', 4th European Semantic Web Conference, Innsbruck, Austria, 2007   DOI   ScienceOn
4 Hui Han, Lee Giles, Hongyuan Zha, 'Two Supervised Learning Approaches for Name Disambiguation in Author Citations', 4th Joint Conference on Digital Libraries, Tucson, Arizona, USA, 2004   DOI
5 Stephen Dill, Nadav Eiron, David Gibson, Daniel Gruhl, R.Guha, Anant Jhingran, Tapas Kanungo, Sridhar Rajagopalan, Andrew Tomkins, John A. Tomlin, Jason Y. Zien, 'SemTag and Seeker: Bootstrapping the semantic web via automated semantic annotation', 20th World Wide Web conference, Budapest, Hungary, 2003   DOI
6 Thamar Solorio, 'Improvement of Named Entity Tagging by Machine Learning', Technical Report CCC-04-004, Coordinacin de Ciencias Computacionales, 2004
7 Michael Erdmann, Alexander Maedche, Hans-Peter Schnurr, Steffen Staab, 'From Manual to Semi-automatic Semantic Annotation: About Ontology-based Text Annotation Tools', Proceedings of the COLING 2000 Workshop on Semantic Annotation and Intelligent Content, Luxembourg, 2000
8 Alexiei Dingli, Fabio Ciravegna, Yorick Wilks, 'Automatic Semantic Annotation using Unsupervised Information Extraction and Integration', K-CAP 2003 Workshop on Knowledge Markup and Semantic Annotation, 2003
9 Joseph Hassell, Boanerges Aleman-Meza, I.Budak Arpinar, 'Ontology-Driven Automatic Entity Disambiguation in Unstructured Text', 5th International Semantic Web Conference, Athens, GA, USA, 2006
10 Hui Han, Hongyuan Zha, C. Lee Giles, 'Name Disambiguation in Author Citations using a K-way Spectral Clustering Method', 5th Joint Conference on Digital Libraries, Denver, Colorado, USA, 2004   DOI
11 Ziming Zhuang, Rohit Wagle, C. Lee Giles, 'What's There and What's Not? Focused Crawling for Missing Documents in Digital Libraries', 5th Joint Conference on Digital Libraries, Denver, Colorado, USA 2004   DOI
12 WordNet, http://wordnet.princeton.edu/
13 Yiming Yang, and Jan O.Pedersen, 'A comparative study on Feature Selection in Text Categorization', Proceedings of ICML-97, 14th International Conference on Machine Learning, 1997