Browse > Article
http://dx.doi.org/10.5865/IJKCT.2012.2.2.017

Method of Improving Personal Name Search in Academic Information Service  

Han, Heejun (NTIS Center, Korea Institute of Science and Technology Information)
Lee, Seok-Hyoung (Department of Overseas Information, Korea Institute of Science and Technology Information, Department of Library and Information Science, Konkuk University)
Publication Information
International Journal of Knowledge Content Development & Technology / v.2, no.2, 2012 , pp. 17-29 More about this Journal
Abstract
All academic information on the web or elsewhere has its creator, that is, a subject who has created the information. The subject can be an individual, a group, or an institution, and can be a nation depending on the nature of the relevant information. Most information is composed of a title, an author, and contents. An essay which is under the academic information category has metadata including a title, an author, keyword, abstract, data about publication, place of publication, ISSN, and the like. A patent has metadata including the title, an applicant, an inventor, an attorney, IPC, number of application, and claims of the invention. Most web-based academic information services enable users to search the information by processing the meta-information. An important element is to search information by using the author field which corresponds to a personal name. This study suggests a method of efficient indexing and using the adjacent operation result ranking algorithm to which phrase search-based boosting elements are applied, and thus improving the accuracy of the search results of personal names. It also describes a method for providing the results of searching co-authors and related researchers in searching personal names. This method can be effectively applied to providing accurate and additional search results in the academic information services.
Keywords
Personal Name Search; Information Retrieval; NDSL; Indexing;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Winkler, W. E. (2006). Overview of Record Linkage and Current Research Directions. Washington, DC 20233, U.S : Statistical Research Division, U.S. Census Bureau.
2 Yang, K. H., Peng, H. T., Jiang, J. Y., Lee, H. M., & Ho, J. M. (2008). Author name disambiguation for citations using topic and web correlation. European Conference on Digital Libraries - ECDL, 2008, 185-196.
3 Culotta, A., Kanani, P., Hall, R., Wick, M., & McCallum, A. (2007). Author disambiguation using error-driven machine learning with a ranking loss function. Workshop on Information Integration on the Web - WIIW, 2006, 32-37.
4 Guha, R. V., & Garg, A. (2004). Disambiguating people in search. World Wide Web Conference Series - WWW, 2004.
5 Kalashnikov, D. V., Mehrotra, S., Chen, Z., Nuray-Turan, R., & Ashish, N. (2007). Disambiguation algorithm for people search on the web. International Conference on Data Engineering - ICDE, 2007, 1258-1260.
6 Kanani, P., McCallum, A., & Pal, C. (2007). Improving author coreference by resource-bounded information gathering from the web. International Joint Conference on Artificial Intelligence, 2007, 429-434.
7 Christen, P. (2006). A comparison of personal name matching: techniques and practical issues. IEEE International Conference on Data Mining - ICDM, 2006, 290-294.
8 Pfeifer, U., Poersch, T., & Fuhr, N. (1996). Retrieval effectiveness of proper name search methods. Information Processing & Management, 32(6), 667-679.   DOI   ScienceOn
9 Piskorski, J., Wieloch, K., & Sydow, M. (2009). On knowledge-poor methods for person name matching and lemmatization for highly inflectional languages. Information retrieval, 12(3), 275-299.   DOI   ScienceOn
10 Schutze, H. (1998). Automatic word sense discrimination. Computational Linguistics, 24(1), 97-123.
11 Vu, Q. M., Masada, T., Takasu, A., & Adachi, J. (2007). Disambiguation of people in web search using a knowledge base. IEEE International Conference on Research, Innovation and Vision for the Future, 2007, 185-191.
12 Artiles, J., Gonzalo, J., & Verdejo, F. (2005). A testbed for people searching strategies in the WWW. Research and Development in Information Retrieval - SIGIR, 2005, 569-570.