Browse > Article
http://dx.doi.org/10.3745/KTSDE.2014.3.7.285

Named Entity Recognition and Dictionary Construction for Korean Title: Books, Movies, Music and TV Programs  

Park, Yongmin (충북대학교 디지털정보융합학과)
Lee, Jae Sung (충북대학교 소프트웨어학과)
Publication Information
KIPS Transactions on Software and Data Engineering / v.3, no.7, 2014 , pp. 285-292 More about this Journal
Abstract
A named entity recognition method is used to improve the performance of information retrieval systems, question answering systems, machine translation systems and so on. The targets of the named entity recognition are usually PLOs (persons, locations and organizations). They are usually proper nouns or unregistered words, and traditional named entity recognizers use these characteristics to find out named entity candidates. The titles of books, movies and TV programs have different characteristics than PLO entities. They are sometimes multiple phrases, one sentence, or special characters. This makes it difficult to find the named entity candidates. In this paper we propose a method to quickly extract title named entities from news articles and automatically build a named entity dictionary for the titles. For the candidates identification, the word phrases enclosed with special symbols in a sentence are firstly extracted, and then verified by the SVM with using feature words and their distances. For the classification of the extracted title candidates, SVM is used with the mutual information of word contexts.
Keywords
Named Entity Recognition; Title Named Entity; Dictionary Construction; SVM;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Seong-Won Kim, Dong-Yul Ra, "Korean Named Entity Recognition Using Two-level Maximum Entropy Model,"Proc. of the KIISE Symosium, Vol.2, No.1, pp.81-86, 2008.
2 Changki Lee, Myung-Gil Jang, "Named Entity Recognition with Structural SVMs and Pegasos algorithm," Proc. of KSCS Congnitive Science, Vol.21, No.4, pp.655-667, 2010.   과학기술학회마을   DOI
3 Joo-Young Lee, Young-In Song, Hae-Chang Rim, "Title Named Entity Recognition based on Automatically Constructed Context Patterns and Entity Dictionary," Proc. of the KIISE Conference, The 16th Annual Conference on Human & Cognitive Language Technology, pp.40-45, 2004.   과학기술학회마을
4 Black, W., F. Rinaldi and D. Mowatt, "Facile: Description Of The Ne System Used For Muc-7," in Proceedings of the 7th Message Understanding Conference, 1998.
5 Chen H., Y. Ding, S. Tsai and G. Bian, "Description of the NTU System Used for MET2," in Proceedings of 7th Message Understanding Conference, 1998.
6 Aberdeen, J., J. D. Burger, D. S. Day, L. Hirschman, P. Robinson and M. B. Vilain, "MITRE : Description Of The Alembic System Used For MUC-6," in Proceedings of 6th Message Understanding Conference, pp.141-155, 1995.
7 Kyung Hee Lee, Ju Ho Lee, Myung Seok Choi, Gil Chang Kim, "Study on Named Entity Recognition in Korean Text," Proc. of the KIISE Conference, The 12th Annual Conference on Human & Cognitive Language Technology, pp.292-299, 2000.
8 Borthwick, A., J. Sterling, E. Agichtein and R. Grishman, "NYU : Description of the MENE Named Entity System as Used in MUC-7," in Proceedings of 7th Message Understanding Conference, 1998.
9 Merchant, R. and M. E. Okurowski, "The multilingual entity task (MET) overview," in Proceeding TIPSTER'96 Proceedings of a workshop on held at Vienna, pp.445-447, 1996.
10 Sekine, S. and Y. Eriguchi, "Japanese named entity extraction evaluation : analysis of results," in Proceeding COLING'00 Proceedings of the 18th conference on Computational linguistics - Vol.2, pp.1106-1110, 2000.
11 Yi-Gyu Hwang, Hyun-Sook Lee, Eui-Sok Chung, Bo-Hyun Yun, Sang-Kyu Park, "Korean Named Entity Recognition Based on Supervised Learning Using Named Entity Construction Principles," Proc. of the KIISE Conference, The 14th Annual Conference on Human & Cognitive Language Technology, pp.111-117, 2002.   과학기술학회마을
12 Hae-Suk Jang, Kyu-Cheol Jung, Jin Kwan Lee, Kihong Park, "Recognition of Korean Place Names on the Internet by Using the Rules of Dictionary Use," Proc. of the KSII Fall Conference, Vol.6, No.1, pp.397-400, 2005.
13 Yi-Gyu Hwang, Bo-Hyun Yun, "HMM-based Korean Named Entity Recognition," Proc. of the KIPS Transaction Vol.10(B), No.2, pp.229-236, 2003.   과학기술학회마을   DOI
14 Changki Lee, Yi-Gyu Hwang, Hyo-Jung Oh, Soojung Lim, Jeong Heo, Chung-Hee Lee, Hyeon-Jin Kim, Ji-Hyun Wang, Myung-Gil Jang, "Fine-Grained Named Entity Recognition using Conditional Random Fields for Question Answering," Proc. of the KIISE Conference, The 18th Annual Conference on Human & Cognitive Language Technology, pp.268-272, 2006.   과학기술학회마을
15 Young-Min Park, Sang-woo Kang, Byoung-Kyu Yoo, Jung-Yun Seo, "Title Named Entity Recognition using Wikipedia and Making Acronym," Proc. of the KIISE Korea Computer Congress, pp.637-639, 2013.
16 Vapnik, V. N., The nature of statistical learning theory, Springer, 1995.
17 Dumais, S., J. Platt and D. Heckerman, "Inductive Learning Algorithms and Representations for Text Categorization," in Proceeding of ACM-CIKM '98, pp.148-155, 1998.
18 Lai, A., "Movie Title Recognition in E-Mail," Stanford University Natural Language Processing, CS224N Final Project, 2009.
19 Crammer, K., Y. Singer, "On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines," Journal of Machine Learning Research 2, pp.265-292, 2001.
20 Peng H., F. Long and C. Ding, "Feature Selection Based on Mutual Information: Criteria of Max- Dependency, Max- Relevance, and Min-Redundancy," Pattern Analysis and Machine Intelligence, IEEE Transactions on Vol.27, Issue 8, pp.1226-1238, 2005.   DOI   ScienceOn