Browse > Article
http://dx.doi.org/10.3743/KOSIM.2019.36.4.007

An Experimental Approach of Keyword Extraction in Korean-Chinese Text  

Jeong, Yoo Kyung (연세대학교 근대한국학연구소)
Ban, Jae-yu (연세대학교 근대한국학연구소)
Publication Information
Journal of the Korean Society for information Management / v.36, no.4, 2019 , pp. 7-19 More about this Journal
Abstract
The aim of this study is to develop a technique for keyword extraction in Korean-Chinese text in the modern period. We considered a Korean morphological analyzer and a particle in classical Chinese as a possible method for this study. We applied our method to the journal "Sisachongbo," employing proper-noun dictionaries and a list of stop words to extract index terms. The results show that our system achieved better performance than a Chinese morphological analyzer in terms of recall and precision. This study is the first research to develop an automatic indexing system in the traditional Korean-Chinese mixed text.
Keywords
corpus; automatic indexing; traditional Chinese morphological analysis; Korean-Chinese character style; Sisachongbo;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Kang, Seung-Sik, Kwon, Hyuk-Il, & Kim, Dong-Ryul (1995). The role of morphological analysis for korean automatic indexing. Proceedings of the 22th KISS Conference, 22(1), 929-932.
2 Kim, Chang-Sop (2001). Word-formation in sino-korean and a constraint of the grammar of native korean. Journal of Korean Linguistics, 37, 177-195.
3 Kim, Chang-Sop (2013). The tensification of initial consonant of the sino-korean suffix -jok(的) and the two-character word and three-character word theory. Journal of Korean Linguistics, 68, 167-188. https://doi.org/10.15811/jkl.2013..68.006   DOI
4 Kim, Pan-Jun (2006). Automatic indexing with controlled vocabulary using a descriptor profile. Proceedings of Korea Society for Information Management Conference, 2006, 153-160.
5 Park, Ju-Hee, & Myaeng, Seong-Hyeon (2017). A method for establishing korean multi-word concept boundary harnessing dictionaries and sentence segmentation for constructing concept graph. Proceedings of the 44th KISS Conference, 44(1), 651-653.
6 Lee, Sang-Bok (2012). A study on morphological analysis of Korean (1) - For proper nouns. Korean Languages, 50, 1-35.
7 Lee, Ik-Seop (1969). Chinese non 1syllable-word, a collection of scholarly papers in celebration of the 60th anniversary of Dr. Jae Won Kim, Eulyoo Publishing, 837-844.
8 Chung, Yeong-Mi (2012). Research in information retrieval. Seoul: Yonsei University Press.
9 Ju, Ji-Yeon (2015). A research on the distribution of sino-korean morphemes. Journal of Korean Linguistics, 76, 39-66.   DOI
10 Ju, Ji-Yeon (2017). To establish the sino-korean morpheme concept (1) - Focusing on the morphological status of 2syllable-word component. The Society for Korean Language & Literary Research, 45(1), 67-98.
11 Hong, Seong-Hyeok, Kim, Cheol-Su, & Lee, Yong-Seok (1996). Construction of electronic dictionary for morphological analysis in hangul - Hanja mixed sentences. Proceedings of the 23th KISS Conference, 22(1), 23(2A), 541-544.
12 Kang, Seung-Sik (2002). Korean morphological analysis and information retrieval. Seoul: Hongrung Publishing Company.