http://dx.doi.org/10.3745/KIPSTB.2006.13B.2.155

Automatic Korean to English Cross Language Keyword Assignment Using MeSH Thesaurus  

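Lee Jae-Sung (Department of Computer Education, Chungbuk National University)
Kim Mi-Suk (Jungbu Vocational Training School Foundation)
Oh Yong-Soon (Ochang High School)
Lee Young-Sung (Department of Medical Informatics and Management, College of Medicine, Chungbuk National University)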
Abstract
The medical thesaurus MeSH (Medical Subject Headings) has long been used as a controlled vocabulary for indexing English medical papers. In this paper, we propose an automatic cross-language keyword assignment method that assigns English MeSH index terms to the abstract of a Korean medical paper, and we compare its performance with the indexing performance of human indexers and of the papers' authors. Index terms are assigned in three steps: Korean MeSH terms are extracted from the text, these terms are converted into the corresponding English MeSH terms, and the importance of each term is calculated so that the highest-ranked terms can be selected as keywords. As part of this process, we also propose an effective method for handling the spacing variants problem. Experiments showed that the method solved the spacing variants problem and reduced the thesaurus space by about 42%. They also showed that the performance of automatic keyword assignment is much lower than that of human indexers but as good as that of the authors.
Keywords
MeSH; Korean MeSH; Automatic Keyword Assignment; Automatic Indexing; Cross Language; Spacing Variants;
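
The three steps named in the abstract (extract Korean MeSH terms, map them to English MeSH terms, rank them by importance) can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not give the importance formula or the details of the spacing-variant method, so the sketch assumes raw term frequency as the importance score and treats spacing variants by removing all whitespace from both the thesaurus entries and the text. The three dictionary entries and the function names (`strip_spaces`, `build_index`, `extract_terms`, `assign_keywords`) are hypothetical and chosen only for illustration.

```python
from collections import Counter

# Hypothetical toy Korean-to-English dictionary; a real system would load
# the Korean MeSH thesaurus instead. These entries are illustrative only.
KOREAN_MESH = {
    "고혈압": "Hypertension",
    "당뇨병": "Diabetes Mellitus",
    "심근 경색": "Myocardial Infarction",
}


def strip_spaces(s):
    """Remove all whitespace so that spacing variants share one form."""
    return "".join(s.split())


def build_index(thesaurus):
    """Key each entry by its space-stripped form (an assumed normalization)."""
    return {strip_spaces(ko): en for ko, en in thesaurus.items()}


def extract_terms(text, index):
    """Scan the space-stripped text left to right, preferring the longest match."""
    compact = strip_spaces(text)
    max_len = max(len(key) for key in index)
    found = []
    i = 0
    while i < len(compact):
        for n in range(min(max_len, len(compact) - i), 0, -1):
            candidate = compact[i:i + n]
            if candidate in index:
                found.append(index[candidate])  # already the English term
                i += n
                break
        else:
            i += 1  # no thesaurus term starts at this position
    return found


def assign_keywords(text, index, top_k=3):
    """Rank English terms by frequency (assumed importance) and keep the top_k."""
    counts = Counter(extract_terms(text, index))
    return [term for term, _ in counts.most_common(top_k)]


if __name__ == "__main__":
    index = build_index(KOREAN_MESH)
    abstract = (
        "고혈압 환자에서 심근경색 위험이 높다. "
        "고혈압과 당뇨병은 심근 경색의 위험 인자이며, 고혈압 조절이 중요하다."
    )
    print(assign_keywords(abstract, index, top_k=2))
```

In this sketch both "심근경색" and "심근 경색" map to the same English term because matching is done on space-stripped strings. One known weakness of this simplification is that removing spaces can create false matches across word boundaries, which a production system would need to guard against.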