http://dx.doi.org/10.9708/jksci.2017.22.12.035

ExoTime: Temporal Information Extraction from Korean Texts Using Knowledge Base  

Jeong, Young-Seob (Dept. of Big Data Engineering, Soonchunhyang University)
Lim, Chae-Gyun (School of Computing, KAIST)
Choi, Ho-Jin (School of Computing, KAIST)
Abstract
Extracting temporal information from documents is becoming more important because it can be used in various applications such as Question-Answering (QA) systems, recommendation systems, and Information Retrieval (IR) systems. Most previous studies focus only on English documents, and their methods are not directly applicable to other languages because of inherent differences between languages. In this paper, we propose a new system, named ExoTime, designed to extract temporal information from Korean documents. ExoTime adopts an external Knowledge Base (KB) to achieve better prediction performance, and it also applies a bagging method to temporal relation prediction. We show the effectiveness of the proposed approaches through empirical results on the Korean TimeBank. The ExoTime system works as a part of ExoBrain, an artificial intelligence QA system.
Keywords
temporal information extraction; temporal expression; temporal relation; Korean TimeBank