DOI QR코드

DOI QR Code

Stacked Bidirectional LSTM-CRFs를 이용한 한국어 의미역 결정

Korean Semantic Role Labeling using Stacked Bidirectional LSTM-CRFs

  • 투고 : 2016.05.26
  • 심사 : 2016.10.21
  • 발행 : 2017.01.15

초록

의미역 결정 연구에 있어 구문 분석 정보는 술어-논항 사이의 의존 관계를 포함하고 있기 때문에 의미역 결정 성능 향상에 큰 도움이 된다. 그러나 의미역 결정 이전에 구문 분석을 수행해야 하는 비용(overhead)이 발생하게 되고, 구문 분석 단계에서 발생하는 오류를 그대로 답습하는 단점이 있다. 이러한 문제점을 해결하기 위해 본 논문에서는 구문 분석 정보를 제외한 형태소 분석 정보만을 사용하는 End-to-end SRL 방식의 한국어 의미역 결정 시스템을 제안하고, 순차 데이터 모델링에 적합한 LSTM RNN을 확장한 Stacked Bidirectional LSTM-CRFs 모델을 적용해 구문 분석 정보 없이 기존 연구보다 더 높은 성능을 얻을 수 있음을 보인다.

Syntactic information represents the dependency relation between predicates and arguments, and it is helpful for improving the performance of Semantic Role Labeling systems. However, syntax analysis can cause computational overhead and inherit incorrect syntactic information. To solve this problem, we exclude syntactic information and use only morpheme information to construct Semantic Role Labeling systems. In this study, we propose an end-to-end SRL system that only uses morpheme information with Stacked Bidirectional LSTM-CRFs model by extending the LSTM RNN that is suitable for sequence labeling problem. Our experimental results show that our proposed model has better performance, as compare to other models.

키워드

과제정보

연구 과제번호 : (엑소브레인-1세부) 휴먼 지식증강 서비스를 위한 지능진화형 WiseQA 플랫폼 기술 개발

연구 과제 주관 기관 : 강원대학교, 정보통신기술진흥센터

참고문헌

  1. Changki Lee, Soojong Lim, and Hyunki Kim, "Korean Semantic Role Labeling using Structuralsvm," Journal of KIISE, Vol. 42, No. 3, pp. 220-226, 2015. https://doi.org/10.5626/JOK.2015.42.2.220
  2. Jangseong Bae, Changki Lee, and Soojong Lim, "Korean Semantic Role Labeling using Deep Learning," Proc. of the KIISE Korea Computer Congress 2015, pp. 690-692, 2015.
  3. Jangseong Bae, Changki Lee, and Soojong Lim, "Korean Semantic Role Labeling using Backward LSTM CRF," Proc. of 27th Hangul and Korean Information Processing Conference, pp. 194-197, 2015.
  4. Soojong Lim, Hyunki Kim, "A study of Korean Semantic Role Labeling using Word Sense," Proc. of 27th Hangul and Korean Information Processing Conference, pp. 194-197, 2015.
  5. Ronan Collobert, Jason Weston, Leon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa, "Natural Language Processing (almost) from scratch," The Journal of Machine Learning Research, Vol. 12, pp. 2493-2537, 2011.
  6. Jie Zhou and Wei Xu, "End-to-end learning of semantic role labeling using recurrent neural networks," Proc. of the Annual Meeting of the Association for Computational Linguistics, pp. 1127-1137, 2015.
  7. Kaisheng Yao, Baoling Peng, Yu Zhang, Dong Yu, Geoffery Zweig, and Yangyang Shi, "Spoken language understanding using long short-term memory neural networks," In: Spoken Language Technology Workshop(SLT), 2014 IEEE, pp. 189-194, 2014.
  8. Vasin Punyakanok, Dan Roth, and Wen-tau Yih, "The importance of syntactic parsing and inference in semantic role labeling," Computational Linguistics, Vol. 34, Issue 2, pp. 257-287, 2008. https://doi.org/10.1162/coli.2008.34.2.257
  9. Sameer Pradhan, Wayne Ward, Kadri Hacioglu, James H. Martin, and Daniel Jurafsky, "Semantic role labeling using different syntactic views," ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 581- 588, 2005.
  10. Byoung-Soo kim, Yong-Hun Lee, Seung-Hoon Na, Jun-Gi Kim, and Jong-Hyeok Lee, "Bootstrapping for Semantic Role Assignment of Korean Case Marke," Proc. of the KIISE Korea Computer Congress 2006, Vol. 33, No. 1(B), 2006.
  11. Exobrain corpus[Online]. Available: https://astc.etri. re.kr/.
  12. Martha Palmer, Shijong Ryu, Jinyoung Choi, Sinwon Yoon, and Yeongmi Jeon, Korean Propbank [Online]. Available: http://catalog.ldc.upenn.edu/LDC2006T03.
  13. Kim Wansu, Ock CheolYoung, "Korean Semantic Role Labeling using Case Frame and Frequency," Proc. of the KIISE Korea Computer Congress 2015, pp. 651-653, 2015.
  14. Jangseong Bae and Changki Lee, "End-to-end Learning of Korean Semantic Role Labeling Using Bidirectional LSTM CRF," Proc. of the KIISE Korea Computer Congress 2015, pp. 566-568, 2015.
  15. Changki Lee, "Named Entity Recognition using Long Short-Term Memory Based Recurrent Neural Network," Proc. of the KIISE Korea Computer Congress 2015, pp. 645-647, Jun. 2015.
  16. Yoshua Bengio, Réjean Ducharme, Pascal Vincent, and Christian Janvin, "A neural probabilistic language model," The Journal of Machine Learning Research, Vol. 3, pp. 1137-1155, 2003.
  17. Tomas Mikolov, Ilya Sutskever, Kai Chen, Gregory S. Corrado, and Jeffrey Dean, "Distributed Representations of Words and Phrases and their Compositionality," Advances in neural information processing systems, pp. 3111-3119, 2013.
  18. Nitish Srivastava, Geoffrey E Hinton, Alex Krizhevsky, Ilya Sutskever and Ruslan Salakhutdinov, "Dropout: a simple way to prevent neural networks from overfitting," Journal of Machine Learning Research, Vol. 15, No. 1, pp. 1929-1958, 2014.