Browse > Article
http://dx.doi.org/10.3745/KTSDE.2019.8.4.163

An LSTM Method for Natural Pronunciation Expression of Foreign Words in Sentences  

Kim, Sungdon (홍익대학교 정보컴퓨터학부)
Jung, Jaehee (홍익대학교 교양학부)
Publication Information
KIPS Transactions on Software and Data Engineering / v.8, no.4, 2019 , pp. 163-170 More about this Journal
Abstract
Korea language has postpositions such as eul, reul, yi, ga, wa, and gwa, which are attached to nouns and add meaning to the sentence. When foreign notations or abbreviations are included in sentences, the appropriate postposition for the pronunciation of the foreign words may not be used. Sometimes, for natural expression of the sentence, two postpositions are used with one in parentheses as in "eul(reul)" so that both postpositions can be acceptable. This study finds examples of using unnatural postpositions when foreign words are included in Korean sentences and proposes a method for using natural postpositions by learning the final consonant pronunciation of nouns. The proposed method uses a recurrent neural network model to naturally express postpositions connected to foreign words. Furthermore, the proposed method is proven by learning and testing with the proposed method. It will be useful for composing perfect sentences for machine translation by using natural postpositions for English abbreviations or new foreign words included in Korean sentences in the future.
Keywords
Postposition; LSTM; Dropout; Overfitting; Final Consonant Pronunciation of Nouns;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Songyi Lee, "A Study on Perception of English-Transliteration Words in Newspaper Articles," Studies in Linguistics, Vol.46, No.1, pp.313-333, 2018.   DOI
2 Google Translation, [Online]. Available: https://translate.google.com
3 Naver Translation, [Online]. Available: https://papogo.naver.com
4 kakao Translation, [Online]. Available: https://translate.kakao.com
5 Lee, Donghyun, Lim, Minkyu, Park, Hosung, and Kim, Ji-Hwan, "LSTM RNN-based Korean Speech Recognition System Using CTC," Journal of Digital Contents Society, Vol.18, No.1, pp.93-99, 2017.   DOI
6 Goldberg, Yoav, "A Primer on Neural Network Models for Natural Language Processing," Journal of Artificial Intelligence Research, Vol.57, No.1, pp.345-420, 2016.   DOI
7 Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, et al., "Google's neural machine translation system: Bridging the gap between human and machine translation," arXiv preprint arXiv:1609.08144, 2016.
8 English-Korean Transliteration [Internet], https://github.com/muik/transliteration
9 WordNet: A Lexical Database for English [Internet], https://wordnet.princeton.edu/
10 Edward Loper and Steven Bird, "NLTK: the Natural Language Toolkit," ETMTNLP '02 Proceedings of the ACL-02 Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics, Vol. 1, pp.63-70.
11 TensorFlow Release [Internet], https://www.tensorflow.org/, Retrieved 14 November 2018.
12 Theano Release [Internet], http://www.deeplearning.net/software/theano/, Retrieved 17 September 2018.
13 R. Collobert, S. Bengio, and J. Marithoz, "Torch: a modular machine learning software library," Technical Report IDIAPRR02-46, IDIAP, 2002.
14 Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama and Darrell, Trevor, "Caffe: Convolutional Architecture for Fast Feature Embedding," 2014.
15 Víctor Martínez-Cagigal, ROC Curve [Internet], (https://www.mathworks.com/matlabcentral/fileexchange/52442-roc-curve), MATLAB Central File Exchange. Retrieved February 7, 2019.