Conditional Probability of a 'Choseong', a 'Jungseong', and a 'Jongseong' Between Syllables in Multi-Syllable Korean Words

한국어 다음절 단어의 초성, 중성, 종성단위의 음절간 조건부 확률

  • Published : 1991.09.01

Abstract

A Korean word is composed of syllables. A Korean syllable is regarded as a random variable according to the probabilistic nature of its occurrence. A Korean syllable is divided into a 'choseong', a 'jungseong', and a 'jongseong', each of which is also regarded as a random variable. The conditional probability of a syllable can serve as an index of the occurrence correlation between syllables in Korean words. Since the number of syllables is enormous, we instead use the conditional probabilities of a 'choseong', a 'jungseong', and a 'jongseong' between syllables as that index. The length distribution of Korean words is computed according to frequency and to kind. From the cumulative frequency of Korean syllables computed from multi-syllable Korean words, all probabilities and conditional probabilities are computed for the three random variables. The conditional probabilities of 'choseong'-'choseong', 'jungseong'-'jungseong', 'jongseong'-'jongseong', and 'jongseong'-'choseong' between adjacent syllables in multi-syllable Korean words are computed.
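
As a rough illustration of the kind of quantity described above, the following sketch estimates the conditional probability of one unit in a syllable given a unit in the preceding syllable, for example 'jongseong'-'choseong'. It is not the paper's procedure: the function names `decompose` and `conditional_probabilities` are hypothetical, the three-word list is a toy stand-in for a real corpus, and the syllable decomposition assumes precomposed Unicode Hangul syllables (the standard arithmetic with 19 initials, 21 medials, and 28 finals including "no final"). Probabilities are simple relative frequencies over adjacent syllable pairs.

```python
from collections import Counter

HANGUL_BASE = 0xAC00          # code point of the first precomposed syllable
N_CHO, N_JUNG, N_JONG = 19, 21, 28  # jongseong index 0 means "no final"

def decompose(syllable):
    """Return (choseong, jungseong, jongseong) indices of one Hangul syllable."""
    offset = ord(syllable) - HANGUL_BASE
    if not 0 <= offset < N_CHO * N_JUNG * N_JONG:
        raise ValueError(f"not a precomposed Hangul syllable: {syllable!r}")
    cho, rest = divmod(offset, N_JUNG * N_JONG)
    jung, jong = divmod(rest, N_JONG)
    return cho, jung, jong

def conditional_probabilities(words, given=2, target=0):
    """Estimate P(target unit of next syllable | given unit of current syllable).

    `given` and `target` select the unit: 0 = choseong, 1 = jungseong,
    2 = jongseong. The defaults give the 'jongseong'-'choseong' case.
    Keys of the result are (given_index, target_index) pairs.
    """
    pair_counts = Counter()
    given_counts = Counter()
    for word in words:
        units = [decompose(s) for s in word]
        # Walk over adjacent syllable pairs within the word.
        for cur, nxt in zip(units, units[1:]):
            pair_counts[(cur[given], nxt[target])] += 1
            given_counts[cur[given]] += 1
    return {pair: n / given_counts[pair[0]] for pair, n in pair_counts.items()}

# Toy input (an assumption for illustration, not data from the paper).
toy_words = ["한국", "한글", "안녕"]
jong_to_cho = conditional_probabilities(toy_words, given=2, target=0)
```

Calling the same function with `given=0, target=0` or `given=1, target=1` gives the 'choseong'-'choseong' and 'jungseong'-'jungseong' cases. Working at the unit level keeps each table small (at most 28 × 28 entries) compared with a table over all syllable pairs, which is the motivation stated in the abstract.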

Keywords