• Title/Abstract/Keywords: word-initial syllable

Search results: 39 items (processing time 0.02 s)

음절을 기반으로한 한국어 음성인식 (Korean Speech Recognition Based on Syllable)

  • 이영호;정홍
    • 전자공학회논문지B
    • /
    • Vol. 31B, No. 1
    • /
    • pp.11-22
    • /
    • 1994
  • For conventional word-based systems, it is very difficult to enlarge the vocabulary. To cope with this problem, we must use more fundamental units of speech, such as syllables and phonemes. Korean speech consists of initial consonants, middle vowels, and final consonants, and has the characteristic that syllables can be obtained from speech easily. In this paper, we present a speech recognition system that takes advantage of the syllable characteristics peculiar to Korean speech. The recognition algorithm is the Time Delay Neural Network. To handle many recognition units, the system consists of separate neural networks for recognizing initial consonants, middle vowels, and final consonants. The system first recognizes initial consonants, middle vowels, and final consonants, and then uses these results to recognize isolated words. In experiments, we obtained recognition rates of 85.12% on 2735 initial-consonant samples, 86.95% on 3110 middle-vowel samples, and 90.58% on 1615 final-consonant samples, and a recognition rate of 71.2% on 250 isolated-word samples.

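
The initial consonant / middle vowel / final consonant structure described in the abstract above is also how precomposed Hangul syllables are laid out in Unicode. The following is a minimal sketch of that standard Unicode arithmetic, not the paper's Time Delay Neural Network recognizer; the names `decompose`, `INITIALS`, `MEDIALS`, and `FINALS` are illustrative assumptions.

```python
# Standard Unicode Hangul syllable arithmetic (text-level illustration only;
# this is NOT the speech recognizer described in the abstract).
HANGUL_BASE = 0xAC00  # code point of the first precomposed syllable, "가"
INITIALS = list("ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ")  # 19 initial consonants
MEDIALS = list("ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ")  # 21 medial vowels
# 28 final slots; the empty string at index 0 means "no final consonant".
FINALS = [""] + list("ㄱㄲㄳㄴㄵㄶㄷㄹㄺㄻㄼㄽㄾㄿㅀㅁㅂㅄㅅㅆㅇㅈㅊㅋㅌㅍㅎ")


def decompose(syllable: str) -> tuple[str, str, str]:
    """Split one precomposed Hangul syllable into (initial, medial, final)."""
    if len(syllable) != 1:
        raise ValueError("expected exactly one character")
    offset = ord(syllable) - HANGUL_BASE
    if not 0 <= offset < len(INITIALS) * len(MEDIALS) * len(FINALS):
        raise ValueError("not a precomposed Hangul syllable")
    # offset = (initial * 21 + medial) * 28 + final
    initial, rest = divmod(offset, len(MEDIALS) * len(FINALS))
    medial, final = divmod(rest, len(FINALS))
    return INITIALS[initial], MEDIALS[medial], FINALS[final]
```

For example, `decompose("한")` returns `("ㅎ", "ㅏ", "ㄴ")`, and a syllable without a final consonant, such as "가", yields an empty string in the third position.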

Acoustic analysis of Korean trisyllabic words produced by English and Korean speakers

  • Lee, Jeong-Hwa;Rhee, Seok-Chae
    • 말소리와 음성과학
    • /
    • Vol. 10, No. 2
    • /
    • pp.1-6
    • /
    • 2018
  • The current study aimed to investigate the transfer of English word stress rules to the production of Korean trisyllabic words by L1 English learners of Korean. It compared English and Korean speakers' productions of seven Korean words from the corpus L2KSC (Rhee et al., 2005). To this end, it analyzed the syllable duration, intensity, and pitch. The results showed that English and Korean speakers' pronunciations differed markedly in duration and intensity. English learners produced word-initial syllables of greater intensity than Korean speakers, while Korean speakers produced word-final syllables of longer duration than English learners. However, these differences between the two speaker groups were not related to the expected L1 transfer. The tonal patterns produced by English and Korean speakers were similar, reflecting L1 English speakers' learning of the L2 Korean prosodic system.

한국어 음절의 표기빈도와 형태소빈도가 단어인지에 미치는 효과 (Effects of orthographic and morphological frequency of a syllable in Korean word recognition)

  • 이광오;배성봉
    • 인지과학
    • /
    • Vol. 20, No. 3
    • /
    • pp.309-333
    • /
    • 2009
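  • We investigated the roles of morpheme processing and character processing in lexical decisions on two-syllable Sino-Korean compound words. In Experiment 1, responses to words showed no effect of morpheme frequency in either word-initial or word-final position, whereas responses to nonwords showed effects of character frequency and of character-morpheme correspondence. Response times were longer for nonwords containing higher-frequency characters and for nonwords whose character-morpheme correspondence was more opaque. Experiment 2 sought to observe directly in words the character-morpheme correspondence effect found in Experiment 1. Lexical decisions on word stimuli were also slower when the character-morpheme correspondence was more opaque, and this tendency was more pronounced for nonword stimuli. These results support the claim that a word with opaque character-morpheme correspondence activates a variety of morphemes, which delays identification of the morpheme and ultimately delays word recognition. Based on the experimental results, we discuss morpheme position effects, the role of character frequency, and related issues in the recognition of Sino-Korean words written in Hangul.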


음성인식에서 문맥의존 음향모델의 성능향상을 위한 유사음소단위에 관한 연구 (A Study on Phoneme Likely Units to Improve the Performance of Context-dependent Acoustic Models in Speech Recognition)

  • 임영춘;오세진;김광동;노덕규;송민규;정현열
    • 한국음향학회지
    • /
    • Vol. 22, No. 5
    • /
    • pp.388-402
    • /
    • 2003
  • In this paper, we carried out word, 4-continuous-digit, continuous speech, and task-independent word recognition experiments to verify the effectiveness of re-defined phoneme-likely units (PLUs) for phonetic-decision-tree-based HM-Net (Hidden Markov Network) context-dependent (CD) acoustic modeling in Korean. In the 48 PLUs, the phonemes /ㅂ/, /ㄷ/, and /ㄱ/ are separated into initial sound, medial vowel, and final consonant, and the consonants /ㄹ/, /ㅈ/, and /ㅎ/ are likewise separated into initial sound and final consonant, according to their position in the syllable, word, and sentence, respectively. We therefore re-define 39 PLUs by unifying the separated initial-sound, medial-vowel, and final-consonant variants of the 48 PLUs into one phoneme each, in order to construct the CD acoustic models effectively. In word recognition experiments with context-independent (CI) acoustic models, the 48 PLUs yielded on average 7.06% higher recognition accuracy than the 39 PLUs. In speaker-independent word recognition experiments with the CD acoustic models, however, the 39 PLUs yielded on average 0.61% higher recognition accuracy than the 48 PLUs. In the 4-continuous-digit recognition experiments, which involve liaison phenomena, the 39 PLUs also yielded on average 6.55% higher recognition accuracy. In continuous speech recognition experiments, the 39 PLUs likewise yielded on average 15.08% higher recognition accuracy than the 48 PLUs. Finally, although both the 48 and the 39 PLUs showed lower recognition accuracy in the task-independent word recognition experiments because of unknown contextual factors, the 39 PLUs yielded on average 1.17% higher recognition accuracy than the 48 PLUs. Through these experiments, we verified the effectiveness of the re-defined 39 PLUs relative to the 48 PLUs for constructing CD acoustic models.

Syllable Structure Constraints and the Perception of Biconsonantal Clusters by Korean EFL Learners

  • Lee, Shinsook
    • 영어영문학
    • /
    • Vol. 55, No. 6
    • /
    • pp.1193-1220
    • /
    • 2009
  • This study examined the impact of sonority profiles, positional differences and L2 proficiency on Korean EFL learners' perception of English biconsonantal clusters, using nonce words. The overall results showed that major predictions of the sonority-based typological markedness on consonant clusters were supported, as obstruent plus sonorant and sonorant plus obstruent sequences were better perceived than obstruent-only or sonorant-only sequences. Yet, some consonant clusters did not show a preference for sonority profiles. Positional effects were also confirmed, as word-initial biconsonantal clusters were better perceived than word-final ones across all the participant groups. Participants' English proficiency also turned out to be important in the perception of consonant clusters, since university students' mean rate of accuracy was highest, followed by that of high school students, which was in turn followed by that of middle school students. Further, the effects of other factors like frequency and stimuli on speech perception were also addressed, along with some implications for future research.

자연스러운 여성 합성음을 위한 한국어의 피치 변화 법칙 (The Rule of Korean Pitch Variation for a Natural Synthetic Female Voice)

  • 김중원;박대덕;김보현;권철홍
    • 한국음향학회지
    • /
    • Vol. 15, No. 6
    • /
    • pp.26-32
    • /
    • 1996
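  • This paper establishes pitch variation rules for a natural synthetic female voice. The basic unit to which the rules apply, the intonational phrase, consists mainly of one or more eojeol (word phrases); a pitch variation contour is formed by connecting the pitch values of its first, second, and last syllables. The pitch values of the first and second syllables are determined by the initial consonant of each syllable, and the pitch value of the last syllable by the type of function word. Between intonational phrases there is either a 'boundary with a pause' or a 'boundary without a pause', and relaxation occurs at boundaries with a pause. These intonational-phrase pitch contours and boundary phenomena together form the pitch pattern of a sentence.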


Articulatory modification of /m/ in the coda and the onset as a function of prosodic boundary strength and focus in Korean

  • 김사향;조태홍
    • 말소리와 음성과학
    • /
    • Vol. 6, No. 4
    • /
    • pp.3-15
    • /
    • 2014
  • An articulatory study (using Electromagnetic Articulography, EMA) was conducted to explore effects of prosodic boundary strength (Intonational Phrase/IP versus Word/Wd), and focus (Focused/accented, Neutral, Unfocused/unaccented) on the kinematic realization of /m/ in the coda (…am#i…) and the onset (…a#mi…) conditions in Korean. (Here # refers to a prosodic boundary such as an IP or a Wd boundary.) Several important points have emerged. First, the boundary effect on /m/s was most robustly observed in the temporal dimension in both the coda (IP-final) and the onset (IP-initial) conditions, generally in line with cross-linguistically observable boundary-related lengthening patterns. Crucially, however, in contrast with boundary-related slowing-down effects that have been observed in English, both the IP-final and IP-initial temporal expansions of Korean /m/s were not accompanied by an articulatory slowing down. They were, if anything, associated with a faster movement in the lip opening (release) phase (into the vowel). This suggests that the mechanisms underlying boundary-related temporal expansions may differ between languages. Second, observed boundary-induced strengthening effects (both spatial and temporal expansions, especially on the IP-initial /m/s) were remarkably similar to prominence (focus)-induced strengthening effects, which is again counter to phrase-initial strengthening patterns observed in English in which boundary effects are dissociated from prominence effects. This suggests that initial syllables in Korean may be a common focus for both boundary and prominence marking. These results, taken together, imply that the boundary-induced strengthening in Korean is different in nature from that in English, each being modulated by the individual language's prosodic system.
Third, the coda and the onset /m/s were found to be produced in a subtly but significantly different way even in a Wd boundary condition, a potentially neutralizing (resyllabification) context. This suggests that although the coda may be phonologically 'resyllabified' into the following syllable in a phrase-medial position, its underlying syllable affiliation is kinematically distinguished from the onset.

Durational Interaction of Stops and Vowels in English and Korean Child-Directed Speech

  • Choi, Han-Sook
    • 말소리와 음성과학
    • /
    • Vol. 4, No. 2
    • /
    • pp.61-70
    • /
    • 2012
  • The current study observes the durational interaction of tautosyllabic consonants and vowels in the word-initial position of English and Korean child-directed speech (CDS). The effect of phonological laryngeal contrasts in stops on the following vowel duration, and the effect of the intrinsic vowel duration on the release duration of preceding stops, in addition to the acoustic realization of the contrastive segments, are explored in different prosodic contexts (phrase-initial/medial, focal accented/non-focused) in the marked speech style of CDS. A trade-off relationship between Voice Onset Time (VOT), as consonant release duration, and voicing phonation time, as vowel duration, reported from adult-to-adult speech, and patterns of durational variability are investigated in CDS of two languages with different linguistic rhythms, under systematically controlled prosodic contexts. Speech data were collected from four native English-speaking mothers and four native Korean-speaking mothers who were talking to their infants at the one-word stage. In addition to the acoustic measurements, the transformed delta measure is employed as a variability index of individual tokens. Results confirm the durational correlation between prevocalic consonants and following vowels. The interaction is revealed in a compensatory pattern such as longer VOTs followed by shorter vowel durations in both languages. An asymmetry is found in CV interaction in that the effect of consonant on vowel duration is greater than the VOT differences induced by the vowel. Prosodic effects are found such that the acoustic difference is enhanced between the contrastive segments under focal accent, supporting the paradigmatic strengthening effect. Positional variation, however, does not show any systematic effects on the variations of the measured acoustic quantities. Overall vowel duration and syllable duration are longer in English tokens but involve less variability across the prosodic variations.
The constancy of syllable duration, therefore, is not found to be more strongly sustained in Korean CDS. The stylistic variation is discussed in relation to the listener under linguistic development in CDS.

한글 자음과 모음결합을 이용한 학습용 퍼즐게임 구현 (Implementation of Learning Puzzle Game by using Combination of Korean Alphabet)

  • 조재영;김윤호
    • 디지털콘텐츠학회 논문지
    • /
    • Vol. 7, No. 4
    • /
    • pp.257-261
    • /
    • 2006
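  • In this paper, we implemented a puzzle game in which Hangul consonants and vowels are first classified separately and then combined in real time to form words. The word combiner was implemented using an editor supported by the API, and a method based on the consonant of the initial component was used for efficient lookup of the combined words. The implemented Hangul combination puzzle game is expected to improve children's word-learning ability and help them become familiar with Hangul.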

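
The abstract above does not give implementation details, so the following is only a sketch of one plausible reading: consonants and vowels are combined into a syllable using standard Unicode Hangul arithmetic, and candidate words are grouped by the initial consonant of their first syllable so that a lookup scans only one group. The names `compose`, `initial_of`, and `build_index`, and the sample word list, are assumptions for illustration and are not taken from the paper.

```python
from collections import defaultdict

HANGUL_BASE = 0xAC00  # code point of the first precomposed syllable, "가"
INITIALS = list("ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ")  # 19 initial consonants
MEDIALS = list("ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ")  # 21 medial vowels
# 28 final slots; the empty string at index 0 means "no final consonant".
FINALS = [""] + list("ㄱㄲㄳㄴㄵㄶㄷㄹㄺㄻㄼㄽㄾㄿㅀㅁㅂㅄㅅㅆㅇㅈㅊㅋㅌㅍㅎ")


def compose(initial: str, medial: str, final: str = "") -> str:
    """Combine jamo into one precomposed syllable (Unicode arithmetic)."""
    i = INITIALS.index(initial)  # raises ValueError for an unknown jamo
    m = MEDIALS.index(medial)
    f = FINALS.index(final)
    return chr(HANGUL_BASE + (i * len(MEDIALS) + m) * len(FINALS) + f)


def initial_of(word: str) -> str:
    """Initial consonant of the first syllable (assumes a Hangul syllable)."""
    offset = ord(word[0]) - HANGUL_BASE
    return INITIALS[offset // (len(MEDIALS) * len(FINALS))]


def build_index(words: list[str]) -> dict[str, list[str]]:
    """Hypothetical lookup table: initial consonant -> words starting with it."""
    index = defaultdict(list)
    for word in words:
        index[initial_of(word)].append(word)
    return dict(index)
```

With the hypothetical word list `["가방", "고기", "나무"]`, `build_index` groups the first two words under "ㄱ" and the third under "ㄴ", so checking whether a player's combined word exists only requires scanning the group for its initial consonant.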

A PHONEMIC ANALYSIS OF THE UNWRITTEN LANGUAGE OF THE PULANG TRIBE

  • Kang, Su-Hee
    • 대한음성학회:학술대회논문집
    • /
    • Proceedings of the 대한음성학회 July 2000 Conference
    • /
    • pp.166-177
    • /
    • 2000
  • The purpose of this study was to create letters for the nonliterate Pulang tribe in Thailand, who immigrated from China. The illiterate Pulang hand down their traditions through primary oral culture; as a result, those traditions cannot be established and kept and may disappear over time. The study is therefore expected to contribute to the fight against illiteracy. The subjects of the research were a minority people living in northern Machan, Chiangrai, Thailand. The study is not only an analysis of the language but also an effort toward the eradication of illiteracy, with research based on linguistics, ethnolinguistics, and primary oral culture. Five Pulang people living in that area were chosen for creating the letters. Using the I.P.A., each word was transcribed after listening to its pronunciation one by one, and this process was repeated several times; material objects and parts of the human body were pointed to in front of the speakers, while other words were conveyed by gesture. For the final description, a number of people were lined up so that the sounds of words, phrases, and sentences could be heard. The first stage was a segmental analysis of Pulang: vocoids, contoids, and diphthongs were described with sample syllables and words. In the second stage, suprasegmentals were studied through the intonation and juncture of words. Pairs of words were compared, and differences in meaning arising from their intonation and juncture were shown. At the end of this part, the juncture within words was described for each case of phonemic or morphophonemic representation. In the third stage, minimal pairs were analyzed for vowels and consonants, and free variation was described on the basis of words. In the last stage, syllable structure was studied in open and closed syllables, and each syllable structure was analyzed with samples.
There were thirty-two phonemes in apong Pulang, as follows: seven vocoids: a, i, e, o, u, æ, and ʌ; one diphthong: wu; and 24 contoids: b, c, d, f, g, h, j, k, k, l, m, n, ŋ, pʰ, p, p, r, s, s, sh, t, t, w, and y. The sounds p, s, d, pʰ, j, and t are frequently used in speech and are unique in triphthong. Moreover, most of the words used initial and final consonant clusters.
