Search | Korea Science

Statistical Analysis of Korean Phonological Variations Using a Grapheme-to-phoneme System (발음열 자동 생성기를 이용한 한국어 음운 변화 현상의 통계적 분석)

이경님;정민화
- The Journal of the Acoustical Society of Korea
- /
- v.21 no.7
- /
- pp.656-664
- /
- 2002
We present a statistical analysis of Korean phonological variations using a Grapheme-to-Phoneme (GPT) system. The GTP system used for experiments generates pronunciation variants by applying rules modeling obligatory and optional phonemic changes and allophonic changes. These rules are derived form morphophonological analysis and government standard pronunciation rules. The GTP system is optimized for continuous speech recognition by generating phonetic transcriptions for training and constructing a pronunciation dictionary for recognition. In this paper, we describe Korean phonological variations by analyzing the statistics of phonemic change rule applications for the 60,000 sentences in the Samsung PBS Speech DB. Our results show that the most frequently happening obligatory phonemic variations are in the order of liaison, tensification, aspirationalization, and nasalization of obstruent, and that the most frequently happening optional phonemic variations are in the order of initial consonant h-deletion, insertion of final consonant with the same place of articulation as the next consonants, and deletion of final consonant with the same place of articulation as the next consonant's, These statistics can be used for improving the performance of speech recognition systems.
PDF KSCI

Perceptual-phonemic Contrasts of Single-word Intelligibility for Testing Korean Dysarthric Speech (뇌성마비로 인한 마비말장애의 음소대조 낱말명료도와 문장명료도)

김수진
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.8
- /
- pp.694-702
- /
- 2003
The word intelligibility test for dysarthric speakers was designed to examine phonetic contrasts that are likely (1) to be sensitive to intelligibility impairment and (2) to contribute significantly to speech intelligibility. These phonetically contrasting word pairs were tested and proved to be reliable and to be valid, The results showed that in Korean dysarthric patients, the percentage of error in final position contrast was higher than in any other position. Unlike the results of previous studies, the initial-position contrasts were crucial in predicting the overall intelligibility among Korean patients.
PDF KSCI

Prosodic Contour Generation for Korean Text-To-Speech System Using Artificial Neural Networks

Lim, Un-Cheon
- The Journal of the Acoustical Society of Korea
- /
- v.28 no.2E
- /
- pp.43-50
- /
- 2009
To get more natural synthetic speech generated by a Korean TTS (Text-To-Speech) system, we have to know all the possible prosodic rules in Korean spoken language. We should find out these rules from linguistic, phonetic information or from real speech. In general, all of these rules should be integrated into a prosody-generation algorithm in a TTS system. But this algorithm cannot cover up all the possible prosodic rules in a language and it is not perfect, so the naturalness of synthesized speech cannot be as good as we expect. ANNs (Artificial Neural Networks) can be trained to learn the prosodic rules in Korean spoken language. To train and test ANNs, we need to prepare the prosodic patterns of all the phonemic segments in a prosodic corpus. A prosodic corpus will include meaningful sentences to represent all the possible prosodic rules. Sentences in the corpus were made by picking up a series of words from the list of PB (phonetically Balanced) isolated words. These sentences in the corpus were read by speakers, recorded, and collected as a speech database. By analyzing recorded real speech, we can extract prosodic pattern about each phoneme, and assign them as target and test patterns for ANNs. ANNs can learn the prosody from natural speech and generate prosodic patterns of the central phonemic segment in phoneme strings as output response of ANNs when phoneme strings of a sentence are given to ANNs as input stimuli.
PDF KSCI

Aerodynamic Characteristics of Korean Bilabial Stop Consonant as a Function of Phonemic Position in a Syllable (음절내 음소 출현 위치에 따른 한국어 양순 파열음의 공기역학적인 특징)

Park, Sang-Hee;Jeong, Haeng-Im;Jeong, Ok-Ran;Seok, Dong-Il
- Speech Sciences
- /
- v.9 no.4
- /
- pp.59-75
- /
- 2002
Aerodynamic analysis study was performed on 14 normal subjects (2 males, 12 females) with nonsense syllables composed of Korean bilabial stops (/p, p', $p^{h}$) and their preceding and/or following vowels, /i, a, u/. That is, [pi, p'i, $p^{h}i$, pa, p'a, $p^{h}a$, pu, p'u, $p^{h}u$, ipi, apa, upu, $ip^{h}i$, $ap^{h}a$, $up^{h}u$, ip'i, ap'a, up'u]. All measures were taken and analysed using Aerophone II voice function analyzer and included peak air pressure, mean air pressure, maximum flow rate, volume, mean SPL and phonatory SPL. A t-test and one-way ANOVA were employed for analysis. A post-hoc analysis was performed with Scheffe and Bonferroni. The results were as follows: First, MSPL. and MAP of /p, p', $p^{h}$/ were significantly different in different positions (initial and medial position). In addition, different vowel environment also produced significantly different aerodynamic characteristics those consonants. Especially the lax consonant /p/ was significantly different /i, a, u/ vowel environments. The tense consonant /p'/ was significantly different only /i/ vowel environment.
PDF

The Vowel System of American English and Its Regional Variation (미국 영어 모음 체계의 몇 가지 지역 방언적 차이)

Oh, Eun-Jin
- Speech Sciences
- /
- v.13 no.4
- /
- pp.69-87
- /
- 2006
This study aims to describe the vowel system of present-day American English and to discuss some of its phonetic variations due to regional differences. Fifteen speakers of American English from various regions of the United States produced the monophthongs of English. The vowel duration and the frequencies of the first and the second formant were measured. The results indicate that the distinction between the vowels [c] and [a] has been merged in most parts of the U.S. except in some speakers from eastern and southeastern parts of the U.S., resulting in the general loss of phonemic distinction between the vowels. The phonemic merger of the two vowels can be interpreted as the result of the relatively small functional load of the [c]-[a] contrast, and the smaller back vowel space in comparison to the front vowel space. The study also shows that the F2 frequencies of the high back vowel [u] were extremely high in most of the speakers from the eastern region of the U.S., resulting in the overall reduction of their acoustic space for high vowels. From the viewpoint of the Adaptive Dispersion Theory proposed by Liljencrants & Lindblom (1972) and Lindblom (1986), the high back vowel [u] appeared to have been fronted in order to satisfy the economy of articulatory gesture to some extent without blurring any contrast between [i] and [u] in the high vowel region.
PDF

A Neural Network Based Korean Segmental Duration Modeling Using Tonal Information of Phonemes (음소별 성조 정보를 이용한 신경망 기반의 한국어 음소 지속시간 모델링)

김은경;이상호;오영환
- The Journal of the Acoustical Society of Korea
- /
- v.18 no.6
- /
- pp.84-88
- /
- 1999
The accurate estimation of segmental duration is crucial for natural-sounding text-to-speech synthesis. For predicting Korean segmental durations, conventional methods utilized phonemic context, part-of-speech context and locational information in prosodic phrase. In this paper, the tonal information of phonemes is employed for more accurate prediction. After defining two non-boundary tones and six boundary tones, we annotated the tonal label on each syllable of 400 sentences. To predict segmental duration using tonal information, we constructed neural networks with a real-valued output node predicting phonemic duration and trained them by backpropagation algorithm. Experimental results showed that the proposed features are effective for predicting Korean segmental durations, and we got 0.863 correlation coefficient of the observed durations and predicted ones.
PDF

An Acoustic Study of English Non-Phoneme Schwa and the Korean Full Vowel /e/

Ahn, Soo-Woong
- Speech Sciences
- /
- v.7 no.4
- /
- pp.93-105
- /
- 2000
The English schwa sound has special characteristics which are distinct from other vowels. It is non-phonemic and occurs only in an unstressed syllable. Compared with the English schwa, the Korean /e/ is a full vowel which has phonemic contrast. This paper had three aims. One was to see whether there is any relationship between English full vowels and their reduced vowel schwas. Second was to see whether there is any possible target in the English schwa sounds which are derived from different full vowels. The third was to compare the English non-phoneme vowel schwa and the Korean full vowel /e/ in terms of articulatory positions and duration. The study results showed that there is no relationship between each of the full vowels and its schwa. The schwa tended to converge into a possible target which was F1 456 and F2 1560. The Korean vowel /e/ seemed to have its distinct position speaker-individual which is different from the neutral tongue position. The evidence that the Korean /e/ is a back vowel was supported by the Seoul dialect speaker. In duration, the English schwa was much shorter than the full vowels, but there was no significant difference in length between the Korean /e/ and other Korean vowels.
PDF

GENERATION OF MULTI-SYLLABLE NONSENSE WORDS FOR THE ASSESSMENT OF KOREAN TEXT-TO SPEECH SYSTEM (한국어 문장음성합성 시스템의 평가를 위한 다음절 무의미단어의 생성 및 평가에 관한 연구)

조철우
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1994.06c
- /
- pp.338-341
- /
- 1994
In this paper we propose a method to generate a multisyllable onsense wordest for the purpose of synthetic speech assessment and applies th ewordest to assess one commercial text-to-speech system. Some results about the experiment is suggested and it is verified that the generated nonsense wordset can be used to assess the intelligibility of the synthesizer in phoneme level or in phonemic environmental level. From the experimental results it is verified that such multi-syllable nonsense wordset can be useful for the assessment of synthesized speech.
PDF

A Study on the Classification of the Korean Consonants in the VCV Speech Chain (VCV 연쇄음성상에 존재하는 한국어 자음의 분류에 관한 연구)

최윤석;김기석;김원준;황희영
- The Transactions of the Korean Institute of Electrical Engineers
- /
- v.39 no.6
- /
- pp.607-615
- /
- 1990
In this paper, I propose the experimental models to classify the consonants in the Vowel-Consonant-Vowel (VCV) speech chain into four phonemic groups such as nasals, liquids, plosives and the others. To classify the fuzzy patterns like speech, it is necessary to analyze the distribution of acoustic feature of many training data. The classification rules are maximum 4 th order polynomial functions obtained by regression analysis, contributing collectively the result. The final result shows about 87% success rates with the data spoken by one man.

THE COMPARISON OF LONG VOWELS IN KOREAN AND TURKISH (한국어와 터어키어의 장모음에 관한 연구)

Turker, Ebru
- Proceedings of the KSPS conference
- /
- 1996.10a
- /
- pp.529-533
- /
- 1996
There are some similar phonological properties shared by different languages. The phenomenon of vowel length is just one of them which shows distinctive futures. In some languages long vowels serve to differentiate meanings. In that case the phonological context it creates is important and so it has to be incorporated into the phonemic inventory of the language, otherwise there will be misunderstanding. In this paper I will try to explain the Turkish vowel system as well as the Korean, and then to show how long vowels take their forms in Turkish and Korean.
PDF

Search Result 96, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)