Search | Korea Science

Ambiguity Types of the Homonymic & Heterographic Units for Improving Korean Voice Recognition System - a Preliminary Research (한국어 음성인식 시스템 향상을 위한 동음이철 단위의 중의성 유형 분류)

Yoon, Ae-Sun;Kang, Mi-Young
- Speech Sciences
- /
- v.15 no.4
- /
- pp.67-81
- /
- 2008
The accuracy rate of P2G (Phoneme-to-Grapheme) is one of the important factors determining the quality of unlimited voice recognition (VR) systems. Few studies were, however, conducted to reduce ambiguities of a phoneme string which can be segmented into a variety of different linguistic units (i.e. morphemes, words, eo-jeols), thus be transformed into more than one grapheme string. This paper is a preliminary research for building a large knowledge base of those homonymic & heterographic units(HHUs), which will provide unlimited Korean VR systems with more accurate P2G information. This paper analyzes 2 main factors generating HHUs: (1) boundary determination of the prosodic unit; (2) its segmentation into linguistic units. In this paper, linguistic characteristics determining variable boundaries of a prosodic unit are investigated, and the ambiguity types of HHUs are classified in accordance with their morphological and syntactic structures as well as with the phonological rules governing them.
PDF

Phonetics and Language as a formal System

Port, Robert F.;Leary, Adam P.
- Lingua Humanitatis
- /
- v.5
- /
- pp.221-264
- /
- 2003
This paper takes issue with the idea of language as a 'serial-time structure' as opposed to the 'real-time event' of speech, an idea entrenched in Chomskyan model of linguistic theory. The discussion centers around the leitmotif question: Is language constructed entirely from a finite set of apriori discrete symbol types, as the 'competence vs performance' dichotomy implies\ulcorner A set of linguistic patterns examined in this study, largely with regard to phonological considerations, points to the evidence to the contrary. That is, while the patterns may be said to be linguistically distinct, they are not discretely, different, i.e. not different enough to be reliably differentiated. It is demonstrated that much of current research in phonology, including the most recent Optimality Theory, is misdirected in that it falsely presupposes a discrete universal phonetic inventory. The main thrust of the present study is that there is no sharp boundary between 'competence' defined as the formal, symbolic, discrete time domain of language and human cognition on the one hand and 'performance' as the continuous, fuzzy, real-time domain of human physiology on the other.
PDF

The Study on the Expential Smoothing Method of the Concatenation Parts in the Speech Waveform (음성 파형분절의 지수함수 스므딩 기법에 관한 연구)

박찬수
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1991.06a
- /
- pp.7-10
- /
- 1991
In a text-to-speech system, sound units (phonemes, words, or phrases, etc.) can be concatenated together to produce required utterance. The quality of the resulting speech is dependent on factors including the phonological/prosodic contour, the quality of basic concatenation units, and how well the units join together. Thus although the quality of each basic sound unit is high, if occur the discontinuity in the concatenation part then the quality of synthesis speech is decrease. To solve this problem, a smoothing operation should be carried out in concatenation parts. But a major problem is that, as yet, no method of parameter smoothing is available for joining the segment together. Thus in this paper, we proposed a new aigorithm that smoothing the unnatural discountinuous parts which can be occured in speech waveform editing. This algorithm used the exponential smoothing method.
PDF

The Smoothing Method of the Concatenation Parts in Speech Waveform by using the Forward/Backward LPC Technique (전, 후방향 LPC법에 의한 음성 파형분절의 연결부분 스므딩법)

이미숙
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1991.06a
- /
- pp.15-20
- /
- 1991
In a text-to-speech system, sound units (e. q., phonemes, words, or phrases) can be concatenated together to produce required utterance. The quality of the resulting speech is dependent on factors including the phonological/prosodic contour, the quality of basic concatenation units, and how well the units join together. Thus although the quality of each basic sound unit is high, if occur the discontinuity in the concatenation part then the quality of synthesis speech is decrease. To solve this problem, a smoothing operation should be carried out in concatenation parts. But a major problem is that, as yet, no method of parameter smoothing is availalbe for joining the segment together.
PDF

Extra Vowel Addition Produced in Korean Students' English Pronunciation of Word-final Stop Consonants (영어 폐쇄자음 발음 뒤에 나타나는 모음추가 현상)

Hwang, Young-Soon
- Speech Sciences
- /
- v.7 no.4
- /
- pp.169-186
- /
- 2000
This paper aims to confirm the mispronunciation of native Korean students due to the phonetic and phonological system differences between English and Korean, and to find the works-to-do by experiment. Many Korean students tend to differentiate the sounds of word-final stop consonants not by vowel duration or the allophones but by the phoneme of the consonant itself. In English, Stop sounds change through the conditions of the aspirated, unaspirated, or unreleased sounds. But in Korean they are not allophones of phonemes but distinct phonemes. Therefore, many Korean students are apt to add an extra vowel sound /i/ after the final stop consonant in the eve form due to both the unperception of the differences between the phonemes and the allophones of stop consonants, and the influence of the Korean sound-sequence relationship. Since the replacement of the allophones and extra vowel addition does not change the meaning, the importance was almost lost. Nevertheless, this kind of study is essential for the precise learning and the use of the English language.
PDF

Acoustic, Intraoral Air Pressure and EMG Studies of Vowel Devoicing in Korean

Kim, Hyun-Gi;Niimi, Sei-Ji
- Speech Sciences
- /
- v.10 no.1
- /
- pp.3-13
- /
- 2003
The devoicing vowel is a phonological process whose contrast in sonority is lost or reduces in a particular phonetic environment. Phonetically, the vocal fold vibration originates from the abduction/adduction of the glottis in relation to supraglottal articulatory movements. The purpose of this study is to investigate Korean vowel devoicing by means of experimental instruments. The interrelated laryngeal adjustments and aerodynamic effects for this voicing can clarify the redundant articulatory gestures relevant to the distinctive feature of sonority. Five test words were selected, being composed of the high vowel /i/, between the fricative and strong aspirated or lenis affricated consonants. The subjects uttered the test words successively at a normal or at a faster speed. The EMG, the sensing tube Gaeltec S7b and the High-Speech Analysis system and MSL II were used in these studies. Acoustically, three different types of speech waveforms and spectrograms were classified, based on the voicing variation. The intraoral air pressure curves showed differences, depending on the voicing variations. The activity patterns of the PCA and the CT for devoicing vowels appeared differently from those showing the partially devoicing vowels and the voicing vowels.
PDF

The design and implementation of Web Component for Korean to Roman transcription (국어 로마자 전사표기 웹 컴포넌트 설계 및 구현)

Kim Hongsop
- Journal of the Korea Society of Computer and Information
- /
- v.9 no.4 s.32
- /
- pp.71-76
- /
- 2004
In this paper, a web-based automatic transcription component is designed and implemented for mechanical conversion of revised Korean-to-Romanization notation rule. Specially. we proposes system architecture and algorithms that transcript Korean to Roman automatically after transliterate Korean to phonetic symbol applying phonological principles. The components operate under the web server's script mechanism. and the dictionary for exceptional usage is designed as an accessorial function supported either operating at web server internally or externally. The overall system architecture is presented by UML. specification and pseudo code. The proposed architecture can be implemented in encapsulated service by object oriented component and that can be easily adapted and modified on the internet environment and this system may have many advantages to improve efficiency, library reuse. extensibility at software development.
PDF

Prosodic Annotation in a Thai Text-to-speech System

Potisuk, Siripong
- Proceedings of the Korean Society for Language and Information Conference
- /
- 2007.11a
- /
- pp.405-414
- /
- 2007
This paper describes a preliminary work on prosody modeling aspect of a text-to-speech system for Thai. Specifically, the model is designed to predict symbolic markers from text (i.e., prosodic phrase boundaries, accent, and intonation boundaries), and then using these markers to generate pitch, intensity, and durational patterns for the synthesis module of the system. In this paper, a novel method for annotating the prosodic structure of Thai sentences based on dependency representation of syntax is presented. The goal of the annotation process is to predict from text the rhythm of the input sentence when spoken according to its intended meaning. The encoding of the prosodic structure is established by minimizing speech disrhythmy while maintaining the congruency with syntax. That is, each word in the sentence is assigned a prosodic feature called strength dynamic which is based on the dependency representation of syntax. The strength dynamics assigned are then used to obtain rhythmic groupings in terms of a phonological unit called foot. Finally, the foot structure is used to predict the durational pattern of the input sentence. The aforementioned process has been tested on a set of ambiguous sentences, which represents various structural ambiguities involving five types of compounds in Thai.
PDF

Fiberscopic and Electromyograpic Study on Laryngeal Adjustments for Syllable-final Applosives in Korean (한국어의 음절말 내파음의 후두조절 -화이비스코프 및 근전도에 의한 관찰-)

Park, Hea-Suk
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
- /
- v.16 no.1
- /
- pp.53-67
- /
- 2005
It is known that Korean stop consonants in syllable-initial position are of three types : lax, aspirated and forced (or unaspirated). In syllable-final position, however, these three different types are merged to a single type with the same place of articulation, although the original three-way distinction is preserved in Korean orthographic (Hangul) system. Thus the syllable-final stops are phonetically realized as voiceless "applosives" which are characterized by the absence of oral release. The aim of the present study is to investigate the laryngeal adjustments for these syllable-final stops in various phonological conditions by using fiberscope, and, is to further investigate electromyographically the laryngeal adjustments for Korean stops both in the syllable-initial and final positions in various phonological conditions. The results can be summarized as follows : 1. In the case of syllable-initial stops, the glottal widths in each three types of the Korean stops during the articulatory closure are clearly different. And the pattern of thyroarytenoid(VOC) activity appeared to characterize the three different types of Korean stops. 2. The basic laryngeal feature of the Korean syllable-final applosives is characterized by a small degree of glottal opening which begins at or slightly after the oral closure. 3. In the case, syllable-final stop followed by the copula "ita", the syllable- final stop is pronounced as the stop consonant at the initial position of the following syllable containing the vowel[i], the underlying features of three-way distinction for the stops in the Korean orthographic(Hangul) system being manifested at the laryngeal adjustment. 4. In the case of the final applosives followed by the initial stops and fricatives, the laryngeal feature of the final applosives appears to be assimilated by that of the following consonant irrespective of the difference in the place of articulation, as far as the glottal abduction/adduction is concerned. It is clearly demonstrated in the case of syllable-initial stop that thyoarytenoid(VOC) activity is suppressed for the production of the stop consonants in question, the degree of which is slightest for the forced type and most marked for the aspirated type, while it is moderate for the lax type.
PDF

Neural Switching Mechanism in the late Korean-English bilinguals by Event-Related fMRI

Kim, Jeong-Seok
- Journal of Biomedical Engineering Research
- /
- v.29 no.4
- /
- pp.272-277
- /
- 2008
Functional MRI technique was used in this study for examining the language switching mechanisms between the first language (L1) and the second language (L2). Language switching mechanism is regarded as a complex task that involves an interaction between L1 and L2. The aim of study is to find out the brain activation patterns during the phonological process of reading real English words and English words written in Korean characters in a bilingual person. Korean-English bilingual subjects were examined while they covertly read four types of words native Korean words, Korean words of a foreign origin, English words written in Korean characters, and English words. The fMRI results reveal that the left hemispheric language-related regions at the brain, such as the left inferior frontal, superior temporal, and parietal cortices, have a greater response to the presentation of English words written in Korean characters than for the other types of words, in addition, a slight difference was observed in the occipital-temporal lobe. These results suggest that a change in the brain circuitry underlying the relational processes of language switching is mainly associated with general executive processing system in the left prefrontal cortex rather than with a similarity-based processing system in the occipital-temporal lobes.
https://doi.org/10.9718/JBER.2008.29.4.272 인용 PDF KSCI

Search Result 77, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)