Search | Korea Science

Automatic Phonetic Segmentation of Korean Speech Signal Using Phonetic-acoustic Transition Information (음소 음향학적 변화 정보를 이용한 한국어 음성신호의 자동 음소 분할)

박창목;왕지남
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.8
- /
- pp.24-30
- /
- 2001
This article is concerned with automatic segmentation for Korean speech signals. All kinds of transition cases of phonetic units are classified into 3 types and different strategies for each type are applied. The type 1 is the discrimination of silence, voiced-speech and unvoiced-speech. The histogram analysis of each indicators which consists of wavelet coefficients and SVF (Spectral Variation Function) in wavelet coefficients are used for type 1 segmentation. The type 2 is the discrimination of adjacent vowels. The vowel transition cases can be characterized by spectrogram. Given phonetic transcription and transition pattern spectrogram, the speech signal, having consecutive vowels, are automatically segmented by the template matching. The type 3 is the discrimination of vowel and voiced-consonants. The smoothed short-time RMS energy of Wavelet low pass component and SVF in cepstral coefficients are adopted for type 3 segmentation. The experiment is performed for 342 words utterance set. The speech data are gathered from 6 speakers. The result shows the validity of the method.
PDF

A System of English Vowel Transcription Based on Acoustic Properties (영어 모음음소의 표기체계에 관한 연구)

Kim, Dae-Won
- Speech Sciences
- /
- v.10 no.4
- /
- pp.73-79
- /
- 2003
There are more than five systems for transcribing English vowels. Because of this diversity, teachers of English and students are confronted with not a little problems with the English vowel symbols used in the English-Korean dictionaries, English text books, books for Phonetics and Phonology. This study was designed to suggest criterions for the phonemic transcription of English vowels on the basis of phonetic properties of the vowels and a system of English vowel transcription based on the criterions in order to minimize the problems with inter-system differences. A speaker (phonetician) of RP English uttered a series of isolated minimal pairs containing the vowels in question. The suggested vowel symbols are as follows: (1) Simple vowels: /i:/ in beat, /I/ bit, /$\varepsilon$/ bet, /${\ae}$ bat, /a:/ father, /Dlla/ bod, /c:/ bawd, /$\upsilon$ put, /u:/ boot /$\Lambda$/ but, and /e/ about /$\varepsilon:ll3:r$/ bird. (2) Diphthongs: /aI/ in bite, /a$\upsilon$/ bout, /cI/ boy, /3$\upsilon$llo$\upsilon$/ boat, /eI/ bait, /eelleer/ air, /uelluer/ poor, /iellier/ beer. Where two symbols are shown corresponding to the vowel in a single word, the first is appropriate for most speakers of British English and the second for most speakers of American English.
PDF

Using Korean Phonetic Alphabet (KPA) in Teaching English Stop Sounds to Koreans

Jo, Un-Il
- Proceedings of the KSPS conference
- /
- 2000.07a
- /
- pp.165-165
- /
- 2000
In the phoneme level, English stop sounds are classified with the feature of 'voicing': voiceless and voiced (p/b, t/d, k/g). But when realized, a voiceless stop is not alwats the same sound. For example, the two 'p' sounds in 'people' are different. The former is pronounced with much aspiration, while the latter without it. This allophonic differnece between [$P^h$] and [p] out of an English phoneme /p/ can be well explained to Koreans because in Korean these two sounds exist as two different phonemes {/ㅍ/ and /ㅃ/ respectively). But difficulties lie in teaching the English voiced stop sounds (/b, d, g/) to Koreans because in Korean voiced stops do not exist as phonemes but as allophones of lenis sounds (/ㅂ, ㄷ, ㄱ/). For example, the narrow transcription of '바보' (a fool) is [baboo]. In the word initial position, Korean lenis stops are pronounced voiceless and even with a slight aspiration while in the inrervocalic environments they become voiced, That is in Korean voiced stops do not occur independently and neither they have their own letters. To explain all these more effectively to Koreans, it is very helpful to use Korean Phenetic Alphabet (KPA) which is devised by Dr. LEE Hyunbok (a professor of phonetics at Seoul National Univ. and chairman of Phonetic Society of Koera.)(omitted)
PDF

A Pull System of the IPA in the Arae Hangul Wordprocessor (글틀<한글>에서 정밀 국제 음성 기호 쓰기)

KIM Zong-Su
- MALSORI
- /
- no.19_20
- /
- pp.29-32
- /
- 1990
The Arae Hangul wordprocessor is one of the most famous software in Korea. It is very useful to linguists In that it has various systems of foreign characters and enables us to edit enough new ones freely. It has already a broad system of the IPA, which cannot be used for narrow transcription. The writer proposes here a full system of the IPA revised to 1979, which you can get from the Phonetic Society of Korea or Hangul & Computer Co., Ltd., free of charge. To use the new system of the IPA in the Arae Hangul wordprocessor, you must replace six revised files: (1) SPECIAL2.SFT, (2) SPECIAL2.PFT, (3)SPECIAL2.PFT, (4)ASCII.SFT, (5)ASCII.PFT, (6)ASCII.LFT.
PDF

Consonant Inventories of the Better Cochlear Implant Children in Korea (말지각 능력이 우수한 인공와우 착용 아동들의 조음 능력;음소의 정밀 전사)

Chang, Son-A;Kim, Su-Jin;Sin, Ji-Yeong
- Proceedings of the KSPS conference
- /
- 2007.05a
- /
- pp.274-277
- /
- 2007
The purpose of this study is 1) to describe the phoneme inventories of cochlear implant(CI) children and 2) to describe their utterances using narrow phonetic transcription method. All the subjects had more than 2 year-experience with CI and showed more than 87% open-set sentence perception abilities. Average consonant accuracy was 81.36% and it was improved up to 87.41% when distortion errors were not counted. They showed different error patterns from hearing aid users. The prominent error pattern was weakening of consonants.
PDF

Consecutive Vowel Segmentation of Korean Speech Signal using Phonetic-Acoustic Transition Pattern (음소 음향학적 변화 패턴을 이용한 한국어 음성신호의 연속 모음 분할)

Park, Chang-Mok;Wang, Gi-Nam
- Proceedings of the Korea Information Processing Society Conference
- /
- 2001.10a
- /
- pp.801-804
- /
- 2001
This article is concerned with automatic segmentation of two adjacent vowels for speech signals. All kinds of transition case of adjacent vowels can be characterized by spectrogram. Firstly the voiced-speech is extracted by the histogram analysis of vowel indicator which consists of wavelet low pass components. Secondly given phonetic transcription and transition pattern spectrogram, the voiced-speech portion which has consecutive vowels automatically segmented by the template matching. The cross-correlation function is adapted as a template matching method and the modified correlation coefficient is calculated for all frames. The largest value on the modified correlation coefficient series indicates the boundary of two consecutive vowel sounds. The experiment is performed for 154 vowel transition sets. The 154 spectrogram templates are gathered from 154 words(PRW Speech DB) and the 161 test words(PBW Speech DB) which are uttered by 5 speakers were tested. The experimental result shows the validity of the method.
PDF

Morpheme-based Korean broadcast news transcription (형태소 기반의 한국어 방송뉴스 인식)

Park Young-Hee;Ahn Dong-Hoon;Chung Minhwa
- Proceedings of the KSPS conference
- /
- 2002.11a
- /
- pp.123-126
- /
- 2002
In this paper, we describe our LVCSR system for Korean broadcast news transcription. The main focus is to find the most proper morpheme-based lexical model for Korean broadcast news recognition to deal with the inflectional flexibilities in Korean. There are trade-offs between lexicon size and lexical coverage, and between the length of lexical unit and WER. In our system, we analyzed the training corpus to obtain a small 24k-morpheme-based lexicon with 98.8％ coverage. Then, the lexicon is optimized by combining morphemes using statistics of training corpus under monosyllable constraint or maximum length constraint. In experiments, our system reduced the number of monosyllable morphemes from 52％ to 29％ of the lexicon and obtained 13.24％ WER for anchor and 24.97％ for reporter.
PDF

Prosodic Features at "Sentence Boundaries" in Oral Presentations

Umesaki, Atsuko-Furuta
- MALSORI
- /
- no.41
- /
- pp.83-96
- /
- 2001
It is generally said that falling intonation is used at the end of a declarative sentence. However, this is not the case with all stretches of spontaneous speech which are marked in transcription as sentences. The present paper examines intonation patterns appearing at the end of declarative sentences in oral presentations, and discusses instances where falling intonation does not appear. The texts used for analysis are eight oral presentations collected at international conferences in the field of physics. Quantitative and qualitative analyses are carried out. Three major factors related to discourse structure have been found for non-occurrence of falling intonation at sentence boundaries.
PDF

Prosodic Features at "Sentence Boundaries" in Oral Presentations

Umesaki, Atsuko-Furuta
- Proceedings of the KSPS conference
- /
- 2000.07a
- /
- pp.149-164
- /
- 2000
It is generally said that falling intonation is used at the end of a declarative sentence. However, this is not the case with all stretches of spontaneous speech which are marked in transcription as sentences. The present paper examines intonation patterns appearing at the end of declarative sentences in oral presentations, and discusses instances where falling intonation does not appear. The texts used for analysis are eight oral presentations collected at international conferences in the field of physics. Quantitative and qualitative analyses are carried out. Three major factors related to discourse structure have been found for nonoccurrence of falling intonation at sentence boundaries.
PDF

The design and implementation of Web Component for Korean to Roman transcription (국어 로마자 전사표기 웹 컴포넌트 설계 및 구현)

Kim Hongsop
- Journal of the Korea Society of Computer and Information
- /
- v.9 no.4 s.32
- /
- pp.71-76
- /
- 2004
In this paper, a web-based automatic transcription component is designed and implemented for mechanical conversion of revised Korean-to-Romanization notation rule. Specially. we proposes system architecture and algorithms that transcript Korean to Roman automatically after transliterate Korean to phonetic symbol applying phonological principles. The components operate under the web server's script mechanism. and the dictionary for exceptional usage is designed as an accessorial function supported either operating at web server internally or externally. The overall system architecture is presented by UML. specification and pseudo code. The proposed architecture can be implemented in encapsulated service by object oriented component and that can be easily adapted and modified on the internet environment and this system may have many advantages to improve efficiency, library reuse. extensibility at software development.
PDF

Search Result 50, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)