Search | Korea Science

The Basic Study on making mono-phone for Korean Speech Recognition (한국어 음성 인식을 위한 mono-phone 구성의 기초 연구)

Hwang YoungSoo;Song Minsuck
- Proceedings of the Acoustical Society of Korea Conference / autumn / pp.45-48 / 2000
When building a large-vocabulary speech recognition system, it is better to use the segment rather than the syllable or the word as the recognition unit. In this paper, we study the basis for constructing mono-phones for Korean speech recognition. For the experiments, we use the OGI speech toolkit from the U.S.A. The results show that the recognition rate when a diphthong is treated as a single unit is superior to the rate when it is treated as two units, i.e. a glide plus a vowel. The recognition rate also differs slightly with the number of consonants.

Analysis of Acoustic Characteristics of Vowel and Consonants Production Study on Speech Proficiency in Esophageal Speech (식도발성의 숙련 정도에 따른 모음의 음향학적 특징과 자음 산출에 대한 연구)

Choi, Seong-Hee;Choi, Hong-Shik;Kim, Han-Soo;Lim, Sung-Eun;Lee, Sung-Eun;Pyo, Hwa-Young
- Speech Sciences / v.10 no.3 / pp.7-27 / 2003
Esophageal speech uses esophageal air during phonation. Fluent esophageal speakers take in air frequently during oral communication, but unskilled esophageal speakers have difficulty swallowing large amounts of air. The purpose of this study was to investigate differences in the acoustic characteristics of vowels and in consonant production according to speech proficiency level in esophageal speech. The participants were 13 normal male speakers and 13 male esophageal speakers (5 unskilled, 8 skilled), aged 50 to 70 years. The stimuli were a sustained /a/ vowel and 36 meaningless two-syllable words. The vowel used was /a/, and the 18 consonants were /k, n, t, m, p, s, c, $c^{h}$, $k^{h}$, $t^{h}$, $p^{h}$, h, l, k', t', p', s', c'/. Fundamental frequency (Fx), jitter, shimmer, HNR, and MPT were measured by electroglottography using Lx speech studio (Laryngograph Ltd, London, UK). The 36 meaningless words produced by the esophageal speakers were presented to 3 speech-language pathologists, who phonetically transcribed their responses. Fx, jitter, and HNR differed significantly between skilled and unskilled esophageal speakers (P<.05). For manner of articulation, ANOVA showed significant differences between the two esophageal speech groups according to speech proficiency: in the unskilled group, glides had the highest number of confusions with other phoneme classes and affricates were the most intelligible, whereas in the skilled group fricatives produced the highest number of confusions and nasals were the most intelligible. For place of articulation, glottal /h/ was the most confused consonant in both groups. Bilabials were the most intelligible in skilled esophageal speech, and velars were the most intelligible in unskilled esophageal speech.
Regarding syllable structure, 'CV+V' produced more confusions in the skilled esophageal group, while the unskilled esophageal speech group showed similar confusion in both structures. In unskilled esophageal speech, the significantly different Fx, jitter, and HNR values of the vowel and the highest confusions for liquid and nasal consonants could be attributed to unstable, improper contact of the neoglottis as a vibratory source, insufficiency in the phonatory air supply, and the higher motoric demand on the remaining articulators due to the morphological characteristics of the vocal tract after laryngectomy.

A Comparative Study on Modelling Readability Formulas: Focus on Primary and Secondary Textbooks (텍스트의 언어적 난이도 측정 공식 비교 연구 - 초중고 교과서를 중심으로 -)

Choe, In-Sook
- Journal of the Korean Society for Information Management / v.22 no.4 s.58 / pp.173-195 / 2005
The purpose of this study is to clarify whether readability formulas based on linguistic factors are suitable for secondary and older primary age texts. A comparison among formulas for primary age texts, formulas for both primary and secondary age, and formulas for secondary age revealed that formulas dedicated to a narrow age range were more effective. For secondary age, a model estimating readability scores from the average number of sentences in paragraphs, or a two-factor model using the average number of sentences and paragraphs in texts, was found to work well. For primary age, a model based on the total number of unique syllables, or a model based on the total number of unique syllables and the new syllable occurrence ratio, worked well.
https://doi.org/10.3743/KOSIM.2005.22.4.173

Statistical Survey of Vocabulary in Korean Textbook for Elementary School 6th-Grade (초등학교 6학년 국어교과서의 어휘 통계조사)

Kim, Jong-Young;Kim, Cheol-Su
- The Journal of the Korea Contents Association / v.12 no.5 / pp.515-524 / 2012
This paper surveyed statistics such as the total number of syllables, the kinds of syllables, the frequency of syllables, the number of eojeols (word phrases unique to the Korean language), the kinds of eojeols, the average length of eojeols, the frequency of eojeols, and the parts of speech in four Korean textbooks for 6th-grade students (6-1 Korean Reading, 6-1 Korean Speaking Listening Writing, 6-2 Korean Reading, and 6-2 Korean Speaking Listening Writing). The results of the statistical survey are as follows: the number of Hangul syllables was 194,683; the kinds of syllables were 1,290; the average frequency of syllables was 150.9; the number of eojeols was 70,185; the kinds of eojeols were 22,647; the average frequency of eojeols was 3.1; and the average length of eojeols was 2.8 syllables, with the longest consisting of 10 syllables. Among parts of speech, nouns are used more in the Korean Reading textbooks, and verbs are used more in the Korean Speaking Listening Writing textbooks.
https://doi.org/10.5392/JKCA.2012.12.05.515
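
The average frequencies reported in this abstract can be checked from its own counts. The sketch below assumes that "average frequency" means the total number of tokens divided by the number of distinct kinds; the abstract does not state the formula, and the function and variable names are illustrative only.

```python
# Counts quoted in the abstract above.
total_syllables = 194_683   # Hangul syllable tokens
syllable_kinds = 1_290      # distinct syllables
total_eojeols = 70_185      # eojeol tokens
eojeol_kinds = 22_647       # distinct eojeols


def average_frequency(tokens: int, kinds: int) -> float:
    """Mean occurrences per distinct kind (assumed definition), to one decimal."""
    return round(tokens / kinds, 1)


syllable_avg = average_frequency(total_syllables, syllable_kinds)
eojeol_avg = average_frequency(total_eojeols, eojeol_kinds)
print(syllable_avg, eojeol_avg)  # prints: 150.9 3.1
```

Under that assumption, both computed values agree with the averages given in the abstract.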

Heteronyms in modern Korean and their transcription in the IPA and the Roman alphabet (우리말 동철이음어(同綴異音語) IPA.로마자 표기 (사~섬))

Youe MahnGunn
- MALSORI / no.37 / pp.49-71 / 1999
The purpose of this paper is to gather pairs of heteronyms in modern Korean and transcribe them in the IPA and the Roman alphabet in order to propose that all of them should be differentiated in Hanngul orthography. More than a quarter of the whole Korean vocabulary consists of words with a long vowel, and the number of minimal pairs distinguished only by the chroneme reaches nearly ten thousand (i.e. twenty thousand words). The letter h is used syllable-finally here to represent the long vowel in Romanization, except for the vowel '으' [?:], which is transcribed by doubling the letter u (i.e. uu). Another factor that produces many heteronyms in Korean is the lack of full indication of the non-automatic reinforcement of the initial consonant of a word (or a morpheme) when preceded by another within a phrase (or a word). These reinforced word-initial consonants are written with the letter c and an apostrophe (like c'g-, c'd-, c'b-, c's-, c'j-) in Romanization here. The reinforced morpheme-initial consonant within a word is written with the letters k, t, p, ss and cz for the ㄲ, ㄸ, ㅃ, ㅆ and ㅉ sounds respectively. The contrasted pronunciations of pairs of heteronyms beginning with the ㅅ /$s^{h}$/ and ㅆ /s/ sounds are transcribed here for exemplification.

Perception of the English Epenthetic Stops by Korean Listeners

Han, Jeong-Im
- Speech Sciences / v.11 no.1 / pp.87-103 / 2004
This study investigates Korean listeners' perception of English stop epenthesis between sonorant and fricative segments. Specifically, it investigates 1) how often English epenthetic stops are perceived by native Korean listeners, given that Korean does not allow consonant clusters in codas; and 2) whether perception of the epenthetic stops, which are optional phonetic variations, not phonemes, could be improved without any explicit training. 120 English non-words with a monosyllabic structure of CVC1C2, where C1=/m, n, ŋ, l/ and C2=/s, θ, ʃ/, were given to two groups of native Korean listeners, who were asked to detect target stops such as [p], [t], and [k]. The number of their responses was computed to determine how often listeners succeeded in recovering the string of segments produced by the native English speaker. The results show that English epenthetic stops are poorly identified by native Korean listeners with low English proficiency, even when stimuli with strong acoustic cues are provided, but that perception of epenthetic stops is closely related to listeners' English proficiency, which suggests that perception can improve. The results further show that perception of epenthetic stops is asymmetric between coronal and non-coronal consonants.

The Effect of Respiration and Articulator Training Programs on Basic Ability of Speech Production in Cerebral Palsy Children (호흡 및 조음기관 훈련 프로그램이 뇌성마비아동의 말 산출 기초능력에 미치는 효과)

Lee, Gum-Suk;Yoo, Jae-Yeon
- Speech Sciences / v.15 no.3 / pp.103-116 / 2008
Children with cerebral palsy (CP) present abnormal vocalization patterns caused by problems with respiration and with paralyzed oral motor muscles, which are the basics of speech production. Thus, this study examined the effect of respiration and articulator training programs on the basic ability of speech production in children with CP. The subjects of this study were 4 children, 3 with spastic CP and 1 with ataxic CP. The respiration and articulator program was conducted in 30 sessions of 30 minutes each. A pre-test was administered twice before the program, an ongoing test was administered every 5 sessions during the experiment, and a post-test was administered twice. The program included speech production training such as respiration training; lip, jaw, cheek, and tongue exercises; velopharyngeal training; and related articulator training. The following results were obtained. First, all subjects had a maximum phonation time of less than 5 seconds before the experiment; 2 improved by more than 4 to 5 seconds during the experiment, but 2 showed relatively small gains. Second, children with vocal strength below 30dB before the experiment showed increased strength during the experiment, whereas children above 35dB before the experiment showed only minor change. Subject child 4 had lower vocal strength in the post-test period. Finally, although the subjects differed individually in syllable diadochokinetic ability, the function improved and the number of repetitions in one respiration also increased.

The Primitive Representation in Speech Perception: Phoneme or Distinctive Features (말지각의 기초표상: 음소 또는 변별자질)

Bae, Moon-Jung
- Phonetics and Speech Sciences / v.5 no.4 / pp.157-169 / 2013
Using a target detection task, this study compared the processing automaticity of phonemes and features in spoken syllable stimuli to determine whether the primitive representation in speech perception is the phoneme or the distinctive feature. For this, we adapted to auditory stimuli the visual search task (Treisman et al., 1992) developed to investigate the processing of visual features (e.g. color, shape, or their conjunction). In our task, the distinctive features (e.g. aspiration or coronal) corresponded to visual primitive features (e.g. color and shape), and the phonemes (e.g. /$t^h$/) to visual conjunctive features (e.g. colored shapes). Automaticity was measured by the set size effect, i.e. the increase in reaction time as the number of distractors increased. Three experiments were conducted, comparing phonemes with the laryngeal features (experiment 1), the manner features (experiment 2), and the place features (experiment 3). The results showed that the distinctive features are consistently processed faster and more automatically than the phonemes. Additionally, processing automaticity differed among the classes of distinctive features: the laryngeal features are the most automatic, the manner features are moderately automatic, and the place features are the least automatic. These results are consistent with previous studies (Bae et al., 2002; Bae, 2010) that showed a perceptual hierarchy of distinctive features.
https://doi.org/10.13064/KSSS.2013.5.4.157

Phonological processes of vowels from orthographic to pronounced words in the Buckeye Corpus by sex and age groups

Yang, Byunggon
- Phonetics and Speech Sciences / v.10 no.2 / pp.25-31 / 2018
This paper investigated the phonological processes of monophthongs and diphthongs in the pronounced words present in the Buckeye Corpus and compared the frequency distribution of these processes by sex and age groups to provide a clearer understanding of spoken English to linguists and phoneticians. Both orthographic and pronounced words were extracted from the transcribed label scripts of the Buckeye Corpus using R. Next, the phonological processes of monophthongs and diphthongs in the orthographic and pronounced labels were tabulated using R scripts, and a frequency distribution by vowel process types, as well as sex and age groups, was created. The results revealed that 95% of the orthographic words contained the same number of syllables, whereas 5% had different numbers of vowels, thereby proving that speakers tend to preserve vowels in spontaneous speech. In addition, deletion processes were preferred in natural speech. Most vowel deletions occurred with an unstressed syllable. Chi-square tests were performed to calculate dependence in the distribution of phonological process types for male and female groups and young and old groups. The results showed a very strong correlation. This finding indicates that vowel processes occurred in approximately the same pattern in natural and spontaneous speech data regardless of sex and age, as well as whether or not the vowel processes were identical. Based on these results, the author concludes that an analysis of phonological processes in spontaneous speech corpora can greatly enhance practical understanding of spoken English.
https://doi.org/10.13064/KSSS.2018.10.2.025

A study of phonological regression in 2-6 years of Korean children (서울-경기 지역 2-6세 아동의 발달기적 음운변동에 관한 연구 - 자음을 중심으로 -)

Kim Young-Tae
- MALSORI / no.21_24 / pp.3-24 / 1992
This study was designed to investigate the changes in phonological processes in normal Korean children aged 2 to 6 years. Forty-eight children who lived in Seoul or Kyung-Ki do were tested with a picture articulation test, and their articulation errors, including omissions, additions, and substitutions, were coded into phonological processes. Those phonological processes were discussed in terms of syllable structure, place, manner, assimilation, tenseness, and aspiration of sounds. Data were analyzed in two ways: (1) the number of subjects who showed each process and (2) the percentage of occurrence of each process. Analyses of omission-addition processes demonstrated that postvocalic omission occurred most frequently, followed by velar, alveolar, and glottal omission. Analyses of substitution processes showed that fronting (palatal and velar), backing (alveolar), and alveolization occurred most frequently in terms of the place of sounds. In terms of assimilation, alveolar, stopping, and aspiration assimilation occurred frequently. Analyses by tenseness and aspiration showed similar occurrences among the 4 processes, with slightly higher occurrences of tensing and aspiration than of laxing and deaspiration. All of the processes decreased with age. The number of processes shown by more than half of the children or exceeding 10% of occurrence was 20 at 2 years of age, 10 at 3 years of age, 1 at 4 years of age, and none at ages 5 and 6.

