• Title/Abstract/Keywords: Syllable Number

Search results: 84 items; processing time 0.026 seconds

The Basic Study on making mono-phone for Korean Speech Recognition

  • 황영수;송민석
    • Acoustical Society of Korea: Conference Proceedings / Proceedings of the Acoustical Society of Korea 2000 Conference, Vol. 19, No. 2 / pp.45-48 / 2000
  • When building a large-vocabulary speech recognition system, it is better to use the segment than the syllable or the word as the recognition unit. In this paper, we study the basis for constructing mono-phones for Korean speech recognition. For the experiments, we use the OGI speech toolkit (U.S.A.). The results show that the recognition rate when a diphthong is treated as a single unit is superior to that when it is treated as two units, i.e. a glide plus a vowel. The recognition rate also differs slightly with the number of consonants.

Analysis of Acoustic Characteristics of Vowel and Consonants Production Study on Speech Proficiency in Esophageal Speech

  • 최성희;최홍식;김한수;임성은;이성은;표화영
    • Speech Sciences / Vol. 10, No. 3 / pp.7-27 / 2003
  • Esophageal speech uses esophageal air during phonation. Fluent esophageal speakers take in air frequently during oral communication, whereas unskilled esophageal speakers have difficulty swallowing large amounts of air. The purpose of this study was to investigate differences in the acoustic characteristics of vowels and in consonant production according to proficiency level in esophageal speech. The participants were 13 normal male speakers and 13 male esophageal speakers (5 unskilled and 8 skilled), aged 50 to 70 years. The stimuli were a sustained vowel /a/ and 36 meaningless two-syllable words. The vowel used was /a/, and the 18 consonants were /k, n, t, m, p, s, c, $c^{h}$, $k^{h}$, $t^{h}$, $p^{h}$, h, l, k', t', p', s', c'/. Fundamental frequency (Fx), jitter, shimmer, HNR, and MPT were measured by electroglottography using Lx Speech Studio (Laryngograph Ltd, London, UK). The 36 meaningless words produced by the esophageal speakers were presented to 3 speech-language pathologists, who phonetically transcribed what they heard. Fx, jitter, and HNR differed significantly between skilled and unskilled esophageal speakers (P<.05). For manner of articulation, ANOVA showed significant differences between the two proficiency groups: in the unskilled group, glides were most often confused with other phoneme classes and affricates were the most intelligible, whereas in the skilled group fricatives produced the most confusions and nasals were the most intelligible. For place of articulation, glottal /h/ was the most confused consonant in both groups; bilabials were the most intelligible in skilled esophageal speech and velars in unskilled esophageal speech. For syllable structure, 'CV+V' was confused more in the skilled group, while the unskilled group showed similar confusion in both structures. In unskilled esophageal speech, the significantly different Fx, jitter, and HNR of the vowel and the highest confusions for liquid and nasal consonants could be attributed to unstable, improper contact of the neoglottis as the vibratory source, insufficient phonatory air supply, and the higher motoric demand on the remaining articulators due to the morphological characteristics of the vocal tract after laryngectomy.

A Comparative Study on Modelling Readability Formulas: Focus on Primary and Secondary Textbooks

  • 최인숙
    • Journal of the Korean Society for Information Management / Vol. 22, No. 4 (Serial No. 58) / pp.173-195 / 2005
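  • This study examines whether the methodology of building a text-level score formula from factors that affect linguistic difficulty can be extended from elementary school texts to middle and high school texts, and seeks to identify factors that explain the new characteristics that emerge as the range of texts is extended. An integrated formula for elementary, middle, and high school texts, a dedicated formula for middle and high school texts, and a dedicated formula for elementary school texts were constructed and their characteristics compared. The comparison confirmed that separating the texts into smaller groups and building dedicated formulas yields better formulas, which reflect each group's characteristics more faithfully, than taking a wide range of texts and building one integrated formula. For scoring middle and high school texts, it was efficient to use the sentences-per-paragraph factor and the number-of-sentences · number-of-paragraphs factor; for scoring elementary school texts, it was efficient to use the number-of-distinct-eojeol factor and the number-of-distinct-eojeol · new-eojeol-occurrence-ratio factor.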

Statistical Survey of Vocabulary in Korean Textbook for Elementary School 6th-Grade

  • 김종영;김철수
    • The Journal of the Korea Contents Association / Vol. 12, No. 5 / pp.515-524 / 2012
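  • This study surveyed statistics on the vocabulary appearing in four Korean language textbooks for the 6th grade of elementary school (6-1 Reading; 6-1 Speaking, Listening, and Writing; 6-2 Reading; 6-2 Speaking, Listening, and Writing), including the total number of syllables, syllable types, syllable frequency, number of eojeol (space-delimited word units), eojeol types, average eojeol length, eojeol frequency, and parts of speech. The number of Hangul syllables is 194,683, the number of syllable types is 1,290, and the average syllable frequency is 150.9. The number of eojeol is 70,185, the number of eojeol types is 22,647, and the average eojeol frequency is 3.1. The average eojeol length is 2.8 syllables, and the longest eojeol has 10 syllables. As for parts of speech, nouns are slightly more frequent in the Reading textbooks and verbs in the Speaking, Listening, and Writing textbooks.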
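
The averages reported in this abstract follow from the raw counts by simple division. The Python sketch below is only an illustration of that arithmetic, not the authors' procedure: the function names `is_hangul_syllable` and `vocabulary_stats` are hypothetical, eojeol are approximated as whitespace-separated tokens, and only precomposed Hangul syllables (U+AC00 to U+D7A3) are counted.

```python
from collections import Counter


def is_hangul_syllable(ch: str) -> bool:
    """True if ch is a precomposed Hangul syllable (U+AC00..U+D7A3)."""
    return "\uac00" <= ch <= "\ud7a3"


def vocabulary_stats(text: str) -> dict:
    """Syllable and eojeol statistics for a Korean text.

    Assumption: an eojeol is any whitespace-separated token; punctuation
    attached to a token is not stripped in this simplified sketch.
    """
    eojeol = text.split()
    syllables = [ch for ch in text if is_hangul_syllable(ch)]
    syllable_counts = Counter(syllables)
    eojeol_counts = Counter(eojeol)
    return {
        "syllables": len(syllables),
        "syllable_types": len(syllable_counts),
        "mean_syllable_freq": len(syllables) / len(syllable_counts) if syllable_counts else 0.0,
        "eojeol": len(eojeol),
        "eojeol_types": len(eojeol_counts),
        "mean_eojeol_freq": len(eojeol) / len(eojeol_counts) if eojeol_counts else 0.0,
        "mean_eojeol_length": len(syllables) / len(eojeol) if eojeol else 0.0,
    }


# Toy sentence (made up for illustration, not taken from the textbooks).
stats = vocabulary_stats("나는 학교에 간다 나는 집에 간다")
print(stats)

# The same divisions applied to the counts reported in the abstract.
print(round(194683 / 1290, 1))   # average syllable frequency: 150.9
print(round(70185 / 22647, 1))   # average eojeol frequency: 3.1
print(round(194683 / 70185, 1))  # average eojeol length in syllables: 2.8
```

The last three lines reproduce the reported 150.9, 3.1, and 2.8 from the reported totals, which is why the 2.8 figure is read here as an average eojeol length.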

Heteronyms in modern Korean and their transcription in the IPA and the Roman alphabet (사~섬)

  • 유만근
    • Journal of the Phonetic Society of Korea: Malsori / No. 37 / pp.49-71 / 1999
  • The purpose of this paper is to gather pairs of heteronyms in modern Korean and transcribe them in the IPA and the Roman alphabet, in order to propose that all of them should be differentiated in Hanngul orthography. More than a quarter of the whole Korean vocabulary consists of words with a long vowel, and the number of minimal pairs distinguished only by the chroneme reaches nearly ten thousand (i.e. twenty thousand words). Syllable-final h is used here to represent the long vowel in Romanization, except for the vowel '으' [ɯ:], which is transcribed by doubling the letter u (i.e. uu). Another factor that produces many heteronyms in Korean is the lack of full indication of the non-automatic reinforcement of the initial consonant of a word (or a morpheme) when it is preceded by another within a phrase (or a word). These reinforced word-initial consonants are written here with the letter c and an apostrophe (c'g-, c'd-, c'b-, c's-, c'j-) in Romanization. The reinforced morpheme-initial consonant within a word is written with the letters k, t, p, ss and cz for the ㄲ, ㄸ, ㅃ, ㅆ and ㅉ sounds respectively. The contrasted pronunciations of pairs of heteronyms beginning with the ㅅ /$s^h$/ and ㅆ /s/ sounds are transcribed here for exemplification.

Perception of the English Epenthetic Stops by Korean Listeners

  • Han, Jeong-Im
    • Speech Sciences / Vol. 11, No. 1 / pp.87-103 / 2004
  • This study investigates Korean listeners' perception of English stop epenthesis between sonorant and fricative segments. Specifically, it investigates 1) how often English epenthetic stops are perceived by native Korean listeners, given that Korean does not allow consonant clusters in codas; and 2) whether perception of the epenthetic stops, which are optional phonetic variants rather than phonemes, can improve without any explicit training. 120 English non-words with the monosyllabic structure CVC1C2, where C1=/m, n, ŋ, l/ and C2=/s, θ, ʃ/, were given to two groups of native Korean listeners, who were asked to detect target stops such as [p], [t], and [k]. The number of their responses was computed to determine how often listeners succeeded in recovering the string of segments produced by the native English speaker. The results show that English epenthetic stops are poorly identified by native Korean listeners with low English proficiency, even when stimuli with strong acoustic cues are provided, but that perception of epenthetic stops is closely related to listeners' English proficiency, which suggests that perception can improve. The results further show an asymmetry in the perception of epenthetic stops between coronal and non-coronal consonants.

The Effect of Respiration and Articulator Training Programs on Basic Ability of Speech Production in Cerebral Palsy Children

  • 이금숙;유재연
    • Speech Sciences / Vol. 15, No. 3 / pp.103-116 / 2008
  • Children with cerebral palsy (CP) show abnormal vocalization patterns caused by respiratory problems and paralysis of the oral motor muscles, which are the basis of speech production. This study therefore examined the effect of respiration and articulator training programs on the basic speech production ability of children with CP. The subjects were 4 children, 3 with spastic CP and 1 with ataxic CP. The respiration and articulator program was conducted in 30 sessions of 30 minutes each. A pre-test was administered twice before the program, an ongoing test was administered every 5 sessions during the experiment, and a post-test was administered twice. The program included respiration training; lip, jaw, cheek, and tongue exercises; velopharyngeal training; and related articulator training. The following results were obtained. First, all subjects had a maximum phonation time of less than 5 seconds before the experiment; 2 improved by more than 4-5 seconds during the experiment, but 2 showed relatively small gains. Second, children whose vocal intensity was below 30 dB before the experiment became louder during the experiment, whereas children above 35 dB before the experiment showed only minor change. Subject 4 had lower vocal intensity in the post-test period. Finally, although subjects differed individually in syllable diadochokinetic ability, the function improved and the number of repetitions per breath also increased.

The Primitive Representation in Speech Perception: Phoneme or Distinctive Features

  • 배문정
    • Phonetics and Speech Sciences / Vol. 5, No. 4 / pp.157-169 / 2013
  • Using a target detection task, this study compared the processing automaticity of phonemes and features in spoken syllable stimuli to determine whether the primitive representation in speech perception is the phoneme or the distinctive feature. For this purpose, we adapted to auditory stimuli the visual search task (Treisman et al., 1992) that was developed to investigate the processing of visual features (e.g. color, shape, or their conjunction). In our task, the distinctive features (e.g. aspiration or coronal) corresponded to visual primitive features (e.g. color and shape), and the phonemes (e.g. /$t^h$/) to visual conjunctive features (e.g. colored shapes). Automaticity was measured by the set size effect, i.e. the increase in reaction time as the number of distractors increased. Three experiments were conducted, comparing laryngeal features (experiment 1), manner features (experiment 2), and place features (experiment 3) with phonemes. The results showed that distinctive features are consistently processed faster and more automatically than phonemes. In addition, processing automaticity differed among the classes of distinctive features: laryngeal features are the most automatic, manner features are moderately automatic, and place features are the least automatic. These results are consistent with previous studies (Bae et al., 2002; Bae, 2010) that showed a perceptual hierarchy of distinctive features.
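
One common way to quantify a set size effect is the slope of mean reaction time against set size, where a flatter slope indicates more automatic processing. The abstract does not say how the authors computed it, so the Python sketch below is an assumption throughout: the function name `set_size_slope` is hypothetical, and the reaction times are invented purely to show the calculation.

```python
def set_size_slope(set_sizes, mean_rts):
    """Least-squares slope of mean reaction time (ms) against set size.

    The result is in ms per additional item; values near zero suggest
    that adding distractors costs little, i.e. more automatic processing.
    """
    n = len(set_sizes)
    mean_x = sum(set_sizes) / n
    mean_y = sum(mean_rts) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(set_sizes, mean_rts))
    sxx = sum((x - mean_x) ** 2 for x in set_sizes)
    return sxy / sxx


# Hypothetical illustrative values, NOT data from the study.
set_sizes = [2, 4, 6]
feature_rts = [600.0, 605.0, 610.0]   # nearly flat across set sizes
phoneme_rts = [600.0, 660.0, 720.0]   # grows with each added distractor

print(set_size_slope(set_sizes, feature_rts))  # 2.5 ms per item
print(set_size_slope(set_sizes, phoneme_rts))  # 30.0 ms per item
```

With these made-up numbers the feature condition has the smaller slope, which is the pattern the abstract describes as more automatic processing.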

Phonological processes of vowels from orthographic to pronounced words in the Buckeye Corpus by sex and age groups

  • Yang, Byunggon
    • Phonetics and Speech Sciences / Vol. 10, No. 2 / pp.25-31 / 2018
  • This paper investigated the phonological processes of monophthongs and diphthongs in the pronounced words present in the Buckeye Corpus and compared the frequency distribution of these processes by sex and age groups to provide a clearer understanding of spoken English to linguists and phoneticians. Both orthographic and pronounced words were extracted from the transcribed label scripts of the Buckeye Corpus using R. Next, the phonological processes of monophthongs and diphthongs in the orthographic and pronounced labels were tabulated using R scripts, and a frequency distribution by vowel process types, as well as sex and age groups, was created. The results revealed that 95% of the orthographic words contained the same number of syllables, whereas 5% had different numbers of vowels, thereby proving that speakers tend to preserve vowels in spontaneous speech. In addition, deletion processes were preferred in natural speech. Most vowel deletions occurred with an unstressed syllable. Chi-square tests were performed to calculate dependence in the distribution of phonological process types for male and female groups and young and old groups. The results showed a very strong correlation. This finding indicates that vowel processes occurred in approximately the same pattern in natural and spontaneous speech data regardless of sex and age, as well as whether or not the vowel processes were identical. Based on these results, the author concludes that an analysis of phonological processes in spontaneous speech corpora can greatly enhance practical understanding of spoken English.

A study of phonological regression in 2-6 years of Korean children

  • 김영태
    • Journal of the Phonetic Society of Korea: Malsori / Nos. 21-24 / pp.3-24 / 1992
  • This study was designed to investigate changes in phonological processes in normal Korean children aged 2 to 6 years. Forty-eight children who lived in Seoul or Kyung-Ki do were tested with a picture articulation test, and their articulation errors, including omissions, additions, and substitutions, were coded into phonological processes. These processes were examined in terms of syllable structure, place, manner, assimilation, tenseness, and aspiration of sounds. The data were analyzed in two ways: (1) the number of subjects who showed each process and (2) the percentage of occurrence of each process. Analyses of omission-addition processes demonstrated that postvocalic omission occurred most frequently, followed by velar-, alveolar-, and glottal omission. Analyses of substitution processes showed that, in terms of place, fronting (palatal and velar), backing (alveolar), and alveolization occurred most frequently. In terms of assimilation, alveolar-, stopping, and aspiration assimilation occurred frequently. Analyses by tenseness and aspiration showed similar occurrences among the 4 processes, with slightly higher occurrences of tensing and aspiration than of laxing and deaspiration. All of the processes decreased with age. The number of processes shown by more than half of the children or exceeding 10% of occurrence was 20 at 2 years of age, 10 at 3 years, 1 at 4 years, and none at ages 5 and 6.
