• Title/Summary/Keyword: consonants and vowels

Articulation Scores and Confusion Patterns of the 100 Monosyllable Korean Speech Sounds (우리말 100단음절의 명료도와 오청상에 관한 연구)

  • 유방환;김홍기;노관택
    • Proceedings of the KOR-BRONCHOESO Conference / 1972.03a / pp.1.1-1 / 1972
  • It is well known that speech signals are the most reliable materials for hearing tests, and that selecting these materials poses various difficult problems. Because of these difficulties, no Korean speech-sound test material has been established to date. As a basis for such test materials, the author studied the articulation scores and confusion patterns of 100 monosyllabic Korean speech sounds in normal listeners, in normal listeners under various noise conditions (white noise and speech noise), and in patients with hearing loss. The results are as follows. 1. Except in perceptive deafness with poor articulation scores, confusion occurred among initial consonants, among vowels, and among final consonants, respectively, according to their distinctive features under the above test conditions. 2. There were remarkable differences in articulation scores between different kinds of noise at some intensity levels.

Phoneme Segmentation based on Volatility and Bulk Indicators in Korean Speech Recognition (한국어 음성 인식에서 변동성과 벌크 지표에 기반한 음소 경계 검출)

  • Lee, Jae Won
    • KIISE Transactions on Computing Practices / v.21 no.10 / pp.631-638 / 2015
  • Today, the demand for speech recognition systems in mobile environments is increasing rapidly. This paper proposes a novel method for Korean phoneme segmentation that is applicable to a phoneme-based Korean speech recognition system. First, the input signal is divided into blocks of the same size. As primitive indicators for phoneme segmentation, the proposed method uses a volatility indicator calculated for each block of the input speech signal and bulk indicators calculated for each bulk within a block, where a bulk is a set of adjacent samples that have the same sign. The vowels, voiced consonants, and voiceless consonants in the input signal are recognized sequentially, and the boundaries between phonemes are found using three dedicated recognition algorithms that combine the two types of primitive indicators. The experimental results show that the proposed method can markedly reduce the error rate of the existing phoneme segmentation method.
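
The abstract names the two primitive indicators but does not give their formulas, so the following is only a minimal sketch under stated assumptions. The block splitting and the definition of a bulk (adjacent samples with the same sign) follow the abstract. The volatility formula (mean absolute difference between adjacent samples) and the per-bulk indicators (length and summed absolute amplitude) are hypothetical choices made for illustration, not the paper's definitions.

```python
from itertools import groupby


def split_blocks(signal, block_size):
    """Split the signal into consecutive equal-size blocks (incomplete tail dropped)."""
    count = len(signal) // block_size
    return [signal[i * block_size:(i + 1) * block_size] for i in range(count)]


def volatility(block):
    """ASSUMED definition: mean absolute difference between adjacent samples."""
    if len(block) < 2:
        return 0.0
    return sum(abs(b - a) for a, b in zip(block, block[1:])) / (len(block) - 1)


def bulks(block):
    """Group adjacent samples that share the same sign (a 'bulk' in the abstract)."""
    def sign(x):
        return (x > 0) - (x < 0)
    return [list(run) for _, run in groupby(block, key=sign)]


def bulk_indicators(block):
    """ASSUMED indicators per bulk: (length, summed absolute amplitude)."""
    return [(len(b), sum(abs(x) for x in b)) for b in bulks(block)]
```

For example, `split_blocks([1, 2, -1, -3, 0, 4, 5, 6], 4)` gives two blocks, and the first block contains two bulks, `[1, 2]` and `[-1, -3]`. A real segmenter would then feed these indicators to the three recognition algorithms mentioned in the abstract, which are not sketched here.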

A study on the lip shape recognition algorithm using 3-D Model (3차원 모델을 이용한 입모양 인식 알고리즘에 관한 연구)

  • 김동수;남기환;한준희;배철수;나상동
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 1998.11a / pp.181-185 / 1998
  • Recently, research and development in communication systems has moved toward jointly using voice data and face images of the speaker to achieve a higher recognition rate than with voice data alone. We therefore present a method of lipreading in speech image sequences using a 3-D facial shape model. The method uses feature information from the face image, such as the degree of lip opening, the movement of the jaw, and the projection height of the lips. First, we fit the 3-D face model to the image sequence of the speaking face. Then, to obtain feature information, we compute the amount of variation in the fitted 3-D shape model across the image sequence and use it as the recognition parameters. We use the intensity inclination values obtained from the variation in the 3-D feature points to separate recognition units in the image sequence. We then use a discrete HMM algorithm in the recognition process, based on multiple observation sequences that fully take into account the variation of the 3-D feature points. In a recognition experiment with 8 Korean vowels and 2 Korean consonants, we obtained a recognition rate of about 80% for the plosives and vowels.

Platybasia in 22q11.2 Deletion Syndrome Is Not Correlated with Speech Resonance

  • Spruijt, Nicole E.;Kon, Moshe;Molen, Aebele B. Mink Van Der
    • Archives of Plastic Surgery / v.41 no.4 / pp.344-349 / 2014
  • Background: An abnormally obtuse cranial base angle, also known as platybasia, is a common finding in patients with 22q11.2 deletion syndrome (22q11DS). Platybasia increases the depth of the velopharynx and is therefore postulated to contribute to velopharyngeal dysfunction. Our objective was to determine the clinical significance of platybasia in 22q11DS by exploring the relationship between cranial base angles and speech resonance. Methods: In this retrospective chart review at a tertiary hospital, 24 children (age, 4.0-13.1 years) with 22q11DS underwent speech assessments and lateral cephalograms, which allowed for the measurement of the cranial base angles. Results: One patient (4%) had hyponasal resonance, 8 (33%) had normal resonance, 10 (42%) had hypernasal resonance on vowels only, and 5 (21%) had hypernasal resonance on both vowels and consonants. The mean cranial base angle was $136.5^{\circ}$ (standard deviation, $5.3^{\circ}$; range, $122.3^{\circ}$-$144.8^{\circ}$). The Kruskal-Wallis test showed no significant relationship between the resonance ratings and cranial base angles (P=0.242). Cranial base angles and speech ratings were not correlated (Spearman correlation=0.321, P=0.126). The group with hypernasal resonance had a significantly more obtuse mean cranial base angle ($138^{\circ}$ vs. $134^{\circ}$, P=0.049) but did not have a greater prevalence of platybasia (73% vs. 56%, P=0.412). Conclusions: In this retrospective chart review of patients with 22q11DS, cranial base angles were not correlated with speech resonance. The clinical significance of platybasia remains unknown.
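
The abstract reports a Spearman correlation between cranial base angles and ordinal resonance ratings. As a reminder of what that statistic computes, here is a small pure-Python sketch: rank both variables (ties share the mean rank), then take the Pearson correlation of the ranks. The function names are illustrative, and no patient data from the study is reproduced; any inputs you pass are your own.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; undefined (division by zero) if either input is constant."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sum((a - mean_x) ** 2 for a in x) ** 0.5
    norm_y = sum((b - mean_y) ** 2 for b in y) ** 0.5
    return cov / (norm_x * norm_y)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the tie-averaged ranks."""
    return pearson(average_ranks(x), average_ranks(y))
```

Because resonance ratings are ordinal and heavily tied, the tie-averaged ranking matters; the sketch does not compute a P value, which would need a statistics library or a permutation test.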

A study on the lip shape recognition algorithm using 3-D Model (3차원 모델을 이용한 입모양 인식 알고리즘에 관한 연구)

  • 남기환;배철수
    • Journal of the Korea Institute of Information and Communication Engineering / v.6 no.5 / pp.783-788 / 2002
  • Recently, research and development in communication systems has moved toward jointly using voice data and face images of the speaker to achieve a higher recognition rate than with voice data alone. We therefore present a method of lipreading in speech image sequences using a 3-D facial shape model. The method uses feature information from the face image, such as the degree of lip opening, the movement of the jaw, and the projection height of the lips. First, we fit the 3-D face model to the image sequence of the speaking face. Then, to obtain feature information, we compute the amount of variation in the fitted 3-D shape model across the image sequence and use it as the recognition parameters. We use the intensity inclination values obtained from the variation in the 3-D feature points to separate recognition units in the image sequence. We then use a discrete HMM algorithm in the recognition process, based on multiple observation sequences that fully take into account the variation of the 3-D feature points. In a recognition experiment with 8 Korean vowels and 2 Korean consonants, we obtained a recognition rate of about 80% for the plosives and vowels.

Automatic Phonetic Segmentation of Korean Speech Signal Using Phonetic-acoustic Transition Information (음소 음향학적 변화 정보를 이용한 한국어 음성신호의 자동 음소 분할)

  • 박창목;왕지남
    • The Journal of the Acoustical Society of Korea / v.20 no.8 / pp.24-30 / 2001
  • This article is concerned with automatic segmentation of Korean speech signals. All transition cases between phonetic units are classified into 3 types, and a different strategy is applied to each type. Type 1 is the discrimination of silence, voiced speech, and unvoiced speech. Histogram analysis of indicators consisting of wavelet coefficients and the SVF (Spectral Variation Function) of the wavelet coefficients is used for type 1 segmentation. Type 2 is the discrimination of adjacent vowels. Vowel transitions can be characterized by the spectrogram. Given the phonetic transcription and the transition-pattern spectrogram, speech signals containing consecutive vowels are automatically segmented by template matching. Type 3 is the discrimination of vowels and voiced consonants. The smoothed short-time RMS energy of the wavelet low-pass component and the SVF of the cepstral coefficients are adopted for type 3 segmentation. The experiment was performed on a set of 342 word utterances gathered from 6 speakers. The results show the validity of the method.
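
One of the type 3 features, the smoothed short-time RMS energy, can be sketched as follows. This is a simplified illustration, not the paper's implementation: it operates directly on the samples it is given (the wavelet low-pass filtering step is omitted), and the frame length, hop size, and the centered moving-average smoother are assumed parameters and choices.

```python
import math


def short_time_rms(signal, frame_len, hop):
    """RMS energy of each complete frame of frame_len samples, stepping by hop."""
    starts = range(0, len(signal) - frame_len + 1, hop)
    return [
        math.sqrt(sum(x * x for x in signal[s:s + frame_len]) / frame_len)
        for s in starts
    ]


def moving_average(values, width):
    """Centered moving average; the window shrinks at the edges of the sequence."""
    half = width // 2
    smoothed = []
    for i in range(len(values)):
        window = values[max(0, i - half):i + half + 1]
        smoothed.append(sum(window) / len(window))
    return smoothed


def smoothed_rms(signal, frame_len, hop, width):
    """Short-time RMS energy followed by moving-average smoothing."""
    return moving_average(short_time_rms(signal, frame_len, hop), width)
```

A drop in this smoothed energy contour between two high-energy regions is the kind of cue that could mark a vowel to voiced-consonant transition; how the paper combines it with the SVF is not specified in the abstract.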

The Design of Keyword Spotting System based on Auditory Phonetical Knowledge-Based Phonetic Value Classification (청음 음성학적 지식에 기반한 음가분류에 의한 핵심어 검출 시스템 구현)

  • Kim, Hack-Jin;Kim, Soon-Hyub
    • The KIPS Transactions:PartB / v.10B no.2 / pp.169-178 / 2003
  • This study addresses two topics: the classification of phone-like units (PLUs), which are the foundation of Korean large-vocabulary speech recognition, and the effectiveness of Chiljongseong (7 final consonants) and Paljongseong (8 final consonants) of the Korean language. Phone-like units classify phonemes phonetically according to the place and manner of articulation, and about 50 phone-like units are used in Korean speech recognition. In this study, auditory phonetic knowledge was applied to the classification of phone-like units to yield 45 phone-like units. The vowels 'ㅔ, ㅐ' were classified as the phone-like unit [ee]; 'ㅒ, ㅖ' as [ye]; and 'ㅚ, ㅙ, ㅞ' as [we]. Second, the Chiljongseong system of the draft for a unified spelling system, which is currently in use, and the Paljongseonggajokyong of the Korean script Haerye are described. Whether 'ㄷ' and 'ㅅ' have the same phonetic value as final consonants in the Korean language has long been debated in academia. In this study, the transition stages of Korean consonants were investigated, Chiljongseong and Paljongseonggajokyong were applied to speech recognition, and their effectiveness was verified. The experiment was divided into isolated word recognition and continuous speech recognition. PBW452 was used to test isolated word recognition; about 50 men and women, divided into 5 groups, spoke 50 words each. For the continuous speech recognition experiment, intended for use in the implemented stock exchange system, a sentence corpus of 71 stock exchange sentences and a speech corpus of those sentences were collected, with 5 men and women each speaking a sentence twice. In the experiment, when Paljongseonggajokyong was used for the final consonants, recognition performance improved by about 1.45% on average, and when phone-like units with Paljongseonggajokyong and auditory phonetic knowledge were applied together, the recognition rate increased by 1.5% to 2.02% on average. In the continuous speech recognition experiment, recognition performance improved by about 1% to 2% on average compared with the existing 49 or 56 phone-like units.

Development of the Korean Handwriting Assessment for Children Using Digital Image Processing

  • Lee, Cho Hee;Kim, Eun Bin;Lee, Onseok;Kim, Eun Young
    • KSII Transactions on Internet and Information Systems (TIIS) / v.13 no.8 / pp.4241-4254 / 2019
  • The efficiency and accuracy of handwriting measurement could be improved by adopting digital image processing. This study developed a computer-based Korean Handwriting Assessment tool. Second graders participated in this study by performing writing tasks of consonants, vowels, words, and sentences. We extracted boundary parameters for each letter using digital image processing and calculated the variables of size, size coefficient of variation (CV), misalignment, inter-letter space, inter-word space, and ratio of inter-letter space to inter-word space. The children were also administered traditional handwriting and visuomotor tests, and the digital variables from image processing were correlated with the scores on these established tests. Using these correlations, we established a three-point scoring system that computed test scores for each variable. We analyzed inter-rater reliability between the computer rater and a human rater, and test-retest reliability between the first and second performances. Validity was examined by analyzing the relationship between the Korean Handwriting Assessment and the established handwriting and visuomotor tests. We propose the Korean Handwriting Assessment for measuring size, size consistency, misalignment, inter-letter space, inter-word space, and space ratio using digital image processing. The tool showed reliability and validity and is expected to be useful for assessing children's handwriting.
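
Several of the variables listed in the abstract are simple functions of per-letter bounding boxes. The sketch below shows one plausible way to compute the size CV, the gaps between letters, and the space ratio. The `(x, y, width, height)` box format, the use of the population standard deviation, and the function names are all assumptions for illustration; the abstract does not state how the tool defines them.

```python
from statistics import mean, pstdev


def size_cv(sizes):
    """Coefficient of variation: standard deviation divided by the mean.

    ASSUMPTION: population standard deviation; the paper may use the sample form.
    """
    return pstdev(sizes) / mean(sizes)


def horizontal_gaps(boxes):
    """Gaps between consecutive (x, y, width, height) boxes, ordered left to right."""
    ordered = sorted(boxes)
    return [nxt[0] - (cur[0] + cur[2]) for cur, nxt in zip(ordered, ordered[1:])]


def space_ratio(inter_letter_gaps, inter_word_gaps):
    """Ratio of mean inter-letter space to mean inter-word space."""
    return mean(inter_letter_gaps) / mean(inter_word_gaps)
```

A lower size CV means more consistent letter size. Deciding which gaps count as inter-letter and which as inter-word requires word boundaries, which this sketch leaves to the caller.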

Speech Characteristics of Patients with Cleft Palates Based on Objective Measurements (구개열 환자 언어의 음성언어의학적 특징 연구)

  • 박혜숙;최홍식;김현기
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.13 no.2 / pp.124-131 / 2002
  • Speech characteristics of patients with cleft palates include resonance disorders, articulatory disorders, and voice disorders. The purpose of this study is to find the acoustic, physiological, and articulatory characteristics of cleft palate speakers. Thirteen control subjects and 3 cleft palate patients participated in this experiment. Test words were composed of simple vowels and consonants embedded in the low vowel /a/, /p'ap'i/, and /sasi/, according to the evaluation experiments. CSL, video fluoroscopy, a fiberscope, and a Nasometer were used to analyze VOT, vowel formants, articulator profiles, VP port images, and nasalance. The results are as follows: (1) The nasalance of cleft palate patients in the high vowel /i/, stop sounds, and fricative sounds was 60%, 34.8%, and 44.1%, respectively. These values were higher than those of the control group. (2) Posterior articulatory movements for /k'a/ in patients with cleft palates showed backward movement in comparison with the control group on video fluoroscopic images and palatograms. These results suggest that patients with cleft palates produce compensatory oral sounds to close the VP port. (3) The VOT in patients with cleft palates was longer than that of the control group.

Aerodynamic Characteristics of Korean Bilabial Stop Consonant as a Function of Phonemic Position in a Syllable (음절내 음소 출현 위치에 따른 한국어 양순 파열음의 공기역학적인 특징)

  • Park, Sang-Hee;Jeong, Haeng-Im;Jeong, Ok-Ran;Seok, Dong-Il
    • Speech Sciences / v.9 no.4 / pp.59-75 / 2002
  • An aerodynamic analysis was performed on 14 normal subjects (2 males, 12 females) using nonsense syllables composed of the Korean bilabial stops (/p, p', $p^{h}$/) and their preceding and/or following vowels /i, a, u/, namely [pi, p'i, $p^{h}i$, pa, p'a, $p^{h}a$, pu, p'u, $p^{h}u$, ipi, apa, upu, $ip^{h}i$, $ap^{h}a$, $up^{h}u$, ip'i, ap'a, up'u]. All measures were taken and analyzed using the Aerophone II voice function analyzer and included peak air pressure, mean air pressure, maximum flow rate, volume, mean SPL, and phonatory SPL. A t-test and one-way ANOVA were used for analysis, with Scheffe and Bonferroni post-hoc tests. The results were as follows. First, the MSPL and MAP of /p, p', $p^{h}$/ differed significantly between positions (initial and medial). In addition, different vowel environments produced significantly different aerodynamic characteristics for those consonants. In particular, the lax consonant /p/ differed significantly across the /i, a, u/ vowel environments, whereas the tense consonant /p'/ differed significantly only in the /i/ vowel environment.
