• Title/Abstract/Keywords: Speech articulation

357 search results (processing time: 0.038 s)

Dysarthric speaker identification with different degrees of dysarthria severity using deep belief networks

  • Farhadipour, Aref;Veisi, Hadi;Asgari, Mohammad;Keyvanrad, Mohammad Ali
    • ETRI Journal
    • /
    • Vol. 40, No. 5
    • /
    • pp.643-652
    • /
    • 2018
  • Dysarthria is a degenerative disorder of the central nervous system that affects the control of articulation and pitch; therefore, it affects the uniqueness of the sound produced by the speaker. Hence, dysarthric speaker recognition is a challenging task. In this paper, a feature-extraction method based on deep belief networks is presented for the task of identifying a speaker suffering from dysarthria. The effectiveness of the proposed method is demonstrated and compared with the well-known Mel-frequency cepstral coefficient features. For classification, a multi-layer perceptron neural network with two structures is proposed. Our evaluations using the Universal Access speech database produced promising results and outperformed other baseline methods. In addition, speaker identification under both text-dependent and text-independent conditions is explored. The highest accuracy achieved using the proposed system is 97.3%.

Test & Evaluation of Airborne Communication, Navigation, Identification Equipment

  • 김성우;김민수;이영식;이병화;오우섭
    • 한국군사과학기술학회지
    • /
    • Vol. 15, No. 5
    • /
    • pp.615-622
    • /
    • 2012
  • Airborne radio communication, navigation, and identification equipment is basic equipment on an airplane and is characterized by many quantitative and qualitative user requirements. Testing and evaluating this equipment looks simple, but it involves many complex factors. This paper describes the test and evaluation of airborne radio communication, navigation, and identification equipment.

Aspects of the word-final stop releasing and its phonetic correlates in reading the English isolated words enumerated

  • 이석재;강수하;박지현;황선민
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회, May 2003 Conference Proceedings
    • /
    • pp.61-68
    • /
    • 2003
  • This experimental study shows that, when enumerated isolated English words are read aloud, release of the word-final stop is used to signal enumeration, together with the well-known intonational pattern for it. The study further seeks phonetic correlates of word-final stop release, focusing on the association of release/non-release with i) the place of articulation (POA) of the word-final stop, ii) the quality of the vowel preceding the final stop, and iii) the voicing of the word-final stop.

A Study of Syllable Maximum Repetition Rate for Stimuli, Age and Sex

  • 최홍식;차정민;심현섭
    • 대한후두음성언어의학회지
    • /
    • Vol. 12, No. 1
    • /
    • pp.55-60
    • /
    • 2001
  • Background and Objectives: Syllable maximum repetition rate (MRR) is the ability to move the articulators in rapid repetition and is assessed, as one of the MPT, to evaluate oromechanism function. MRR is measured as a rate (counts/sec), with accuracy and consistency considered at the same time. The objective of the present study was to examine stimulus effects and age and sex differences in MRR. Materials and Method: Sixty normal males and females (1:1) participated and were divided into two groups, young (<40 years old) and old ($\geq$40 years old). The stimuli were $/p^{=}a/$, $/t^{=}a/$, $/k^{=}a/$, $/p^{h}a/$, $/t^{h}a/$, $/k^{h}a/$, and $/p^{=}at^{=}ak^{=}a/$ for manner of articulation (tense and aspirated); $/p^{h}at^{h}ak^{h}a/$ and $/t^{h}ap^{h}ak^{h}a/$ for the effect of syllable order; the glide /u-i/ for coordination of lip and tongue; and the interrupted vowel /i/ for laryngeal function. Results: There were few differences in the MRR tasks between the two age groups, between sexes, or by manner of articulation. The fastest average MRR among the single syllables in this study was for $/t^{=}a/$. Significant differences existed between the MRR for $/p^{h}at^{h}ak^{h}a/$ and $/t^{h}ap^{h}ak^{h}a/$, which suggests that MRR was affected by the order of the syllables. The MRR for the interrupted vowel /i/ was about 2 counts/sec slower than the average rate for single syllables. Conclusion: These results indicate that the order of the syllables, rather than age or sex, was a crucial variable in MRR. There were no differences in rate by age, sex, or manner of articulation. The interrupted vowel repetition rate was slightly slow. These results can provide basic information for assessing the speech mechanism and can be useful in developing effective stimuli to differentiate the disordered group from the normal group.

An Acoustic Analysis of Pronunciation in Children with Angle's Class II Div. 1 Malocclusion

  • 박윤정;이상훈;손동수
    • 대한소아치과학회지
    • /
    • Vol. 24, No. 1
    • /
    • pp.95-111
    • /
    • 1997
  • The human speech organs consist of the respiratory system (lung, larynx), the phonation system (vocal cord), the articulation system (esophagus, pharynx, uvula, teeth, gingiva, palate, tongue, lip), and the resonating system (oral cavity, nasal cavity, paranasal sinus). Because teeth are components of the articulation system, it has been reported that persons with abnormally positioned teeth generally have abnormal occlusion and pronunciation. In this study, /ㅅ(s)/, the consonant most commonly mispronounced by children with malocclusion, was used with the seven single vowels. The syllables /사(sa), 서($s\delta$), 소(so), 수(su), 스($s\omega$), 시(si), 세(se)/ and the vowels /ㅏ(a), ㅓ($\delta$), ㅗ(o), ㅜ(u), ㅡ($\omega$), ㅣ(i), ㅔ(e)/ were recorded and analyzed with a computer speech analysis program by measuring formants, which were then compared to investigate differences in pronunciation between children with Angle's Class I occlusion and those with Angle's Class II div. 1 malocclusion. The results were as follows: 1. In the Angle's Class II div. 1 group, there were no significant differences in F1 of any recorded sound compared with the Angle's Class I group (p>0.05). 2. In the consonants, there were significant differences in F2 of /스($s\omega$)/ and in the F2/F1 ratio of /사(sa), 서($s\delta$), 시(si)/ between the two groups (p<0.05). 3. In the vowels, there were significant differences in the F2/F1 ratio of /ㅓ($\delta$)/ (p<0.05) and no significant differences in F2/F1 ratio between the two groups (p>0.05). 4. In the consonants, there were significant differences in F2 and the F2/F1 ratio when the succeeding vowel was high or low, and in the F2/F1 ratio when it was front, according to tongue position (p<0.05). 5. In the vowels, there were no significant differences in formants according to tongue position (p>0.05).

Effects of low-dose topiramate on language function in children with migraine

  • Han, Seung-A;Yang, Eu Jeen;Kong, Younghwa;Joo, Chan-Uhng;Kim, Sun Jun
    • Clinical and Experimental Pediatrics
    • /
    • Vol. 60, No. 7
    • /
    • pp.227-231
    • /
    • 2017
  • Purpose: This study aimed to verify the safety of low-dose topiramate on language development in pediatric patients with migraine. Methods: Thirty newly diagnosed pediatric patients with migraine who needed topiramate were enrolled and assessed twice with standard language tests, including the Test of Language Problem Solving Abilities (TOPs), Receptive and Expressive Vocabulary Test, Urimal Test of Articulation and Phonology, and computerized speech laboratory analysis. Data were collected before treatment, and topiramate as monotherapy was sustained for at least 3 months. The mean follow-up period was $4.3 \pm 2.7$ months. The mean topiramate dosage was 0.9 mg/kg/day. Results: The patients' mean age was $144.1 \pm 42.3$ months (male-to-female ratio, 9:21). None of the language parameters of the TOPs changed significantly after the topiramate treatment: determining cause, from $15.0 \pm 4.4$ to $15.4 \pm 4.8$ (P>0.05); making inference, from $17.6 \pm 5.6$ to $17.5 \pm 6.6$ (P>0.05); predicting, from $11.5 \pm 4.5$ to $12.3 \pm 4.0$ (P>0.05); and total TOPs score, from $44.1 \pm 13.4$ to $45.3 \pm 13.6$ (P>0.05). The total mean length of utterance in words during the test decreased from $44.1 \pm 13.4$ to $45.3 \pm 13.6$ (P<0.05). The Receptive and Expressive Vocabulary Test results decreased from $97.7 \pm 22.1$ to $96.3 \pm 19.9$ months, and from $81.8 \pm 23.4$ to $82.3 \pm 25.4$ months, respectively (P>0.05). In the articulation and phonology validation in both groups, speech pitch and energy were not significant, and all the vowel test results showed no other significant values. Conclusion: No significant difference was found in the language-speaking ability between the patients; however, the number of vocabularies used decreased. Therefore, topiramate should be used cautiously for children with migraine.

Coarticulation Model of Hangul Visual Speech for Lip Animation

  • 공광식;김창헌
    • 한국정보과학회논문지:시스템및이론
    • /
    • Vol. 26, No. 9
    • /
    • pp.1031-1041
    • /
    • 1999
  • Existing lip animation methods for Hangul define the mouth shapes of phonemes as a small set of shapes and animate the lips by interpolating between them. However, the real motion of the lips during articulation is neither a linear function nor a simple nonlinear function, so generating in-between motion by interpolation cannot reproduce the lip motion of a phoneme effectively. These methods also ignore coarticulation, so they cannot represent how lip motion varies between phonemes. In this paper we present a coarticulation model for natural lip animation of Hangul. Using two video cameras, we film the speaker's lips during articulation and extract lip control parameters. Each lip control parameter is defined as a dominance function, based on Löfqvist's speech production gesture theory, that approximates the real lip motion of a phoneme and is used when the lip animation is generated. The dominance functions are combined using a blending function and a Hangul composition rule based on demi-syllables, producing Hangul pronunciation with coarticulation applied. As a result, our coarticulation model approximates real lip motion more closely than the existing interpolation-based model and produces natural lip motion that accounts for coarticulation.
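
The dominance-and-blending idea in the abstract above can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the functional form, so the negative-exponential dominance function, the parameter names (`center`, `magnitude`, `rate`), and the example values are all hypothetical, and the demi-syllable composition rule is not modeled.

```python
import math

def dominance(t, center, magnitude=1.0, rate=4.0):
    """Influence of one segment on a lip parameter at time t (seconds).

    Assumed form: a negative exponential that peaks at the segment center
    and decays with temporal distance from it.
    """
    return magnitude * math.exp(-rate * abs(t - center))

def blend(t, segments):
    """Dominance-weighted average of the segment targets at time t."""
    weights = [
        dominance(t, s["center"], s.get("magnitude", 1.0), s.get("rate", 4.0))
        for s in segments
    ]
    total = sum(weights)
    return sum(w * s["target"] for w, s in zip(weights, segments)) / total

# Hypothetical lip-opening targets (arbitrary units) for two adjacent segments.
segments = [
    {"target": 0.2, "center": 0.10},
    {"target": 0.9, "center": 0.30},
]

for t in (0.10, 0.20, 0.30):
    print(f"t={t:.2f}s  lip opening={blend(t, segments):.3f}")
```

Because every segment contributes at every instant, the value at a segment's center is already pulled toward its neighbor, which is the coarticulation effect that plain keyframe interpolation misses.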

Functional Results of Soft Palate Defect Reconstruction using Radial Forearm Free Flap after Tonsil Cancer Surgery

  • 김민식;선동일;박해섭;조승호;제현순
    • 대한기관식도과학회지
    • /
    • Vol. 5, No. 2
    • /
    • pp.191-197
    • /
    • 1999
  • Background and Objective: The soft palate plays a major role in speech and swallowing. Ablation of tonsil cancer results in a multidimensional defect including the soft palate in most cases, and restoration of postoperative oral cavity function is a continuing surgical challenge. Although a variety of techniques are available, the radial forearm free flap is known as an effective method for these defects, offering thin, pliable, and relatively hairless skin and a long vascular pedicle. The aim of the present study is to report the speech and swallowing function test results of our 5 consecutive radial forearm free flaps used for tonsil cancers. Materials and Methods: We reviewed the medical records of 5 patients who underwent intraoral reconstruction with a radial forearm free flap after ablative surgery for tonsil cancer, from Dec. 1997 to Oct. 1998, and analyzed the surgical methods, complications, and speech and swallowing function test results. Postoperative swallowing function was evaluated with a modified barium swallow, and speech with articulation and resonance tests. Results: The tumor sizes by TNM stage (AJCC, 1997) were T1(1), T2(2), and T4(3). The flap paddles were tailored in multilobed designs, from oval to pentalobed, and in sizes from 24 cm$^2$ to 108 cm$^2$ (average, 78.4 cm$^2$), according to the defect after ablation. The procedure yielded satisfactory flap success and functional results in all but 1 case, which developed flap contracture in postoperative week 2; patients achieved an early oral diet by postoperative day 16-57 (average, 28 days) and social speech. Reconstruction of oropharyngeal defects including the soft palate with a radial forearm free flap may be an excellent method for maximizing functional results after ablative surgery of tonsil cancer that results in a multidimensional defect.

Comparison of Acoustic Phonetic Characteristics of Korean Fricative Sounds Pronounced by Hearing-impaired Children and Normal Children

  • 김윤하;김은연;장승진;최예린
    • 말소리와 음성과학
    • /
    • Vol. 6, No. 2
    • /
    • pp.73-79
    • /
    • 2014
  • The alveolar fricatives /s/ and /s'/ are the last sounds that normal children acquire in Korean speech development. They are especially difficult for hearing-impaired children to articulate and often cause articulation errors. Acoustic phonetic evaluation uses testing tools to provide indirect and objective information. These objective resources can be compared with standardized speech resources when interpreting the results of a test. However, most previous studies in Korea did not include acoustic analyses using the spectrum moment values of hearing-impaired children. Therefore, this study was conducted to compare the characteristics of hearing-impaired children's pronunciation of fricative sounds using spectrum moment values. For this purpose, the study selected a total of 10 hearing-impaired children (5 boys and 5 girls) currently in 3rd or 5th grade and attending one of the elementary schools in Seoul or Gyeonggi-do. For the selection process, their age, type of hearing aid, implantation of hearing aid (CI) before two years of age, hearing capacity (dB) before and after wearing the hearing aid, duration of speech rehabilitation, and time of learning alveolar fricative sounds were all considered. Also, 10 normal children (5 boys and 5 girls) were selected from among 3rd or 5th grade students attending one of the elementary schools in Seoul or Gyeonggi-do. The subjects were asked to read the carrier sentence, "I say _______," with a list of 12 meaningless syllables composed of CV and VCV syllables, including the alveolar fricatives /s/ and /s'/ and the vowels /a/, /i/, and /u/. The recordings were processed with the Time-frequency Analysis Software Program to measure M1 (mean), M2 (variance), M3 (skewness), and M4 (kurtosis) of the fricative noise. No significant differences were found when comparing spectrum threshold values in the acoustic phonetic characteristics of hearing-impaired children and normal children in alveolar fricative pronunciation according to the vowels /a/, /i/, and /u/, the alveolar fricatives /s/ and /s'/, and syllable structure (CV, VCV), other than for M3 in the comparison of groups according to disability. In the comparison of syllable structures, there were statistically significant differences in M1, M2, M3, and M4 with clinical significance. However, there was no significant difference in results when comparing the alveolar fricative sounds according to the vowels.
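
The four spectral moments named in the abstract above (M1 mean, M2 variance, M3 skewness, M4 kurtosis) can be computed by treating a normalized power spectrum as a probability distribution over frequency. The sketch below is a minimal illustration, not the software used in the study; the function name, the three-bin example spectrum, and the choice to report excess kurtosis (subtracting 3) are assumptions.

```python
import math

def spectral_moments(freqs_hz, power):
    """Return (M1, M2, M3, M4) of a power spectrum treated as a distribution."""
    total = sum(power)
    if total <= 0:
        raise ValueError("spectrum must contain positive power")
    weights = [p / total for p in power]  # normalize so the weights sum to 1

    # M1: power-weighted mean frequency (Hz)
    m1 = sum(f * w for f, w in zip(freqs_hz, weights))
    # M2: variance around the mean (Hz^2)
    m2 = sum((f - m1) ** 2 * w for f, w in zip(freqs_hz, weights))
    if m2 == 0:
        raise ValueError("all power is in one bin; M3 and M4 are undefined")
    # M3: skewness, the third central moment scaled by variance^(3/2)
    m3 = sum((f - m1) ** 3 * w for f, w in zip(freqs_hz, weights)) / math.pow(m2, 1.5)
    # M4: excess kurtosis (assumption: 3 is subtracted so a Gaussian gives 0)
    m4 = sum((f - m1) ** 4 * w for f, w in zip(freqs_hz, weights)) / (m2 ** 2) - 3.0
    return m1, m2, m3, m4

# Hypothetical three-bin spectrum, symmetric about 2000 Hz.
m1, m2, m3, m4 = spectral_moments([1000.0, 2000.0, 3000.0], [1.0, 2.0, 1.0])
print(m1, m2, m3, m4)
```

For a spectrum that is symmetric about its mean, as in this example, the skewness is zero; energy concentrated at high frequencies with a tail toward low frequencies gives a negative M3.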

Relationships between rhythm and fluency indices and listeners' ratings of Korean speakers' English paragraph reading

  • 정현성
    • 말소리와 음성과학
    • /
    • Vol. 14, No. 4
    • /
    • pp.25-33
    • /
    • 2022
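  • This study analyzes how rhythm and fluency indices relate to listeners' ratings of the rhythm and fluency of English speech read aloud by Korean university students. Seventeen university students read the paragraph 'The North Wind and the Sun' and were recorded twice, before and after an English pronunciation course, and seven in-service and pre-service English teachers enrolled in graduate school rated the recordings for rhythm and fluency. In addition, 14 of the rhythm and fluency indices mentioned in previous studies were extracted from each utterance, and the relationship between the indices and the listening ratings was analyzed. The index analysis showed that %V, VarcoV, and nPVIV patterned almost the same as the native English speaker values reported in previous studies, that ΔV, ΔC, and VarcoC were higher than those of native speakers, and that speech rate was slower than that of native speakers. Korean university students can therefore be said to pattern like native English speakers on some rhythm indices. A search for the optimal model for predicting rhythm and fluency rating scores from the indices showed that, for both rhythm and fluency ratings, fluency indices such as pause ratio, articulation rate, and speech rate contributed more to predicting the scores than the rhythm indices did.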
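
Several of the rhythm indices named in the abstract above (%V, ΔV, VarcoV, nPVI) have simple definitions over vocalic and consonantal interval durations. The sketch below follows the commonly cited definitions and is not the study's own extraction procedure; the function names, the example durations, and the use of the population standard deviation are assumptions.

```python
from statistics import mean, pstdev

def percent_v(vocalic, consonantal):
    """%V: share of total interval duration that is vocalic, in percent."""
    return 100.0 * sum(vocalic) / (sum(vocalic) + sum(consonantal))

def delta(intervals):
    """ΔV or ΔC: standard deviation of interval durations (population SD assumed)."""
    return pstdev(intervals)

def varco(intervals):
    """VarcoV or VarcoC: standard deviation normalized by the mean, times 100."""
    return 100.0 * pstdev(intervals) / mean(intervals)

def npvi(intervals):
    """nPVI: mean normalized difference between successive intervals, times 100."""
    diffs = [abs(a - b) / ((a + b) / 2.0) for a, b in zip(intervals, intervals[1:])]
    return 100.0 * mean(diffs)

# Hypothetical interval durations in seconds.
vocalic = [0.10, 0.20, 0.10, 0.20]
consonantal = [0.10, 0.10, 0.10, 0.10]

print(f"%V     = {percent_v(vocalic, consonantal):.1f}")
print(f"ΔV     = {delta(vocalic):.3f}")
print(f"VarcoV = {varco(vocalic):.1f}")
print(f"nPVI-V = {npvi(vocalic):.1f}")
```

Varco and nPVI divide by the local or overall mean duration, so they are less sensitive to speech rate than the raw Δ measures.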