• Title/Summary/Keyword: vowel context

Search Result 33, Processing Time 0.021 seconds

Effects of vowel types and sentence positions in standard passage on auditory and cepstral and spectral measures in patients with voice disorders (모음 유형과 표준문단의 문장 위치가 음성장애 환자의 청지각적 및 켑스트럼 및 스펙트럼 분석에 미치는 효과)

  • Mi-Hyeon Choi;Seong Hee Choi
    • Phonetics and Speech Sciences
    • /
    • v.15 no.4
    • /
    • pp.81-90
    • /
    • 2023
  • Auditory perceptual assessment and acoustic analysis are commonly used in clinical practice for voice evaluation. This study aims to explore the effects of speech task context on auditory perceptual assessment and acoustic measures in patients with voice disorders. Sustained vowel phonations (/a/, /e/, /i/, /o/, /u/, /ɯ/, /ʌ/) and connected speech (a standardized paragraph 'kaeul' and nine sub-sentences) were obtained from a total of 22 patients with voice disorders. GRBAS ('G', 'R', 'B', 'A', 'S') and CAPE-V ('OS', 'R', 'B', 'S', 'P', 'L') auditory-perceptual assessment were evaluated by two certified speech language pathologists specializing in voice disorders using blind and random voice samples. Additionally, spectral and cepstral measures were analyzed using the analysis of dysphonia in speech and voice model (ADSV).When assessing voice quality with the GRBAS scale, it was not significantly affected by the vowel type except for 'B', while the 'OS', 'R' and 'B' in CAPE-V were affected by the vowel type (p<.05). In addition, measurements of CPP and L/H ratio were influenced by vowel types and sentence positions. CPP values in the standard paragraph showed significant negative correlations with all vowels, with the highest correlation observed for /e/ vowel (r=-.739). The CPP of the second sentence had the strongest correlation with all vowels. Depending on the speech stimulus, CAPE-V may have a greater impact on auditory-perceptual assessment than GRBAS, vowel types and sentence position with consonants influenced the 'B' scale, CPP, and L/H ratio. When using vowels in the voice assessment of patients with voice disorders, it would be beneficial to use not only /a/, but also the vowel /i/, which is acoustically highly correlated with 'breathy'. In addition, the /e/ vowel was highly correlated acoustically with the standardized passage and sub-sentences. Furthermore, given that most dysphonic signals are aperiodic, 2nd sentence of the 'kaeul' passage, which is the most acoustically correlated with all vowels, can be used with CPP. These results provide clinical evidence of the impact of speech tasks on auditory perceptual and acoustic measures, which may help to provide guidelines for voice evaluation in patients with voice disorders.

The Effect of Phonetic Contexts on Nasalance Score for Normal Adults (음운 환경이 정상 성인의 비음치에 미치는 영향)

  • 김민정;심현섭
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.10 no.2
    • /
    • pp.97-101
    • /
    • 1999
  • The nasalance score measured by Nasometer is a supplementary data for the perceptually rated nasality by a trained speech pathologist. Because the nasalance score varies with speech material, a valid and reliable material should be developed for evaluating it. The objectives of the present study were (1) to examine whether phonetic contexts affect the nasalace score and (2) to examine the reliability of both meaningless one-syllable words and meaningful sentences. This study analyzed nasalance score in 20 different phonetic contexts from 24 normal adults. The results showed (1) nasalance score increased as the percentage of nasal consonants and vowel /i/ increased, (2) the manner and the place of articulation in oral consonants did not influence the nasalance score, and (3) in nasalance score, correlation between sentences was found to be high, but correlation between syllables was not. These results may indicate that, when preparing the speech material for measuring the nasalance score, it is important to consider not only the percentage of nasal consonants but also that of vowel /i/ in the speech material. In addition, the sentence is more reliable material than meaningless one-syllable words.

  • PDF

Perception and production of English fricatives by Chinese learners of English: Error patterns and perception-production relationship

  • Zhang, Buyi;Zhang, Jiaqi;Lee, Sook-hyang
    • Phonetics and Speech Sciences
    • /
    • v.13 no.1
    • /
    • pp.25-36
    • /
    • 2021
  • This study examined the perception and production of eight English fricatives /f/, /v/, /θ/, /ð/, /s/, /z/, /ʃ/, and /ʒ/ by thirty Chinese English majors and thirty Chinese middle school students through a fricative identification test, an intelligibility test, and a goodness rating test and focused on error patterns and the perception-production relationship. The results showed that substitution errors occurred frequently in the perception and production of English fricatives by both the English majors and the middle school students. Further, the error patterns were attributed to various influencing factors such as the negative transfer from Chinese consonant inventory, hypercorrection or overcompensation mistakes, deficiency of L2 teaching, and acoustic similarities. Significant overall correlations were found between the fricative perception and production by the two subject groups but were not manifested in all the eight fricatives, indicating that Chinese learners' perceptual competence of target fricatives was not necessarily tied to their productive excellence of those sounds in all cases. Furthermore, precedences of perception over production were incompletely manifested in the eight fricatives, which suggested that perception might not always be a necessary prerequisite for production. Additionally, subject group and vowel context differences were observed. The English majors performed better than the middle school students, both perceptually and productively, and the subjects' performances in perception and production varied when vowel contexts changed.

Lip-Synch System Optimization Using Class Dependent SCHMM (클래스 종속 반연속 HMM을 이용한 립싱크 시스템 최적화)

  • Lee, Sung-Hee;Park, Jun-Ho;Ko, Han-Seok
    • The Journal of the Acoustical Society of Korea
    • /
    • v.25 no.7
    • /
    • pp.312-318
    • /
    • 2006
  • The conventional lip-synch system has a two-step process, speech segmentation and recognition. However, the difficulty of speech segmentation procedure and the inaccuracy of training data set due to the segmentation lead to a significant Performance degradation in the system. To cope with that, the connected vowel recognition method using Head-Body-Tail (HBT) model is proposed. The HBT model which is appropriate for handling relatively small sized vocabulary tasks reflects co-articulation effect efficiently. Moreover the 7 vowels are merged into 3 classes having similar lip shape while the system is optimized by employing a class dependent SCHMM structure. Additionally in both end sides of each word which has large variations, 8 components Gaussian mixture model is directly used to improve the ability of representation. Though the proposed method reveals similar performance with respect to the CHMM based on the HBT structure. the number of parameters is reduced by 33.92%. This reduction makes it a computationally efficient method enabling real time operation.

The implementation of Korean adult's optimal formant setting by Praat scripting (성인 포먼트 측정에서의 최적 세팅 구현: Praat software와 관련하여)

  • Park, Jiyeon;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.11 no.4
    • /
    • pp.97-108
    • /
    • 2019
  • An automated Praat script was implemented to measure optimal formant frequencies for adults. Optimal formant analysis could be interpreted to show that the deviation of formant frequency that resulted from the two variously combined setting parameters (maximum formant and number of formants) was minimal. To increase the reliability of formant analysis, LPC order should be set differently, based on the gender or vowel type. Praat recommends 5,000 Hz and 5,500 Hz as maximum formant settings and, at the same time, recommends 5 as the number of formants for males and females. However, verification is needed to determine whether these recommended settings are valid for Korean vowels. Statistical analysis showed that formant frequencies significantly varied across the adapted scripts, especially with respect to the data on females. Formant plots and statistical results showed that linear_script and qtone_script are much more reliable in formant measurements. Among four kinds of scripts, the linear and qtone_scripts proved to be more stable and reliable. While the linear_script was designed to have a linearly increased formant step in for-loop, the increment of formant step in the qtone_script was arranged by quarter tone scale (base frequency×common ratio ($\sqrt[24]{2}$)). When looking at the tendency of the formant setting drawn by the two referred algorithms in the context of front vowel [i, e], the maximum formant was set higher; and the number of formants set at a lower value than recommended by Praat. The back vowel [o, u], on the contrary, has a lower maximum formant and a higher number of formants than the standard setting.

Normalized gestural overlap measures and spatial properties of lingual movements in Korean non-assimilating contexts

  • Son, Minjung
    • Phonetics and Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.31-38
    • /
    • 2019
  • The current electromagnetic articulography study analyzes several articulatory measures and examines whether, and if so, how they are interconnected, with a focus on cluster types and an additional consideration of speech rates and morphosyntactic contexts. Using articulatory data on non-assimilating contexts from three Seoul-Korean speakers, we examine how speaker-dependent gestural overlap between C1 and C2 in a low vowel context (/a/-to-/a/) and their resulting intergestural coordination are realized. Examining three C1C2 sequences (/k(#)t/, /k(#)p/, and /p(#)t/), we found that three normalized gestural overlap measures (movement onset lag, constriction onset lag, and constriction plateau lag) were correlated with one another for all speakers. Limiting the scope of analysis to C1 velar stop (/k(#)t/ and /k(#)p/), the results are recapitulated as follows. First, for two speakers (K1 and K3), i) longer normalized constriction plateau lags (i.e., less gestural overlap) were observed in the pre-/t/ context, compared to the pre-/p/ (/k(#)t/>/k(#)p/), ii) the tongue dorsum at the constriction offset of C1 in the pre-/t/ contexts was more anterior, and iii) these two variables are correlated. Second, the three speakers consistently showed greater horizontal distance between the vertical tongue dorsum and the vertical tongue tip position in /k(#)t/ sequences when it was measured at the time of constriction onset of C2 (/k(#)t/>/k(#)p/): the tongue tip completed its constriction onset by extending further forward in the pre-/t/ contexts than the uncontrolled tongue tip articulator in the pre-/p/ contexts (/k(#)t/>/k(#)p/). Finally, most speakers demonstrated less variability in the horizontal distance of the lingual-lingual sequences, which were taken as the active articulators (/k(#)t/=/k(#)p/ for K1; /k(#)t/

Acoustic Characteristics of Patients with Total Laryngectomees via Voice Rehabilitation Techniques (후두적출술 환자의 발성법에 따른 음향학적 특성)

  • Jang, Hyo-Ryung;Shim, Hee-Jeong;Ko, Do-Heung
    • Phonetics and Speech Sciences
    • /
    • v.5 no.4
    • /
    • pp.25-32
    • /
    • 2013
  • This research is aimed at finding the acoustic characteristics of different voice rehabilitation techniques, the electrolaryx (EL), standard esophageal (SE), and tracheoesophageal (TE), used on 17 patients with laryngectomees. The analysis of the voice qualities was achieved using MDVP. In order to compare the acoustic characteristics, patients were asked to produce the vowel /a/ sound. The acoustic analysis included fundamental frequency (f0), jitter, shimmer, and noise-to-harmonic ratio (NHR). The main acoustic results showed no significant statistical differences between the average measurements of SE and TE speakers. It was found that the current study showed the same tendency found in previous studies. There was also a significant difference between SE and EL speakers. On the other hand, there were no significant statistical differences between the average measurements of TE and EL speakers on all acoustic measurements. This research will contribute to establishing a baseline related to speech characteristics in voice rehabilitation for patients with laryngectomees. In future, the present findings and issues should be considered in the context of gender. Specifically, the number of women who are diagnosed with laryngeal cancer continues to rise and their acoustic characteristics may indeed differ from those of men.

Perception of Spanish $/{\setminus}/$ - /r/ distinction by native Japanese

  • Mignelina Guirao Jorge A. Gurlekian;Maria A. Garcia Jurado
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.337-342
    • /
    • 1996
  • In prevoius works we have repored phonetic similarities between Japanese and Spanish voweis and syiiabic sounds. (1) (2) (3) (4). In the present communication we explore the relative importance of duration of the consonantal segment to elicit Spanish /l/ - /r/ distinction by native j Japanese talkers. Three Argentine and three trained native Japanese talkers recorded /l-r/ combined with /a/ in VCV sequences. Modifications of consonant duration and vowel context with transitions were m made by editing natural /ala/ sounds. Mixed VCV were produced by combining sounds of both languages. Perceptual tests were produced by combining sounds of both languages perceptual performed presenting the speech material, to native t trained and non trained Japanese listeners. In a tirst sessIOn a d discrimination procedure was applied. The items were arranged in pairs a and listeners Nere told to indicate the pair that sounded different. In the f following session they were asked to identify and type the letter corresponding to each one of the items. Responses arc examined in tenns of critical duration of the interval between vowels. Preliminary results indicate that the duration of intervocalic intervais was a relevant cue for the identification of /l/ and /r/. It seems that to differentiate the two sounds, Japanese listeners required relatively longer interval steps than the argentine suhjects. There was a tendency to conhlse more frequently /l/ for /r/ than viceversa.

  • PDF

A study of /l/ velarization in American English based on the Buckeye Corpus (벅아이 코퍼스를 이용한 미국 영어의 /l/ 연구개음화 연구)

  • Sa, Jae-Jin
    • Phonetics and Speech Sciences
    • /
    • v.13 no.2
    • /
    • pp.19-25
    • /
    • 2021
  • It has been widely recognized that there are two varieties of lateral liquid /l/, which are light /l/ (a non-velarized allophone) and dark /l/ (a velarized allophone). However, this categorical view has been challenged in recent studies, both on articulatory and acoustic aspects. The purpose of this study is to investigate whether to consider /l/ velarization as a continuum in American English and provide supporting data. A spontaneous American English speech database called the Buckeye Speech Corpus was used for the material. The formant frequencies of /l/ in each syllable position were measured and analyzed statistically. The formant frequencies of /l/ in each syllable position, especially F2 values, were significantly different from each other. The results showed that there were other significantly different varieties of /l/ in American English, which support the continuum view on /l/ velarization. Regarding the effect of the adjacent vowel, the backness of the adjacent vowels was shown to affect the degree of /l/ velarization, regardless of the syllable position of the lateral liquid. This result will help provide a solid ground for the continuum view.

Speech Stimuli on the Diagnostic Evaluation of Speech with Cleft Lip and Palate : Clinical Use and Literature Review (구개열 환자 말 평가 시 검사어에 대한 고찰 : 임상현장의 말 평가 어음자료와 문헌적 고찰을 중심으로)

  • Choi, Seong-Hee;Choi, Jae-Nam;Nam, Do-Hyun;Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.16 no.1
    • /
    • pp.33-48
    • /
    • 2005
  • Differential diagnosis of articulation and resonance problems in the cleft lip and palate speech is required for evaluating various factors contribute to speech problems such as VPI, dental occlusion, palatal fistulae, learning. However, validity of speech stimuli is current issue to evaluate accurately each problem in cleft speech. This study was conducted to investigate speech stimuli using in the clinical setting and review the literatures and articles published 1990 to 2005 for helping develop standardized speech samples. The results were recommendation to evaluate properly velopharyngeal function when conducting a diagnostic evaluation as follows : 1) In identification hypernasality, the speech stimuli should be included low pressure consonants to eliminate effects of nasal emission, compensatory articulation. 2) Speech stimuli should be consist of visual, front sounds to eliminate compensatory articulation and to stimulate easily. 3) Regarding early diagnosis and treatment, speech stimuli need to develop for infants and preschooler. 4) Stimulus length on nasalance scores should be at least 6 syllables. 5) In phonetic context on nasalance scores, /i/ vowel should be take into consideration excluding paragraph. 6) Connected speech stimuli should be developed for evaluating intelligibility and VP function.

  • PDF