• Title/Summary/Keyword: articulatory features

Search Result 17, Processing Time 0.023 seconds

Linguistic Features of Spontaneous Speech Production in Normal Aging, Alzheimer's Disease (정상 노인과 알츠하이머성 치매 환자의 자발화 산출에서의 언어적 특징)

  • Kim, Jung Wan
    • 한국노년학
    • /
    • v.32 no.3
    • /
    • pp.747-758
    • /
    • 2012
  • Detecting probable Alzheimer's disease (AD) at an early stage is crucial in slowing the progression of the disease and initiating drug therapy for more effective symptom management. Therefore, this study aimed to identify linguistic features that allow us to distinguish between patients with AD and normal controls. This paper reports on characteristics of spontaneous speech in subjects in three stages of AD (questionable, mild, moderate) compared with education- and age-matched normal controls. Four components of speech were measured in Korean native speakers with AD and normal aging: speech tempo, hesitation (measured in seconds), rate of articulation errors, and rate of grammatical errors. The results revealed significant differences in most of these speech components among the four groups, including significant differences between normal controls and the questionable AD group in the areas of speech tempo and rate of grammatical errors. Phonological? articulatory ability was preserved in questionable AD, and grammatical ability was preserved in questionable and mild AD. Subjects with moderate AD were severely impaired in grammatical ability. Prospective assessments of spontaneous speech skills using a dialogue and picture-description task are useful in detecting the subtle, spontaneous speech impairments that AD causes even in its early stage.

Formant Trajectories of English Vowels Produced by American Children (미국인 아동이 발음한 영어모음의 포먼트 궤적)

  • Yang, Byung-Gon
    • Phonetics and Speech Sciences
    • /
    • v.3 no.1
    • /
    • pp.23-34
    • /
    • 2011
  • Many Korean children have difficulty learning English vowels. The gestures inside the oral and pharyngeal cavities are hard to control when they cannot see the gestures and the target vowel system is quite different from that of Korean. This study attempts to collect children's acoustic data of twelve English vowels published by Hillenbrand et al. (1995) online and to examine the acoustic features of English vowels for phoneticians and English teachers. The author used Praat to obtain the data systematically at six equidistant timepoints over the vowel segment avoiding any obvious errors. Results show inherent acoustic properties for vowels from the children's distribution of vowel duration, f0 and intensity values. Second, children's gestures for each vowel coincide with the regression analysis of all formant values at different timepoints regardless of the vocal fold and tract difference. Third, locus points appear higher than those of American males and females. Their gestures along the timepoints display almost similar patterns. From the results the author concludes that vowel formant trajectories provide useful and important information on dynamic articulatory gestures, which may be applicable to Korean children's education and correction of English vowels. Further studies on the developmental study of vowel formants and pitch values are desirable.

  • PDF

Speech Rate and the Acoustic Features of Korean Segments (발화속도와 한국어 분절음의 음향학적 특성)

  • 이숙향;고현주
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.2
    • /
    • pp.162-172
    • /
    • 2004
  • This study investigates the following three things through a production experiment and acoustic analysis: 1) relationship between speech rate and the segment duration in Korean, 2) relationship between speech rate and spectral characteristics of vowels, i. e. undershoot, and 3) correlation between the vowel duration and undershoot. The results showed that the faster the speech rate nab, the shorter the duration of syllables and segments was. A few speakers were affected by speech rate in the durational ratios between closure and aspiration in a stop and between Towel and consonant in a syllable. Closure duration and vowel duration were more affected compared to aspiration and consonant duration, respectively. Speakers showed some differences in the extent to which speech rate affected vowel undershoot, implying that speakers used different production mechanisms for spectral characteristics of vowels: Some speakers speeded up movement of articulatory organs according to speech rate increase while some kept it constant regardless of speech rate change.

Speech synthesis using acoustic Doppler signal (초음파 도플러 신호를 이용한 음성 합성)

  • Lee, Ki-Seung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.35 no.2
    • /
    • pp.134-142
    • /
    • 2016
  • In this paper, a method synthesizing speech signal using the 40 kHz ultrasonic signals reflected from the articulatory muscles was introduced and performance was evaluated. When the ultrasound signals are radiated to articulating face, the Doppler effects caused by movements of lips, jaw, and chin observed. The signals that have different frequencies from that of the transmitted signals are found in the received signals. These ADS (Acoustic-Doppler Signals) were used for estimating of the speech parameters in this study. Prior to synthesizing speech signal, a quantitative correlation analysis between ADS and speech signals was carried out on each frequency bin. According to the results, the feasibility of the ADS-based speech synthesis was validated. ADS-to-speech transformation was achieved by the joint Gaussian mixture model-based conversion rules. The experimental results from the 5 subjects showed that filter bank energy and LPC (Linear Predictive Coefficient) cepstrum coefficients are the optimal features for ADS, and speech, respectively. In the subjective evaluation where synthesized speech signals were obtained using the excitation sources extracted from original speech signals, it was confirmed that the ADS-to-speech conversion method yielded 72.2 % average recognition rates.

Lip and Voice Synchronization Using Visual Attention (시각적 어텐션을 활용한 입술과 목소리의 동기화 연구)

  • Dongryun Yoon;Hyeonjoong Cho
    • The Transactions of the Korea Information Processing Society
    • /
    • v.13 no.4
    • /
    • pp.166-173
    • /
    • 2024
  • This study explores lip-sync detection, focusing on the synchronization between lip movements and voices in videos. Typically, lip-sync detection techniques involve cropping the facial area of a given video, utilizing the lower half of the cropped box as input for the visual encoder to extract visual features. To enhance the emphasis on the articulatory region of lips for more accurate lip-sync detection, we propose utilizing a pre-trained visual attention-based encoder. The Visual Transformer Pooling (VTP) module is employed as the visual encoder, originally designed for the lip-reading task, predicting the script based solely on visual information without audio. Our experimental results demonstrate that, despite having fewer learning parameters, our proposed method outperforms the latest model, VocaList, on the LRS2 dataset, achieving a lip-sync detection accuracy of 94.5% based on five context frames. Moreover, our approach exhibits an approximately 8% superiority over VocaList in lip-sync detection accuracy, even on an untrained dataset, Acappella.

The Role of the Electroglottography on the Laryngeal Articulation of Speech (전기 Glottography(EGG)를 이용한 후두구음역학적 특성)

  • 홍기환;박병암;양윤수;서수영;김현기
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.8 no.1
    • /
    • pp.18-26
    • /
    • 1997
  • There are two types of phonetic study, acoustic and physiologic, for differentiating the three manner categories of Korean stop consonants. On the physiologic studies, there are endoscopic, electromyographic(EMG), electroglottographic(EGG) and aerodynamic studies. In this study, I tried to investigate general features of Korean stops using EGG study for the open quotient of vocal fold and baseline shift during speech, and aerodynamic characteristics for e subglottal air pressure, air flow and glottal resistance at consonants. On the aerodynamic study, the glottalized and aspirated stops may be characterized by e increasing subglottal pressure comparing with lenis stop at consonants. The airflow is largest in the aspirated stops followed by lenis stops and glottalized. The glottal airway resistance (GAR) showed highest in the glottalized followed by the lenis, but lowest in e aspirated during e production of consonants, and showed highest in e aspirated, but low in the glottalized and lenis during the production of vowel. The glottal resistance at consonant showed significant difference among consonants and significant interaction between subject and types of consonant. The glottal resistance at vowel showed significant difference among consonants, and e interaction occured between subject and types of consonant. The electroglottography(EGG) has been used for investigating e functioning of e vocal folds during its vibration. The EGG should be related to the patterns of the vocal fold vibration during phonation in characterizing the temporal patterns of each vibratory cycle. The purpose of this study is to investigate the dynamic change of EGG waveforms during continuous speech. The dynamic changes of EGG waveforms fir the three-way distinction of Korean stops were characterized that the aspirated stop appears to be characterized by largest open quotient and smallest glottal contact area of the vocal folds in e initial portion of vocal fold vibration ; the lenis stop by moderate open quotient and glottal contact area ; but the glottalized stop by smallest open quotient and largest glottal contact area. There may be close relationship between the OQ(open quotient) in the initial voice onset and the glottal width at the time of consonant production, the larger glottal width just before vocal fold vibration results in the smaller OQ of the vocal fold vibration in the initial voice onset. The EGG changes of baseline shift during continuous speech production were characterized by the different patterns for the three types of Korean consonants. The small and less stiffness change of baseline shift was found for the lenis and the glottalized, and the largest and stiffest change was found for the aspirated. On the baseline shift for the initial voice onset, they showed so similar patterns with for the consonant production, larger changed in the aspirated. for the lenis and the glottalized during the initial voice onset, three subjects showed individual difference each other. I suggest at s characteristics were strongly related with articulatory activity of vocal tract for the production of consonant, especially for the aspirated stop. The suspecting factors to affect EGG waveforms are glottal width, vertical laryngeal movement and the intrapharyngeal pressure to neighboring tissue during connected spech. So the EGG may be an useful method to describe laryngeal activity to classify pulsing conditions of the larynx during speech production, and EGG research can be controls for monitoring the vocal tract articulation, although above factors to affect EGG would have played such a potentially role on vocal fold vibratory behavior obtained using consonant production.

  • PDF

Fiberscopic and Electromyograpic Study on Laryngeal Adjustments for Syllable-final Applosives in Korean (한국어의 음절말 내파음의 후두조절 -화이비스코프 및 근전도에 의한 관찰-)

  • Park, Hea-Suk
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.16 no.1
    • /
    • pp.53-67
    • /
    • 2005
  • It is known that Korean stop consonants in syllable-initial position are of three types : lax, aspirated and forced (or unaspirated). In syllable-final position, however, these three different types are merged to a single type with the same place of articulation, although the original three-way distinction is preserved in Korean orthographic (Hangul) system. Thus the syllable-final stops are phonetically realized as voiceless "applosives" which are characterized by the absence of oral release. The aim of the present study is to investigate the laryngeal adjustments for these syllable-final stops in various phonological conditions by using fiberscope, and, is to further investigate electromyographically the laryngeal adjustments for Korean stops both in the syllable-initial and final positions in various phonological conditions. The results can be summarized as follows : 1. In the case of syllable-initial stops, the glottal widths in each three types of the Korean stops during the articulatory closure are clearly different. And the pattern of thyroarytenoid(VOC) activity appeared to characterize the three different types of Korean stops. 2. The basic laryngeal feature of the Korean syllable-final applosives is characterized by a small degree of glottal opening which begins at or slightly after the oral closure. 3. In the case, syllable-final stop followed by the copula "ita", the syllable- final stop is pronounced as the stop consonant at the initial position of the following syllable containing the vowel[i], the underlying features of three-way distinction for the stops in the Korean orthographic(Hangul) system being manifested at the laryngeal adjustment. 4. In the case of the final applosives followed by the initial stops and fricatives, the laryngeal feature of the final applosives appears to be assimilated by that of the following consonant irrespective of the difference in the place of articulation, as far as the glottal abduction/adduction is concerned. It is clearly demonstrated in the case of syllable-initial stop that thyoarytenoid(VOC) activity is suppressed for the production of the stop consonants in question, the degree of which is slightest for the forced type and most marked for the aspirated type, while it is moderate for the lax type.

  • PDF