• Title/Summary/Keyword: Glottal

Search Result 138, Processing Time 0.021 seconds

Glottal Characteristics of Word-initial Vowels in the Prosodic Boundary: Acoustic Correlates (운율경계에 위치한 어두 모음의 성문 특성: 음향적 상관성을 중심으로)

  • Sohn, Hyang-Sook
    • Phonetics and Speech Sciences
    • /
    • v.2 no.3
    • /
    • pp.47-63
    • /
    • 2010
  • This study provides a description of the glottal characteristics of the word-initial low vowels /a, $\ae$/ in terms of a set of acoustic parameters and discusses glottal configuration as their acoustic correlates. Furthermore, it examines the effect of prosodic boundary on the glottal properties of the vowels, seeking an account of the possible role of prosodic structure based on prosodic theory. Acoustic parameters reported to indicate glottal characteristics were obtained from the measurements made directly from the speech spectrum on recordings of Korean and English collected from 45 speakers. They consist of two separate groups of native Korean and native English speakers, each including both male and female speakers. Based on the three acoustic parameters of open quotient (OQ), first-formant bandwidth (B1), and spectral tilt (ST), comparisons were made between the speech of males and females, between the speech of native Korean and native English speakers, and between Korean and English produced by native Korean speakers. Acoustic analysis of the experimental data indicates that some or all glottal parameters play a crucial role in differentiating the speech groups, despite substantial interspeaker variations. Statistical analysis of the Korean data indicates prosodic strengthening with respect to the acoustic parameters B1 and OQ, suggesting acoustic enhancement in terms of the degree of glottal abduction and the glottal closure during a vibratory cycle.

  • PDF

Glottal Area and Voice Onset Time

  • Kim, Dae-Won
    • MALSORI
    • /
    • no.15_18
    • /
    • pp.19-34
    • /
    • 1989
  • There is general agreement that voice onset time (VOT) is functionally related with the glottal opening at the moment of the oral release of a stop. However, systematic investigations of tempo 8n4 the place of articulation as affecting the glottal opening and VOT have relatively neglected. Various instrumental techniques were used to verify the claim with BrEng and korean speakers, under controlled experimental conditions, tempo being one of them. It was found that voiceless aspiration (i.e. VOT) is not simply a function of the glottal area at the moment of the oral release of a stop as it is normally defined in the existing literature. Within a given place of articulation and across temper VOT was generally insignificantly related to the glottal area. It is inferred that the glottal adduction onset time for the following vowel is actively control led by the speaker to meet aerodynamic requirements in relation to class (i.e. aspirated and unaspirated) and tempo. Some possible underlying physiological mechanisms for various phonetic aspects of intervocalic stops, associated with the glottal area and VOT, were discussed.

  • PDF

Glottal Weighted Cepstrum for Robust Speech Recognition (잡음에 강한 음성 인식을 위한 성문 가중 켑스트럼에 관한 연구)

  • 전선도;강철호
    • The Journal of the Acoustical Society of Korea
    • /
    • v.18 no.5
    • /
    • pp.78-82
    • /
    • 1999
  • This paper is a study on weighted cepstrum used broadly for robust speech recognition. Especially, we propose the weighted function of asymmetric glottal pulse shape. which is used for weighted cepstrum extracted by PLP(Perceptual Linear Predictive) based on auditory model. Also, we analyze this glottal weighted cepstrum from the glottal pulse of glottal model in connection with the cepstrum. And we obtain speech features analyzed by both the glottal model and the auditory model. The isolated-word recognition rate is adopted for the test of proposed method in the car moise and street environment. And the performance of glottal weighted cepstrum is compared with both that of weighted cepstrum extracted by LP(Linear Prediction) and that of weighted cepstrum extracted by PLP. The result of computer simulation shows that recognition rate of the proposed glottal weighted cepstrum is better than those of other weighted cepstrums.

  • PDF

Effect of Glottal Wave Shape on the Vowel Phoneme Synthesis (성문파형이 모음음소합성에 미치는 영향)

  • 안점영;김명기
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.10 no.4
    • /
    • pp.159-167
    • /
    • 1985
  • It was demonstrated that the glottal waves are different depending on a kind of vowels in deriving the glottal waves directly from Korean vowels/a, e, I, o, u/ w, ch are recorded by a male speaker. After resynthesizing vowels with five simulated glottal waves, the effects of glottal wave shape on the speech synthesis were compared with in terms of waveform. Some changes could be seen in the waveforms of the synthetic vowels with the variation of the shape, opening time and closing time, therefore it was confirmed that in the speech sysnthesis, the glottal wave shape is an important factor in the improvement of the speech quality.

  • PDF

The Comparison of Glottal Area Waveform between Normal Person and Patient with Vocal Lesions (정상인과 후두질환 환자에서 Glottal Area Waveform의 차이에 관한 연구)

  • Yoo, Young-Sam;Rosen, Clark A.
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.16 no.1
    • /
    • pp.5-9
    • /
    • 2005
  • Background and Objectives : Glottal area waveform(GAW) shows the plot of glottal area versus time through the 1 cycle. This study is designed to see how it can be applied to laryngeal patients. Material and Methods : A GAW analysis was peformed on 11 patients before and after surgery and 22 persons without laryngeal problems were recruited for control. Acoustic and aerodynamic analysis with VHI(voice handicap index) evaluation were performed. Results. Significant changes in baseline offset, gross closing rate and VHI were observed postoperatively. Other parameters were changed but insignificant. Conclusion : GAW maybe used to evaluate patient with voice problems.

  • PDF

A Study on Glottal Spectrum Analysis According to the Distance between the Microphone and the lips (Microphone 거리에 따른 Glottal Spectrum 성분 분석에 관한 연구)

  • Park Hyunyoung;Jang Kyunga;Bae Myungjin
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.65-68
    • /
    • 2002
  • 현재 음성인식기는 다 채널의 음성입력방식을 사용하고 있는 추세이다. 이런 방법으로 음성인식기를 사용할 때에 자동적으로 음성을 검출하는 음성입력 방식은 발성자와 마이크간의 거리에 따라 Glottal Spectrum 성분이 변하는 특성을 가지고 있다. 이러한 Glottal Spectrum 성분은 a=R1/R0 (LPC 포락선의 기울기) 로 나타낼 수 있다. 본 논문에서는 발성자와 마이크 거리에 따른 Glottal Spectrum 성분을 비교 분석 하고자 한다.

  • PDF

Treatment of a Case with Dysphonia Due to Large Posterior Glottal Chink Using Arytenoid Adduction and Type I Thyroplasty (피열연골내전술과 제1형 갑상성형술을 이용한 성문후부부전에 의한 발성장애의 치험 1례)

  • 최홍식;최재진;조정일;김광문
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.6 no.1
    • /
    • pp.39-42
    • /
    • 1995
  • Surgical treatment options of symptomatic unilateral vocal fold paralysis are Teflon injection, type Ⅰ thyroplasty, and arytenoid adduction. Arytenoid adduction is preferable to type Ⅰ thyroplasty for correcting the level different that may be present between two vocal folds and the large glottal chink However there is no known therapeutic modality effective to correct the large posterior glottal chink of the vocal fold with relatively normal mobility. Recently we have experienced a case of severe large posterior glottal chink of the vocal 1314s with relatively normal mobility after thyroid lobectomy, successfully treated with type Ⅰ thyroplasty combined with arytenoid adduction.

  • PDF

Quantitative Measurement of the Glottal Area Waveform(GAW) in Unilateral Vocal Fold Paralysis (편측성대마비환자에서의 성문면적파형(Glottal Area Waveform)의 정량적 측정)

  • 최홍식;김명상;최재영;안성윤;이세영;홍정표
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.9 no.1
    • /
    • pp.71-78
    • /
    • 1998
  • Type Ⅰ thyuroplasty in conjunction with arytenoid adduction is one of the excellent techniques in the treatment of unilateral vocal fold paralysis. But perioperative objective evaluation of the patients is difficult. With the development of the videostroboscopy and image analysis program, we could quantify the Glottal Area Waveform(GAW) in patients with unilateral vocal fold paralysis and investigated the relationship between the glottal area and aerodynamic and acoustic parameters. Eight female patients who were performed type Ⅰ thyroplasty in conjunction with arytenoid adduction and 5 females with normal vocal function were involved in this study. Preoperative and postoperative videostroboscopy and vocal function study wire performed. GAW was analysed quantitatively with image analysis program (Kay Stroboscope Image analysis, KSIP) Peak Glottal Area(PGA), Baseline Offset(BO), and Closing Phase(CP) were increased in patients with unilateral vocal fold paralysis and they were reduced after the operation. Mean flow Rate (MFR) was well correlated with the PGA in normal control group and unilateral vocal fold paralysis patients. Noise to harmonic ratio(NHR) was correlated with PGA only in preoperative unilateral vocal fold paralysis patients. In conclusion quantitative measurement of the GAW is useful method in evaluation of unilateral vocal f31d paralysis patients.

  • PDF

The Role of the Electroglottography on the Laryngeal Articulation of Speech (전기 Glottography(EGG)를 이용한 후두구음역학적 특성)

  • 홍기환;박병암;양윤수;서수영;김현기
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.8 no.1
    • /
    • pp.18-26
    • /
    • 1997
  • There are two types of phonetic study, acoustic and physiologic, for differentiating the three manner categories of Korean stop consonants. On the physiologic studies, there are endoscopic, electromyographic(EMG), electroglottographic(EGG) and aerodynamic studies. In this study, I tried to investigate general features of Korean stops using EGG study for the open quotient of vocal fold and baseline shift during speech, and aerodynamic characteristics for e subglottal air pressure, air flow and glottal resistance at consonants. On the aerodynamic study, the glottalized and aspirated stops may be characterized by e increasing subglottal pressure comparing with lenis stop at consonants. The airflow is largest in the aspirated stops followed by lenis stops and glottalized. The glottal airway resistance (GAR) showed highest in the glottalized followed by the lenis, but lowest in e aspirated during e production of consonants, and showed highest in e aspirated, but low in the glottalized and lenis during the production of vowel. The glottal resistance at consonant showed significant difference among consonants and significant interaction between subject and types of consonant. The glottal resistance at vowel showed significant difference among consonants, and e interaction occured between subject and types of consonant. The electroglottography(EGG) has been used for investigating e functioning of e vocal folds during its vibration. The EGG should be related to the patterns of the vocal fold vibration during phonation in characterizing the temporal patterns of each vibratory cycle. The purpose of this study is to investigate the dynamic change of EGG waveforms during continuous speech. The dynamic changes of EGG waveforms fir the three-way distinction of Korean stops were characterized that the aspirated stop appears to be characterized by largest open quotient and smallest glottal contact area of the vocal folds in e initial portion of vocal fold vibration ; the lenis stop by moderate open quotient and glottal contact area ; but the glottalized stop by smallest open quotient and largest glottal contact area. There may be close relationship between the OQ(open quotient) in the initial voice onset and the glottal width at the time of consonant production, the larger glottal width just before vocal fold vibration results in the smaller OQ of the vocal fold vibration in the initial voice onset. The EGG changes of baseline shift during continuous speech production were characterized by the different patterns for the three types of Korean consonants. The small and less stiffness change of baseline shift was found for the lenis and the glottalized, and the largest and stiffest change was found for the aspirated. On the baseline shift for the initial voice onset, they showed so similar patterns with for the consonant production, larger changed in the aspirated. for the lenis and the glottalized during the initial voice onset, three subjects showed individual difference each other. I suggest at s characteristics were strongly related with articulatory activity of vocal tract for the production of consonant, especially for the aspirated stop. The suspecting factors to affect EGG waveforms are glottal width, vertical laryngeal movement and the intrapharyngeal pressure to neighboring tissue during connected spech. So the EGG may be an useful method to describe laryngeal activity to classify pulsing conditions of the larynx during speech production, and EGG research can be controls for monitoring the vocal tract articulation, although above factors to affect EGG would have played such a potentially role on vocal fold vibratory behavior obtained using consonant production.

  • PDF

Performance Improvement of Speaker Recognition Using Enhanced Feature Extraction in Glottal Flow Signals and Multiple Feature Parameter Combination (Glottal flow 신호에서의 향상된 특징추출 및 다중 특징파라미터 결합을 통한 화자인식 성능 향상)

  • Kang, Jihoon;Kim, Youngil;Jeong, Sangbae
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.19 no.12
    • /
    • pp.2792-2799
    • /
    • 2015
  • In this paper, we utilize source mel-frequency cepstral coefficients (SMFCCs), skewness, and kurtosis extracted in glottal flow signals to improve speaker recognition performance. Generally, because the high band magnitude response of glottal flow signals is somewhat flat, the SMFCCs are extracted using the response below the predefined cutoff frequency. The extracted SMFCC, skewness, and kurtosis are concatenated with conventional feature parameters. Then, dimensional reduction by the principal component analysis (PCA) and the linear discriminat analysis (LDA) is followed to compare performances with conventional systems under equivalent conditions. The proposed recognition system outperformed the conventional system for large scale speaker recognition experiments. Especially, the performance improvement was more noticeable for small Gaussan mixtures.