Search | Korea Science

Flattening Techniques for Pitch Detection (피치 검출을 위한 스펙트럼 평탄화 기법)

Kim, Jong-Kuk;Jo, Wang-Rae;Bae, Myung-Jin
- Proceedings of the IEEK Conference / 2002.06d / pp.381-384 / 2002
In speech signal processing, accurate pitch detection is very important for speech recognition, synthesis, and analysis. However, detecting pitch from a speech signal is difficult because of the effects of formants and transition amplitude. In this paper, we therefore propose a pitch detection method that uses spectrum flattening techniques. Spectrum flattening eliminates the effects of formants and transition amplitude. In the time domain, positive center clipping is applied to emphasize the pitch period using the glottal component, with the vocal tract characteristic removed. A rough formant envelope is then computed in the frequency domain by peak-fitting the spectrum of the original speech signal. As a result, we obtain the flattened harmonics waveform from the algebraic difference between the spectrum of the original speech signal and the smoothed formant envelope. Finally, we obtain a residual signal from which the vocal tract component has been removed. The performance was compared with LPC, cepstrum, and ACF methods. With this algorithm, the accuracy of pitch detection improved and the gross error rate was reduced in voiced speech regions and in transition regions where the phoneme changes.
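
The time-domain step can be illustrated with a minimal sketch. It assumes that "positive center clipping" means keeping only the part of each sample that exceeds a positive threshold; the threshold ratio of 0.3, the function names, and the synthetic test frame are all hypothetical and are not taken from the paper. Plain autocorrelation (one of the methods the paper compares against) is used here only to read a period off the clipped frame.

```python
import math

def positive_center_clip(frame, ratio=0.3):
    """Keep only the part of each sample above a positive threshold (assumed form)."""
    threshold = ratio * max(frame)
    return [s - threshold if s > threshold else 0.0 for s in frame]

def autocorr_period(frame, min_lag, max_lag):
    """Return the lag in [min_lag, max_lag] with the largest autocorrelation."""
    best_lag, best_val = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        val = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        if val > best_val:
            best_lag, best_val = lag, val
    return best_lag

# Synthetic voiced-like frame: 200 Hz fundamental plus its second harmonic.
fs = 8000
f0 = 200.0
frame = [math.sin(2 * math.pi * f0 * n / fs)
         + 0.5 * math.sin(2 * math.pi * 2 * f0 * n / fs)
         for n in range(400)]

clipped = positive_center_clip(frame)
period = autocorr_period(clipped, min_lag=20, max_lag=160)
estimated_f0 = fs / period
```

Clipping leaves one narrow positive pulse per pitch period, so the autocorrelation has little energy at lags that are not multiples of the period.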

Vocal Fold Videokymography: New Approach for the Analysis of Vocal Fold Vibratory Pattern

Lee, J.S.;Kim, E.J.;Yi, W.J.;Park, K.S.;Sung, M.Y.;Sung, M.H.;Kim, K.H.
- Proceedings of the KOSOMBE Conference / v.1997 no.05 / pp.313-315 / 1997
We developed a new analysis technique for the assessment of irregular vibratory movement of vocal folds. Successive frames of pre-recorded video images from videostroboscopy were transferred to computer memory and a vibratory tract of one selected point was described as a waveform by displaying the same lines of all frames along the y-direction. By applying this technique, irregular vibratory patterns of multiple regions, such as asynchronized registration of glottal cycles, could be easily visualized. It would be possible to monitor and analyze the pathologic changes of vocal fold movement by means of this newly developed system.
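
The line-stacking idea in this abstract can be sketched as follows. This is only an illustration under assumptions: frames are modeled as small grayscale grids (lists of rows), and the `gap_width` helper, its threshold, and the synthetic frames are hypothetical, not part of the described system.

```python
def kymogram(frames, row):
    """Stack the same scan line from successive frames, one line per output row."""
    return [list(frame[row]) for frame in frames]

def gap_width(line, threshold=0.5):
    """Count dark pixels (below threshold) as a crude glottal-gap width."""
    return sum(1 for px in line if px < threshold)

# Synthetic video: 8 frames of 5 x 9 pixels. Row 2 crosses a dark gap whose
# half-width cycles 0, 1, 2, 1 to mimic an opening and closing glottis.
frames = []
for t in range(8):
    half = [0, 1, 2, 1][t % 4]
    frame = [[1.0] * 9 for _ in range(5)]
    frame[2] = [0.0 if abs(c - 4) < half else 1.0 for c in range(9)]
    frames.append(frame)

image = kymogram(frames, row=2)
widths = [gap_width(line) for line in image]
```

Reading `widths` from top to bottom gives the vibratory waveform of the selected line over time; repeating this for several rows would allow different regions of the fold to be compared.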

A Study on the Pitch Detection of Speech Harmonics by the Peak-Fitting (음성 하모닉스 스펙트럼의 피크-피팅을 이용한 피치검출에 관한 연구)

Kim, Jong-Kuk;Jo, Wang-Rae;Bae, Myung-Jin
- Speech Sciences / v.10 no.2 / pp.85-95 / 2003
In speech signal processing, accurate pitch detection is very important for speech recognition, synthesis, and analysis. If pitch is detected accurately, it can be used in analysis to obtain the vocal tract parameters properly. In speech synthesis it can be used to change or maintain naturalness and intelligibility, and in speech recognition it can be used to remove speaker individuality for speaker independence. In this paper, we propose a new pitch detection algorithm. First, in the time domain, positive center clipping based on the slope of the speech signal is applied to emphasize the pitch period using the glottal component, with the vocal tract characteristic removed. Next, a rough formant envelope is computed in the frequency domain by peak-fitting the spectrum of the original speech signal. A smoothed formant envelope is then obtained from the rough envelope by linear interpolation. The flattened harmonics waveform is obtained as the algebraic difference between the spectrum of the original speech signal and the smoothed formant envelope. An inverse fast Fourier transform (IFFT) is applied to these flattened harmonics. Finally, we obtain a residual signal from which the vocal tract component has been removed. The performance was compared with LPC, cepstrum, and ACF methods. With this algorithm, the accuracy of pitch detection improved and the gross error rate was reduced in voiced speech regions and in transition regions where the phoneme changes.
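
The frequency-domain steps (peak-fitting, linear interpolation, and subtraction) can be sketched roughly as below. Several choices here are assumptions rather than details from the paper: the spectrum is taken as a natural-log magnitude spectrum, "peak-fitting" is approximated by picking strict local maxima, a naive DFT stands in for the FFT, and all function names and the synthetic frame are hypothetical. The final IFFT step is omitted.

```python
import cmath
import math

def log_magnitude_spectrum(frame, floor=1e-6):
    """Naive DFT log-magnitude for bins 0..N/2; the floor avoids log(0)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        spectrum.append(math.log(max(abs(acc), floor)))
    return spectrum

def peak_bins(spectrum):
    """Indices of strict local maxima, taken here as the harmonic peaks."""
    return [k for k in range(1, len(spectrum) - 1)
            if spectrum[k] > spectrum[k - 1] and spectrum[k] > spectrum[k + 1]]

def interpolated_envelope(spectrum, peaks):
    """Linear interpolation through the peaks, held flat outside the outer peaks."""
    env = [0.0] * len(spectrum)
    for k in range(len(spectrum)):
        if k <= peaks[0]:
            env[k] = spectrum[peaks[0]]
        elif k >= peaks[-1]:
            env[k] = spectrum[peaks[-1]]
    for left, right in zip(peaks, peaks[1:]):
        for k in range(left, right + 1):
            frac = (k - left) / (right - left)
            env[k] = spectrum[left] + frac * (spectrum[right] - spectrum[left])
    return env

# Synthetic frame: 8 harmonics of 250 Hz with 1/k amplitude decay (a stand-in
# for a formant-shaped spectrum). With fs = 8000 and N = 128, f0 sits on bin 4.
fs, n_samples, f0 = 8000, 128, 250.0
frame = [sum(math.sin(2 * math.pi * k * f0 * t / fs) / k for k in range(1, 9))
         for t in range(n_samples)]

spectrum = log_magnitude_spectrum(frame)
peaks = peak_bins(spectrum)
envelope = interpolated_envelope(spectrum, peaks)
flattened = [s - e for s, e in zip(spectrum, envelope)]
```

After subtraction every harmonic peak sits at the same level (zero), so the harmonic spacing is no longer masked by the spectral tilt that the envelope captured.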

Quasi-periodic waveform analysis for diplophonia (이중음성에 대한 음성파형분석)

홍기환;김미정;정상술
- Proceedings of the KOR-BRONCHOESO Conference / 1993.05a / pp.71-71 / 1993
Diplophonia is a voice with two separate tones, produced through quasi-periodic variations in vocal cord vibration. It is generally regarded as a symptom of laryngeal pathology. A difference in vibratory frequency between the vocal cords can arise from a tension imbalance and a difference in the level of the vocal folds under special conditions such as incomplete glottal closure. The authors encountered 19 patients with diplophonia associated with unilateral vocal cord paralysis, intracordal cysts, and other mass lesions. We analyzed the diplophonic voice pre- and postoperatively in terms of peak variability and noise level in the quasi-periodic waveforms and spectrograms.

The Characteristics of Voice Onset Time of the Korean Stops in the Benign Laryngeal Disorders (후두질환에 따른 자음의 음성발현시간의 특성)

Hong, Ki-Hwan;Lee, Hwa-Uk;Kim, Jin-Sung;Lee, Eun-Jung;So, Sang-Soo;Choi, Dong-Il;Ynng, Yoon-Soo
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.17 no.2 / pp.98-102 / 2006
Background and Objectives: Voice onset time (VOT) is defined as the time interval from the oral release of a stop consonant to the onset of glottal pulsing in the following vowel. VOT is a temporal characteristic of stop consonants that reflects the complex timing of glottal articulation relative to supraglottal articulation. Stop consonants are characterized by the creation of a pressure difference across a complete occlusion in the vocal tract, followed by a sudden release "burst" when that occlusion opens. The objective of this study is to evaluate the usefulness of voice onset time in the assessment of patients with voice disorders. Subjects: Subjects were 20 adults with normal voices and with benign laryngeal disorders. Subjects with voice disorders represented the following vocal pathologies: vocal polyp, vocal nodule, Reinke's edema, and unilateral vocal fold paralysis (UVFP). Control subjects were matched for age (21-40 years old) and sex (male) with the voice-disordered subjects and had normal vocal qualities with no history of voice disorders. Methods: Each voice-disordered and matched control subject read test passages containing three types of Korean bilabial consonants. VOT measures were made for the initial /p/, /pʰ/, and /p'/. VOT was measured using the acoustic waveform or a wide-band spectrogram. Results: For each voiceless stop consonant, there was a significant difference in VOT between the voice-disordered and normal subjects. The mean VOT of the lax stops in the UVFP group was significantly shorter than that of the control subjects. The mean VOTs of the aspirated stops in the vocal polyp and nodule groups were longer than those of the control subjects, but the difference was not significant. The mean VOTs of the glottalized stops in the voice-disordered groups were longer than those of the control subjects, and the difference was statistically significant in the UVFP group.
Conclusions: VOT may be a clinically useful acoustic parameter in the assessment of voice-disordered patients, especially those with unilateral vocal fold paralysis.
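
The VOT definition given at the start of this abstract reduces to a simple time difference once the burst and the voicing onset have been marked on the waveform. A minimal sketch follows; the function name, sample positions, and sampling rate are made-up illustration values, not data from the study.

```python
def voice_onset_time_ms(burst_sample, voicing_sample, sample_rate_hz):
    """Interval from stop release (burst) to onset of glottal pulsing, in ms."""
    return (voicing_sample - burst_sample) * 1000.0 / sample_rate_hz

# Hypothetical annotation: burst marked at sample 4410 and first glottal pulse
# at sample 7938 in a recording sampled at 44.1 kHz.
vot_ms = voice_onset_time_ms(4410, 7938, 44100)
```

A negative result would indicate voicing that begins before the release (prevoicing).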

Analysis of Phonatory Aerodynamic & Electroglottography of a Countertenor (Countertenor 1인의 Modal Register와 Falsetto Register에서의 공기역학적 변화 및 전기성문파형의 변화 연구)

Nam, Do-Hyun;Choi, Seong-Hee;Choi, Jae-Nam;Choi, Hong-Shik
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.17 no.1 / pp.43-48 / 2006
Background and Objectives: Countertenors can produce a high vocal pitch similar to that of female classical singers and use both the modal and falsetto registers. This study was conducted to compare the phonatory characteristics of the modal and falsetto registers of a countertenor. Materials and Methods: A male countertenor with 8 years of experience was examined using videostroboscopy, and his voice was analyzed using aerodynamic measures, namely fundamental frequency (F0), mean air flow rate (MFR), intensity (SLP), and subglottal air pressure (Psub), with a phonatory function analyzer (Nagashima), and acoustic measures, namely jitter, shimmer, HNR, and closed quotient (CQ), using electroglottography (EGG) with Lx. Speech Studio (Laryngoscope, Ltd, UK) and the voice range profile (VRP) of CSL (Kay Elemetrics). Results: On stroboscopy, the longitudinal length of the vocal folds was increased in the falsetto register, and the upper margin of the vocal folds vibrated with incomplete closure of the true vocal folds. In the aerodynamic analysis, intensity was the same in the modal and falsetto registers. However, MFR, Psub, and MPT were higher in the falsetto register. In the electroglottographic analysis, the closed quotient (CQ) was high in the modal register and was also much higher in the high-pitch falsetto than in the loud falsetto. In the VRP, intensity was similar although F0 differed between the modal and falsetto registers. Conclusion: The results imply that the countertenor could produce a powerful voice quality by increasing respiratory pressure and respiratory volume even though glottal closure was incomplete. In addition, no change in the EGG waveform was observed, and the voice range was similar to that of an alto.
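
For readers unfamiliar with the acoustic measures named above, the sketch below shows commonly used definitions: local jitter and shimmer as the mean absolute difference between consecutive cycle periods or amplitudes divided by their mean, and closed quotient as the closed-phase duration divided by the cycle period. These definitions are an assumption about what the analysis software reports, and the cycle values are invented for illustration only.

```python
def local_perturbation(values):
    """Mean absolute difference between consecutive values, relative to their mean."""
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return (sum(diffs) / len(diffs)) / (sum(values) / len(values))

def closed_quotient(closed_duration, period):
    """Fraction of one glottal cycle during which the vocal folds are in contact."""
    return closed_duration / period

# Hypothetical per-cycle measurements (not data from the study).
periods_ms = [5.0, 5.1, 4.9, 5.0]
amplitudes = [1.00, 0.98, 1.02, 1.00]

jitter = local_perturbation(periods_ms)    # applied to cycle periods
shimmer = local_perturbation(amplitudes)   # applied to cycle peak amplitudes
cq = closed_quotient(closed_duration=2.5, period=5.0)
```

Multiplying `jitter`, `shimmer`, or `cq` by 100 gives the percentage form in which these measures are usually reported.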
