Search | Korea Science

Text-Independent Speaker Recognition Using Glottal Flow Waveform (성문파형을 이용한 문장독립 화자 인식기)

Yang Ki-Hyuk;Jeon Bumki;Baek SeongJoon;Kang Sang-Ki;Sung Koeng-Mo
- Proceedings of the Acoustical Society of Korea Conference / autumn / pp.57-60 / 1999
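In this paper, we extract speaker-characteristic coefficients from the glottal wave and apply them to a speaker recognizer. The residual signal of the speech is estimated by the covariance method and integrated to obtain the glottal wave. A single glottal-wave interval is taken not between glottal closure instants but between the instants at which the error of the residual signal is largest. The resulting glottal wave is resampled to M data points to form the feature vector, and the recognition rate is measured with a VQ-based recognizer. With 4 seconds of test data and 30-dimensional feature vectors, the average recognition rate was $96.08\%$ for male speakers and $93.61\%$ for female speakers.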
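The last two steps described in this abstract (resampling each glottal-wave cycle to M points and scoring with a VQ-based recognizer) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the linear interpolation, and the squared-Euclidean distortion are assumptions, and both codebook training and the covariance-method inverse filtering are omitted.

```python
def resample_cycle(cycle, m):
    """Linearly resample one glottal-wave cycle to exactly m points."""
    if m < 2 or len(cycle) < 2:
        raise ValueError("need at least two input samples and m >= 2")
    out = []
    for k in range(m):
        pos = k * (len(cycle) - 1) / (m - 1)   # fractional index into cycle
        i = min(int(pos), len(cycle) - 2)      # left neighbour, kept in range
        frac = pos - i
        out.append((1 - frac) * cycle[i] + frac * cycle[i + 1])
    return out


def vq_distortion(vectors, codebook):
    """Average squared distance from each vector to its nearest codeword."""
    total = 0.0
    for v in vectors:
        total += min(sum((a - b) ** 2 for a, b in zip(v, c)) for c in codebook)
    return total / len(vectors)


def identify_speaker(vectors, codebooks):
    """Return the speaker whose (hypothetical) codebook fits the vectors best."""
    return min(codebooks, key=lambda name: vq_distortion(vectors, codebooks[name]))
```

In this sketch `codebooks` maps a speaker label to a list of codewords, and a test utterance is represented by the list of resampled cycles it contains.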

Effect of Glottal Wave Shape on the Vowel Phoneme Synthesis (성문파형이 모음음소합성에 미치는 영향)

안점영;김명기
- The Journal of Korean Institute of Communications and Information Sciences / v.10 no.4 / pp.159-167 / 1985
Glottal waves derived directly from the Korean vowels /a, e, i, o, u/, which were recorded by a male speaker, were shown to differ depending on the vowel. After the vowels were resynthesized with five simulated glottal waves, the effects of glottal wave shape on speech synthesis were compared in terms of waveform. The waveforms of the synthetic vowels changed with the shape, opening time, and closing time of the glottal wave, which confirms that glottal wave shape is an important factor in improving the quality of synthesized speech.
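To make the roles of opening time and closing time concrete, the sketch below generates one pitch period of a Rosenberg-style trigonometric glottal pulse. The abstract does not say which five simulated waves were used, so this model and the parameter names `period`, `n_open`, and `n_close` (all in samples) are assumptions chosen for illustration.

```python
import math


def glottal_pulse(period, n_open, n_close):
    """One pitch period of a Rosenberg-style pulse, peak amplitude 1."""
    if n_open < 1 or n_close < 1 or n_open + n_close > period:
        raise ValueError("opening and closing phases must fit in one period")
    pulse = []
    for n in range(period):
        if n <= n_open:
            # Opening phase: smooth rise from 0 to 1.
            pulse.append(0.5 * (1 - math.cos(math.pi * n / n_open)))
        elif n <= n_open + n_close:
            # Closing phase: quarter-cosine fall from 1 to 0.
            pulse.append(math.cos(math.pi * (n - n_open) / (2 * n_close)))
        else:
            # Closed phase: no flow.
            pulse.append(0.0)
    return pulse
```

Shortening `n_close` relative to `n_open` makes the closure more abrupt, which is the kind of shape variation whose effect on the synthetic vowel waveform the abstract reports.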

Estimation of Glottal waveform (성문파의 추정)

Lee, Jung-Chul;Ann, Sou-Guil
- The Journal of the Acoustical Society of Korea / v.11 no.3 / pp.83-93 / 1992

Selective Low-Pass Filtering Method on Estimation of Voice Source Parameters (음원변수 추출에서 선택적 저역통과필터링)

엄기완
- Proceedings of the Acoustical Society of Korea Conference / 1998.08a / pp.238-241 / 1998
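We propose a method for extracting voice source parameters from the glottal wave signal, together with a selective low-pass filtering method for the preceding stage. To remove high-frequency noise from the differentiated glottal wave signal obtained by inverse filtering, the filter bandwidth is varied according to the source segment, which minimizes the error that the low-pass filter can introduce during source-parameter extraction. The method was compared and evaluated by synthesizing and filtering pulses of the LF-model, one of the voice source models.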
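The idea of changing the filter bandwidth from one source segment to the next can be sketched as below. The abstract does not specify the filter type or how the segments are delimited, so the moving-average smoother and the `(start, end, half_width)` segment tuples are assumptions; a larger `half_width` corresponds to a narrower passband.

```python
def moving_average_at(x, i, half_width):
    """Average of x over a window centred on i, clipped at the signal edges."""
    lo = max(0, i - half_width)
    hi = min(len(x), i + half_width + 1)
    return sum(x[lo:hi]) / (hi - lo)


def selective_lowpass(x, segments):
    """Smooth x with a different window in each segment.

    segments: list of (start, end, half_width) with end exclusive.
    Samples outside every segment are copied unchanged.
    """
    y = list(x)
    for start, end, half_width in segments:
        for i in range(start, min(end, len(x))):
            y[i] = moving_average_at(x, i, half_width)
    return y
```

A segment with `half_width` 0 passes its samples through untouched, so sharp events such as the main excitation can be left unsmoothed while noisier segments are filtered more heavily.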

Energy-Dependent Preemphasis for Speech Signal Preprocessing (음성신호 전처리를 위한 에너지 의존 프리엠퍼시스)

Kim, Dong-Jun;Park, Sang-Hui
- The Journal of the Acoustical Society of Korea / v.16 no.3 / pp.18-25 / 1997
This study describes a modified preemphasis formula, which we call energy-dependent preemphasis (EDP). It uses the normalized short-term energy of the speech signal, on the assumption that the source characteristics of the glottal pulses and the radiation characteristics of the lips are approximately proportional to the energy of the speech signal. Using this method, speech analyses such as AR spectrum estimation and formant detection are performed on the nonstationary onsets of 5 Korean single vowels. The results are compared with those of two conventional preemphasis methods. We found that the proposed preemphasis gave enhanced spectral shapes and more accurate formant frequencies, and avoided the overlapping of two adjacent formants.
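The abstract does not give the EDP formula itself, so the following is only one possible reading, stated as an assumption: a first-order preemphasis whose coefficient in each frame equals that frame's short-term energy normalized by the largest frame energy. The function names and the frame-wise update are hypothetical.

```python
def frame_energies(x, frame_len):
    """Sum of squared samples in consecutive non-overlapping frames."""
    return [sum(s * s for s in x[i:i + frame_len])
            for i in range(0, len(x), frame_len)]


def energy_dependent_preemphasis(x, frame_len):
    """y[n] = x[n] - a * x[n-1], with a set per frame from normalized energy."""
    energies = frame_energies(x, frame_len)
    peak = max(energies) if energies else 0.0
    y = []
    for n, s in enumerate(x):
        # Assumed mapping: coefficient in [0, 1] proportional to frame energy.
        a = energies[n // frame_len] / peak if peak > 0 else 0.0
        prev = x[n - 1] if n > 0 else 0.0
        y.append(s - a * prev)
    return y
```

Under this assumption, low-energy frames such as a vowel onset receive weaker preemphasis than high-energy steady-state frames.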

Implementation and Evaluation of Electroglottograph System (전기성문전도(EGG) 시스템의 개발 및 평가)

김기련;김광년;왕수건;허승덕;이승훈;전계록;최병철;정동근
- Journal of Biomedical Engineering Research / v.25 no.5 / pp.343-349 / 2004
The electroglottograph (EGG) signal is recorded from vocal cord vibration by measuring the electrical impedance across the vocal folds through the neck skin. The purpose of this study was to develop an EGG system and to evaluate its applicability to speech analysis and the diagnosis of laryngeal disease. The EGG system was composed of two pairs of ring electrodes, a tuned amplifier, a phase-sensitive detector, a low-pass filter, and an auto-gain controller. It was designed to extract the electrical impedance after detection by an amplitude modulation method with a 2.7 MHz carrier signal. The extracted signals were transmitted through the line-in of a PC sound card, sampled, and quantized. Closed Quotient (CQ), Speed Quotient (SQ), Speed Index (SI), fundamental frequency of vocal cord vibration (F0), pitch variability of vocal fold vibration (Jitter), and peak-to-peak amplitude variability of vocal fold vibration (Shimmer) were analyzed as EGG parameters. The experimental results were as follows: the faster the vocal fold vibration, the higher the CQ values and the lower the SQ and SI values. The EGG and speech signals had the same fundamental frequency. CQ, SQ, and SI differed significantly between normal subjects and patients with laryngeal cancer. These results suggest that it is possible to implement a portable EGG system to monitor vocal cord function and to test functional changes of the glottis.
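Several of the EGG parameters listed here have simple cycle-level definitions, sketched below. The paper's exact formulas are not in the abstract; the "local" jitter and shimmer definitions (mean absolute cycle-to-cycle difference divided by the mean) and the function names are assumptions, and detection of cycle boundaries and closed phases is taken as already done.

```python
def relative_variability(values):
    """Mean absolute difference of consecutive values, divided by their mean."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return (sum(diffs) / len(diffs)) / (sum(values) / len(values))


def jitter(periods):
    """Cycle-to-cycle variability of the pitch period."""
    return relative_variability(periods)


def shimmer(amplitudes):
    """Cycle-to-cycle variability of the peak-to-peak amplitude."""
    return relative_variability(amplitudes)


def closed_quotient(closed_duration, period):
    """Fraction of one cycle during which the vocal folds are in contact."""
    return closed_duration / period


def fundamental_frequency(periods):
    """Mean F0 in Hz from a list of cycle periods in seconds."""
    return len(periods) / sum(periods)
```

Multiplying the jitter and shimmer values by 100 gives the percentages in which these measures are often reported.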

A Study on the Slop Compensation of Speech Spectrum by QMF(Quadrature Mirror Filter) (QMF Filter에 의한 음성스펙트럼 평탄화에 관한 연구)

Jun, Woo-Jin
- Proceedings of the KAIS Fall Conference / 2010.05a / pp.273-276 / 2010
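When a speech signal is observed, its high-frequency content tends to be attenuated because of glottal characteristics. A pre-emphasis filter is used to compensate for this attenuation. In its simplest form, the pre-emphasis filter can be written as the difference equation $y(n)=s(n)-As(n-1)$, where A usually takes a value between 0.9 and 1. In this paper, we propose a method that uses a QMF filter to split the input signal into two bands, high-frequency and low-frequency, and applies a pre-emphasis filter to each band so that the attenuated characteristics are compensated accurately.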
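The difference equation $y(n)=s(n)-As(n-1)$ and the two-band idea can be sketched as follows. The abstract does not specify the QMF design, so the 2-tap Haar pair (the shortest QMF pair) is used here purely for brevity; it, the function names, and the choice of $s(-1)=0$ are assumptions.

```python
import math


def preemphasis(s, a):
    """y(n) = s(n) - a * s(n-1), taking s(-1) = 0."""
    return [s[n] - a * (s[n - 1] if n > 0 else 0.0) for n in range(len(s))]


def haar_split(s):
    """Split an even-length signal into low and high bands (decimated by 2)."""
    if len(s) % 2:
        raise ValueError("signal length must be even")
    r = math.sqrt(2.0)
    low = [(s[i] + s[i + 1]) / r for i in range(0, len(s), 2)]
    high = [(s[i] - s[i + 1]) / r for i in range(0, len(s), 2)]
    return low, high


def haar_merge(low, high):
    """Inverse of haar_split: rebuild the full-rate signal."""
    r = math.sqrt(2.0)
    out = []
    for lo, hi in zip(low, high):
        out.extend([(lo + hi) / r, (lo - hi) / r])
    return out


def two_band_preemphasis(s, a_low, a_high):
    """Apply a separate pre-emphasis coefficient to each band, then merge."""
    low, high = haar_split(s)
    return haar_merge(preemphasis(low, a_low), preemphasis(high, a_high))
```

With both coefficients set to 0 the signal is reconstructed unchanged, which is a quick check that the band split itself adds no distortion.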

The Slop Compensation Algorithm of Speech Spectrum by QMF (Quadrature Mirror Filter) (QMF Filter에 의한 음성스펙트럼의 기울기 보상 알고리즘)

Min, So-Yeon;Bae, Myung-Jin
- Proceedings of the KAIS Fall Conference / 2006.05a / pp.364-367 / 2006
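When a speech signal is observed, its high-frequency content tends to be attenuated because of glottal characteristics. A pre-emphasis filter is used to compensate for this attenuation. In its simplest form, the pre-emphasis filter can be written as the difference equation $y(n)=s(n)-As(n-1)$, where A usually takes a value between 0.9 and 1. In this paper, we propose a method that uses a QMF filter to split the input signal into two bands, high-frequency and low-frequency, and applies a pre-emphasis filter to each band so that the attenuated characteristics are compensated accurately.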

On the Flattening Techniques of Vocal track characteristics by using position information of the LSP (Line Spectrum Pairs) (LSP parameter의 위치정보를 이용한 성도특성 평탄화기법)

Kim YoungKyou;MIN SoYeon;BAE MyungJin
- Proceedings of the Acoustical Society of Korea Conference / spring / pp.171-174 / 2002
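The high-frequency content of a speech signal tends to be attenuated because of glottal characteristics. A pre-emphasis filter is used to compensate for this. It can be written as the difference equation $y(n)=s(n)-As(n-1)$, where A usually takes a value between 0.9 and 1. However, in compensating the high-frequency characteristics, the pre-emphasis filter distorts the zeros as well as the poles. In this paper, we use the distribution of the LSP (Line Spectrum Pairs) according to speech characteristics to preserve the zeros and to emphasize the high-frequency or low-frequency characteristics that vocoders and coding require.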

Performance Improvement of Speaker Recognition Using Enhanced Feature Extraction in Glottal Flow Signals and Multiple Feature Parameter Combination (Glottal flow 신호에서의 향상된 특징추출 및 다중 특징파라미터 결합을 통한 화자인식 성능 향상)

Kang, Jihoon;Kim, Youngil;Jeong, Sangbae
- Journal of the Korea Institute of Information and Communication Engineering / v.19 no.12 / pp.2792-2799 / 2015
In this paper, we use source mel-frequency cepstral coefficients (SMFCCs), skewness, and kurtosis extracted from glottal flow signals to improve speaker recognition performance. Because the high-band magnitude response of glottal flow signals is generally somewhat flat, the SMFCCs are extracted using only the response below a predefined cutoff frequency. The extracted SMFCCs, skewness, and kurtosis are concatenated with conventional feature parameters. Dimensionality reduction by principal component analysis (PCA) and linear discriminant analysis (LDA) is then applied so that performance can be compared with conventional systems under equivalent conditions. The proposed recognition system outperformed the conventional system in large-scale speaker recognition experiments. The improvement was especially noticeable with a small number of Gaussian mixtures.
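The skewness and kurtosis features, and their concatenation with an existing feature vector, can be sketched as below. The moment-based definitions (with non-excess kurtosis, so a Gaussian scores 3) and the function names are assumptions, since the abstract does not state which estimators were used; SMFCC extraction, PCA, and LDA are not shown.

```python
def central_moment(x, order):
    """Population central moment of the given order."""
    mean = sum(x) / len(x)
    return sum((v - mean) ** order for v in x) / len(x)


def skewness(x):
    """Third standardized moment: asymmetry of the sample distribution."""
    m2 = central_moment(x, 2)
    if m2 == 0:
        raise ValueError("skewness is undefined for a constant frame")
    return central_moment(x, 3) / m2 ** 1.5


def kurtosis(x):
    """Fourth standardized moment (non-excess): peakedness of the distribution."""
    m2 = central_moment(x, 2)
    if m2 == 0:
        raise ValueError("kurtosis is undefined for a constant frame")
    return central_moment(x, 4) / m2 ** 2


def append_shape_features(base_features, glottal_frame):
    """Concatenate skewness and kurtosis of a glottal flow frame to a vector."""
    return list(base_features) + [skewness(glottal_frame), kurtosis(glottal_frame)]
```

Here `base_features` stands in for the conventional per-frame parameters, and `glottal_frame` for the samples of the estimated glottal flow in the same frame.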
https://doi.org/10.6109/jkiice.2015.19.12.2792

Search Result 18, Processing Time 0.028 seconds