Search | Korea Science

Applying feature normalization based on pole filtering to short-utterance speech recognition using deep neural network (심층신경망을 이용한 짧은 발화 음성인식에서 극점 필터링 기반의 특징 정규화 적용)

Han, Jaemin;Kim, Min Sik;Kim, Hyung Soon
- The Journal of the Acoustical Society of Korea
- /
- v.39 no.1
- /
- pp.64-68
- /
- 2020
In a conventional speech recognition system using Gaussian Mixture Model-Hidden Markov Model (GMM-HMM), the cepstral feature normalization method based on pole filtering was effective in improving the performance of recognition of short utterances in noisy environments. In this paper, the usefulness of this method for the state-of-the-art speech recognition system using Deep Neural Network (DNN) is examined. Experimental results on AURORA 2 DB show that the cepstral mean and variance normalization based on pole filtering improves the recognition performance of very short utterances compared to that without pole filtering, especially when there is a large mismatch between the training and test conditions.
https://doi.org/10.7776/ASK.2020.39.1.064 인용 PDF KSCI

Cepstral Feature Normalization Methods Using Pole Filtering and Scale Normalization for Robust Speech Recognition (강인한 음성인식을 위한 극점 필터링 및 스케일 정규화를 이용한 켑스트럼 특징 정규화 방식)

Choi, Bo Kyeong;Ban, Sung Min;Kim, Hyung Soon
- The Journal of the Acoustical Society of Korea
- /
- v.34 no.4
- /
- pp.316-320
- /
- 2015
In this paper, the pole filtering concept is applied to the Mel-frequency cepstral coefficient (MFCC) feature vectors in the conventional cepstral mean normalization (CMN) and cepstral mean and variance normalization (CMVN) frameworks. Additionally, performance of the cepstral mean and scale normalization (CMSN), which uses scale normalization instead of variance normalization, is evaluated in speech recognition experiments in noisy environments. Because CMN and CMVN are usually performed on a per-utterance basis, in case of short utterance, they have a problem that reliable estimation of the mean and variance is not guaranteed. However, by applying the pole filtering and scale normalization techniques to the feature normalization process, this problem can be relieved. Experimental results using Aurora 2 database (DB) show that feature normalization method combining the pole-filtering and scale normalization yields the best improvements.
https://doi.org/10.7776/ASK.2015.34.4.316 인용 PDF KSCI

Maximum dV/dt Detection Alaorithm for Photoplethysmography Waveform (광용적맥파 신호 최대 dV/dt 검출 알고리즘 개발)

Shin, Hangsik
- Proceedings of the KIEE Conference
- /
- 2015.07a
- /
- pp.1395-1396
- /
- 2015
본 연구의 목적은 광용적맥파 해석에 중요하게 사용되는 최대 상승기울기(maximum dV/dt) 지점 검출 알고리즘 개발로, 미분 및 필터링을 통한 전처리 과정, 극점 검출과정, 역탐색 등의 후처리 과정으로 구성되는 알고리즘을 구현하였다. 제안된 알고리즘의 성능을 평가하기 위하여 총 74,225개의 맥박파형을 사용한 검증을 수행하였으며, 동시에 측정된 심전도 QRS지점을 기준으로 최대 dV/dt 측정 위치 정확성을 판정하였다. 시뮬레이션 결과, 적응형 임계치 극점 검출 방법과 함께 사용하였을 때, 제안된 알고리즘은 기존 광용적맥파 상단, 하단극점 검출 알고리즘과 유사한 성능인 98.57%, 99.98%의 민감도와 특이도, 0.02%의 오검출율을 가지는 것으로 나타났다.
PDF

Coupling Matrix Synthesis Methods for RF/Microwave Filter Design (초고주파용 필터설계를 위한 결합행렬 합성법)

Choi, Dong-Muk;Kim, Che-Young
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.32 no.12A
- /
- pp.1346-1353
- /
- 2007
In this paper, the methods are presented for the calculation of general coupling coefficient matrixes used in the band pass filter design. They are calculated from transmission coefficient($S_{21}$) and reflection coefficient($S_{11}$) with desired characteristics derived from the poles of filter function and return loss(RL). The calculated matrixes from this method are transformed to the folded canonical filter structure using similarity transformation which lends us the practical filter design. Based on the resulting matrix, the folded canonical filter has been designed.
PDF KSCI

Selective pole filtering based feature normalization for performance improvement of short utterance recognition in noisy environments (잡음 환경에서 짧은 발화 인식 성능 향상을 위한 선택적 극점 필터링 기반의 특징 정규화)

Choi, Bo Kyeong;Ban, Sung Min;Kim, Hyung Soon
- Phonetics and Speech Sciences
- /
- v.9 no.2
- /
- pp.103-110
- /
- 2017
The pole filtering concept has been successfully applied to cepstral feature normalization techniques for noise-robust speech recognition. In this paper, it is proposed to apply the pole filtering selectively only to the speech intervals, in order to further improve the recognition performance for short utterances in noisy environments. Experimental results on AURORA 2 task with clean-condition training show that the proposed selectively pole-filtered cepstral mean normalization (SPFCMN) and selectively pole-filtered cepstral mean and variance normalization (SPFCMVN) yield error rate reduction of 38.6% and 45.8%, respectively, compared to the baseline system.
https://doi.org/10.13064/KSSS.2017.9.2.103 인용 PDF KSCI

A Study on the Channel Normalized Pitch Synchronous Cepstrum for Speaker Recognition (채널에 강인한 화자 인식을 위한 채널 정규화 피치 동기 켑스트럼에 관한 연구)

김유진;정재호
- The Journal of the Acoustical Society of Korea
- /
- v.23 no.1
- /
- pp.61-74
- /
- 2004
In this paper, a contort- and speaker-dependent cepstrum extraction method and a channel normalization method for minimizing the loss of speaker characteristics in the cepstrum were proposed for a robust speaker recognition system over the channel. The proposed extraction method creates a cepstrum based on the pitch synchronous analysis using the inherent pitch of the speaker. Therefore, the cepstrum called the 〃pitch synchronous cepstrum〃 (PSC) represents the impulse response of the vocal tract more accurately in voiced speech. And the PSC can compensate for channel distortion because the pitch is more robust in a channel environment than the spectrum of speech. And the proposed channel normalization method, the 〃formant-broadened pitch synchronous CMS〃 (FBPSCMS), applies the Formant-Broadened CMS to the PSC and improves the accuracy of the intraframe processing. We compared the text-independent closed-set speaker identification on 56 females and 112 males using TIMIT and NTIMIT database, respectively. The results show that pitch synchronous km improves the error reduction rate by up to 7.7% in comparison with conventional short-time cepstrum and the error rates of the FBPSCMS are more stable and lower than those of pole-filtered CMS.
PDF KSCI

Parameter Analysis for Time Reduction in Extracting SIFT Keypoints in the Aspect of Image Stitching (영상 스티칭 관점에서 SIFT 특징점 추출시간 감소를 위한 파라미터 분석)

Moon, Won-Jun;Seo, Young-Ho;Kim, Dong-Wook
- Journal of Broadcast Engineering
- /
- v.23 no.4
- /
- pp.559-573
- /
- 2018
Recently, one of the most actively applied image media in the most fields such as virtual reality (VR) is omni-directional or panorama image. This image is generated by stitching images obtained by various methods. In this process, it takes the most time to extract keypoints necessary for stitching. In this paper, we analyze the parameters involved in the extraction of SIFT keypoints with the aim of reducing the computation time for extracting the most widely used SIFT keypoints. The parameters considered in this paper are the initial standard deviation of the Gaussian kernel used for Gaussian filtering, the number of gaussian difference image sets for extracting local extrema, and the number of octaves. As the SIFT algorithm, the Lowe scheme, the originally proposed one, and the Hess scheme which is a convolution cascade scheme, are considered. First, the effect of each parameter value on the computation time is analyzed, and the effect of each parameter on the stitching performance is analyzed by performing actual stitching experiments. Finally, based on the results of the two analyses, we extract parameter value set that minimize computation time without degrading.
https://doi.org/10.5909/JBE.2018.23.4.559 인용 PDF KSCI KPUBS

Search Result 7, Processing Time 0.025 seconds

Applying feature normalization based on pole filtering to short-utterance speech recognition using deep neural network (심층신경망을 이용한 짧은 발화 음성인식에서 극점 필터링 기반의 특징 정규화 적용)

Cepstral Feature Normalization Methods Using Pole Filtering and Scale Normalization for Robust Speech Recognition (강인한 음성인식을 위한 극점 필터링 및 스케일 정규화를 이용한 켑스트럼 특징 정규화 방식)

Maximum dV/dt Detection Alaorithm for Photoplethysmography Waveform (광용적맥파 신호 최대 dV/dt 검출 알고리즘 개발)

Coupling Matrix Synthesis Methods for RF/Microwave Filter Design (초고주파용 필터설계를 위한 결합행렬 합성법)

Selective pole filtering based feature normalization for performance improvement of short utterance recognition in noisy environments (잡음 환경에서 짧은 발화 인식 성능 향상을 위한 선택적 극점 필터링 기반의 특징 정규화)

A Study on the Channel Normalized Pitch Synchronous Cepstrum for Speaker Recognition (채널에 강인한 화자 인식을 위한 채널 정규화 피치 동기 켑스트럼에 관한 연구)

Parameter Analysis for Time Reduction in Extracting SIFT Keypoints in the Aspect of Image Stitching (영상 스티칭 관점에서 SIFT 특징점 추출시간 감소를 위한 파라미터 분석)

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)