통합 검색 | Korea Science

감정에 강인한 음성 인식을 위한 음성 파라메터 (Speech Parameters for the Robust Emotional Speech Recognition)

김원구
- 제어로봇시스템학회논문지
- /
- 제16권12호
- /
- pp.1137-1142
- /
- 2010
This paper studied the speech parameters less affected by the human emotion for the development of the robust speech recognition system. For this purpose, the effect of emotion on the speech recognition system and robust speech parameters of speech recognition system were studied using speech database containing various emotions. In this study, mel-cepstral coefficient, delta-cepstral coefficient, RASTA mel-cepstral coefficient and frequency warped mel-cepstral coefficient were used as feature parameters. And CMS (Cepstral Mean Subtraction) method were used as a signal bias removal technique. Experimental results showed that the HMM based speaker independent word recognizer using vocal tract length normalized mel-cepstral coefficient, its derivatives and CMS as a signal bias removal showed the best performance of 0.78% word error rate. This corresponds to about a 50% word error reduction as compare to the performance of baseline system using mel-cepstral coefficient, its derivatives and CMS.
https://doi.org/10.5302/J.ICROS.2010.16.12.1137 인용 PDF KSCI

방향과 경사도 분포를 이용한 패턴의 굴곡 성분 추출 (An extraction of depth information in pattern using directions and slopes)

전혜정;조동섭;김병철
- 대한전기학회:학술대회논문집
- /
- 대한전기학회 1992년도 하계학술대회 논문집 A
- /
- pp.462-464
- /
- 1992
In this paper, an extraction of depth intonation in pattern using neural network is presented. All the 3D images represent the depth information in grey pixels. This pixels which have analog values translated digital values. Because of the noise and distortion in pattern, we use the normalization in learning and recalling the patterns. Our method has eight direction vectors and slopes for pattern. Also, we use potential to obtain the mean slope and direction vectors of given 3D patches. The higher level of deduction finding the global depth information is also carried out by using neural network.
PDF

에너지 기반 가중치를 이용한 음성 특징의 자동회귀 이동평균 필터링 (ARMA Filtering of Speech Features Using Energy Based Weights)

반성민;김형순
- 한국음향학회지
- /
- 제31권2호
- /
- pp.87-92
- /
- 2012
In this paper, a robust feature compensation method to deal with the environmental mismatch is proposed. The proposed method applies energy based weights according to the degree of speech presence to the Mean subtraction, Variance normalization, and ARMA filtering (MVA) processing. The weights are further smoothed by the moving average and maximum filters. The proposed feature compensation algorithm is evaluated on AURORA 2 task and distant talking experiment using the robot platform, and we obtain error rate reduction of 14.4 % and 44.9 % by using the proposed algorithm comparing with MVA processing on AURORA 2 task and distant talking experiment, respectively.
https://doi.org/10.7776/ASK.2012.31.2.087 인용 PDF KSCI

고른 필터를 이용한 인공위성의 자세 추정 (Spacecraft Attitude Estimation by Unscented Filtering)

이현재;최윤혁;방효충;박종오
- 제어로봇시스템학회논문지
- /
- 제14권9호
- /
- pp.865-872
- /
- 2008
Spacecraft attitude estimation using the nonlinear unscented filter is addressed to fully utilize capabilities of the unscented transformation. To release significant computational load, an efficient technique is proposed by reasonably removing correlation between random variables. This modification introduces considerable reduction of sigma points and computational burden in matrix square-root calculation for most nonlinear systems. Unscented filter technique makes use of a set of sample points to predict mean and covariance. The general QUEST(QUaternion ESTimator) algorithm preserves explicitly the quaternion normalization, whereas extended Kalman filter(EKF) implicitly obeys the constraint. For spacecraft attitude estimation based on quaternion, an approach to computing quaternion means from sampled quaternions with guarantee of the quaternion norm constraint is introduced applying a constrained optimization technique. Finally, the performance of the new approach is demonstrated using a star tracker and rate-gyro measurements.
https://doi.org/10.5302/J.ICROS.2008.14.9.865 인용 PDF KSCI

Ordinal Rank 알고리즘을 이용한 자동 PIF 추출 - 변화탐지를 위한 상대방사정규화를 목적으로 (Automatic Extraction of Pseudo Invariant Features using Ordinal Rank Algorithm for Radiometric Normalization)

한유경;김대성;김용일
- 대한원격탐사학회:학술대회논문집
- /
- 대한원격탐사학회 2008년도 춘계학술대회 논문집
- /
- pp.213-218
- /
- 2008
동일 지점을 촬영한 위성영상은 위성의 센서나 영상의 취득 시기, 지형의 상태 등에 따라 그 지점에 나타나는 화소값이 일정하지 않다. 이러한 영상은 영상간 모자이크나 변화 탐지 결과에 영향을 미칠 가능성이 높으므로 방사보정(또는 방사정규화)을 통해 화소값의 차이를 최소화시킬 필요가 있다. 본 연구는 선형회귀식을 적용한 상대 방사정규화에 초점을 맞추고 있으며, 선형회귀식 구성에 필요한 PIF(Pseudo Invariant Feature)를 자동으로 추출하기 위해 Ordinal Rank 알고리즘을 적용하였다. 이 방법을 통해 각 밴드별 후보 PIF를 추출하고, 공통으로 해당되는 최종 PIF를 추출할 수 있었다. RMSE(Root Mean Square Error), Dynamic range, Coefficient of variation 등을 통해 방사보정 후의 결과를 평가해보았다. 영상회귀를 이용한 방사보정알고리즘과의 비교를 통해 제안된 알고리즘이 갖는 장점을 확인하였다.
PDF

Non-data Aided Timing Phase Recovery Scheme for Digital Equalization of Chromatic Dispersion and Polarization Mode Dispersion

Park, Jang-Woo;Chung, Won-Zoo;Park, Jong-Sun;Kim, Sung-Chul
- Journal of the Optical Society of Korea
- /
- 제13권3호
- /
- pp.367-372
- /
- 2009
In this paper we propose an electronic domain timing phase selection scheme for the optical communication systems suffering from inter-symbol-interference (ISI) distortion due to chromatic dispersion (CD) or polarization mode dispersion (PMD). In the presence of CD/PMD a proper timing phase selection is important for discrete time domain equalizers, since different timing phases produce different nonlinear ISI channels of different severity. The proposed timing phase recovery scheme based on dispersion minimization (DM) practically approximates the optimal minimum mean squared error (MMSE) timing phase without training signals which reduces overall throughput substantially, especially in time-varying channels such as PMD. The simulation results show that the proposed DM timing agrees with MMSE timing phase, under proper normalization of the received signals, for various dispersion and OSNR.
https://doi.org/10.3807/JOSK.2009.13.3.367 인용 PDF KSCI

조립토의 거칠기 및 모양 분석 (Roughness and Shape Analysis on Granular Materials)

민덕기;이완진;이종익
- 한국지반공학회:학술대회논문집
- /
- 한국지반공학회 2002년도 가을 학술발표회 논문집
- /
- pp.245-252
- /
- 2002
The roughness of Joomoonjin sand and the Dongchun river sand was analysed by the fractal theory. It was found that the fractal dimension(D$\_$F/) of Joomoonjin sand is a little smaller than the Dongchun river sand. That means Joomoonjin sand is smoother than the Dongchun river sand. The measurements of D$\_$F/ of different fraction of the Donchun river sand showed that large particles were rougher than fine particles. The shapes of both sands were analysed by the Discrete Fourier Transform(DFT) and the Grid-based(GB) method. Normalization of coefficients with respect to size, starting point and its orientation made the coefficients invaried to these characteristics. The mean of the normalized coefficients was used to reconstruct the average shape for both sands, respectively. The measurements of the ellipticity ratio of different fraction of both sands showed that Joomoonjin sand is slightly flatter than the Dongcun river sand.
PDF

신경망을 이용한 고신뢰성의 회귀분석 모델 (Regression Model With High Reliability by Using Neural Networks)

조용현
- 정보처리학회논문지B
- /
- 제8B권4호
- /
- pp.327-334
- /
- 2001
본 논문에서는 기울기하강과 동적터널링이 조합된 학습알고리즘의 다층신경망을 이용한 고신회성의 회귀분석 모델을 제안하였다. 기울기하강은 빠른 수렴속도의 최적화가 가능하도록 하기 위함이고, 동적터널링은 국소최적해를 만났을 때 이를 벗어난 새로운 연결가중치를 설정하여 전역최적해로 수렴되도록 하기 위함이다. 또한 대용량의 입력 데이터를 통계적으로 독립인 특징들의 집합으로 변환시키는 주요성분분석 기법의 속성을 살려 학습데이터의 차원을 감소시킴으로서 고차원의 학습데이터에 따른 회귀분석 모델의 제약도 동시에 해결하였다. 제안된 기법의 신경망을 3개의 독립변수 패턴을 가진 암모니아 제조공정문제와 10개의 독립변수 패턴을 가진 자동차 연비문제에 각각 적용하여 시뮬레이션한 결과, 기존의 역전과 알고리즘의 신경망이나 주요성분분석에 의한 차원을 감소시키지 않은 학습패턴을 이용한 신경망보다 각각 더욱 우수한 학습성능과 회귀성능이 있음을 확인할 수 있었다. 또한 학습패턴의 영평균 정규화로 회귀용 신경망의 성능을 더욱 더 개선하였다.
PDF

영평균 정규화와 PCA를 이용한 회귀 신경망의 성능개선 (Performance Improvement of Regression Neural Networks by Using PCA and Zero-Mean Normalization)

박용수;조용현
- 한국정보처리학회:학술대회논문집
- /
- 한국정보처리학회 2001년도 추계학술발표논문집 (상)
- /
- pp.515-518
- /
- 2001
본 논문에서는 전처리단계로 영평균 정규화 기법과 주요성분분석 기법을 도입하여 다층신경망을 이용한 고신뢰성의 회귀분석 모델을 제안한다. 영평균 정규화 기법은 데이터의 1차적 통계성을 고려하여 알고리즘을 간략화시키며, 주요성분분석 기법은 입력 데이터의 2차적 통계성을 고려하여 독립인 특징들의 집합으로 변환시켜 학습데이터의 차원을 감소시킬 수 있어 고차원의 학습데이터에 따른 회귀분석 모델의 제약을 해결할 수 있었다. 제안된 기법의 신경망을 3개의 독립변수를 가진 암모니아 제조공정문제와 10개의 독립변수를 가진 자동차 연비문제에 각각 적용하여 시뮬레이션한 결과, 단순정규화나 PCA를 적용하지 않는 경우보다 제안된 기법의 학습속도와 회귀성능이 더욱 더 우수함을 확인할 수 있었다.
PDF

전화망 환경에서 한국어 숫자음 인식을 위한 잡음처리 (Noise Reduction for Korean Connected Digit Recognition through Telephone Channel)

김규홍;김회린
- 대한음성학회:학술대회논문집
- /
- 대한음성학회 2003년도 5월 학술대회지
- /
- pp.211-214
- /
- 2003
일반적으로 음성 인식에서의 성능은 잡음의 영향으로 인하여 저하된다. 전화망을 통한 한국어 연속 숫자음 인식은 음성인식 분야에 있어서 어려운 영역에 속하는데, 이는 조음 현상으로 인한 인식률 저하되는 점과 전화망 채널의 영향으로 인하여 스펙트럼 포락이 왜곡되며 음성신호의 대역폭이 제한되기 때문이다. 본 논문에서는 잡음의 영향을 줄이기 위하여, 2WF(2-stage Wiener Filter) 와 SWP (SNR-dependent Waveform Processing) 그리고 CMN(Cepstrum Mean Normalization)을 사용하였다. 2WF는 음성 신호의 포만트 구조를 적게 왜곡시키면서 전체적인 가산잡음 뿐만 아니라 동적 가산잡음도 줄여준다. SWP는 음성파형에서 SNR값이 상대적으로 큰 부분을 강조하여 전체적인 SNR을 향상시킬 수 있다. 또한, CMN은 특징벡터로부터 채널잡음의 영향을 정규화하여 음성 인식 성능을 향상시킨다. 이러한 방법들을 전화망 한국어 연속 숫자음 DB를 이용하여 실험한 결과, 음성신호의 왜곡을 최소화하면서 잡음의 영향을 줄여 전화망에서의 숫자음 인식 성능을 향상시킬 수 있었다.
PDF

검색결과 147건 처리시간 0.031초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)