Search | Korea Science

음성 인식률 향상을 위한 음성의 특징 파라미터 추출 알고리즘

Choi, Jae-Seung
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2017.05a
- /
- pp.686-687
- /
- 2017
본 논문에서는 잡음에 강인하고 음성인식 성능이 효과적인 멜 주파수 켑스트럼 계수의 파라미터의 추출 알고리즘을 제안한다. 본 논문에서 제안한 알고리즘은 배경잡음이 혼합된 깨끗한 연속음성 중에서 위너필터를 이용하여 음성에 포함된 배경잡음을 감소시키며, 이후에 멜 주파수 켑스트럼 계수의 특징추출 방법을 사용하여 음성의 특징 파라미터를 추출한다.
PDF

Feature Extraction of Disease Region in Stomach Images Based on DCT (DCT기반 위장영상 질환부위의 특징추출)

Ahn, Byeoung-Ju;Lee, Sang-Bock
- Journal of the Korean Society of Radiology
- /
- v.6 no.3
- /
- pp.167-171
- /
- 2012
In this paper, we present an algorithm to extract features about disease region in digital stomach images. For feature extraction, DCT coefficients of gastrointestinal imaging matrix was obtained. DCT coefficent matrix is concentrated energy in low frequency region, we were extracted 128 feature parameters in low frequency region. Extracted feature parameters can using for differential compression of PACS and, can using for input parameter in CAD.
https://doi.org/10.7742/jksr.2012.6.3.167 인용 PDF KSCI

Feature extraction based on DWT and GA for Gesture Recognition of EPIC Sensor Signals (EPIC 센서 신호의 제스처 인식을 위한 이산 웨이블릿 변환과 유전자 알고리즘 기반 특징 추출)

Ji, Sang-Hun;Yang, Hyung-Jeong;Kim, Soo-Hyung;Kim, Young-Chul
- Proceedings of the Korea Information Processing Society Conference
- /
- 2016.04a
- /
- pp.612-615
- /
- 2016
본 논문에서는 EPIC(Electric Potential Integrated Circuit) 센서를 통해 추출된 동작신호에 대해 이산 웨이블릿 변환（Discrete Wavelet Transform : DWT)과 선형 판별분석（Linear Discriminant Analysis : LDA), Support Vector Machine(SVM)을 사용하는 동작 분류 시스템을 제안한다. EPIC 센서 신호에 대해 이산 웨이블릿 변환을 사용하여 웨이블릿 계수인 근사계수(approximation coefficients)와 상세계수(detail coefficients)를 구한 후, 각각의 웨이블릿 계수에 대해 특징 파라미터를 추출한다. 이 때, 특징 파라미터는 14개의 통계적 특징 추출 파라미터 중에 유전자 알고리즘(Genetic Algorithm : GA)을 통하여 선택한 우수한 특징 파라미터이다. 웨이블릿 계수들에서 추출한 특징 파라미터는 선형 판별분석을 적용하여 차원을 축소하고 SVM의 훈련 및 분류에 사용한다. 실험결과, 4가지 동작에 대한 EPIC 센서 신호분류에서 제안된 방법의 분류율이 99.75%로 원신호에 대한 HMM 분류율 97% 보다 높은 정확률을 보여주었다.
https://doi.org/10.3745/PKIPS.y2016m04a.612 인용 PDF

Disease Region Feature Extraction of Medical Image using Wavelet (Wavelet에 의한 의용영상의 병소부위 특징추출)

이상복;이주신
- Journal of the Korea Society of Computer and Information
- /
- v.3 no.3
- /
- pp.73-81
- /
- 1998
In this paper suggest for methods disease region feature extraction of medical image using wavelet. In the preprocessing, the shape informations of medical image are selected by performing the discrete wavelet transform(DWT) with four level coefficient matrix. In this approach, based on the characteristics of the coefficient matrix, 96 feature parameters are calculated as follows: Firstly. obtaining 32 feature parameters which have the characteristics of low frequency from the parameters according to the horizontal high frequency are calculated from the coefficient matrix of horizontal high frequency. In the third place, 16 vertical feature parameters are also calculated using the same kind of procedure with respect to the vertical high frequency. Finally, 32 feature parameters of diagonal high frequency are obtained from the coefficient matrix of diagonal high frequency. Consequently, 96 feature aprameters extracted. Using suggest algorithm in this paper will, implamentation can automatic recognition system, increasing efficiency of picture achieve communication system.
PDF

Disease Region Pattern Recognition Algorithm of Gastrointestinal Image using Wavelet Transform and Neural Network (Wavelet변환과 신경회로망에 의한 위장 영상의 질환 부위 패턴 인식 알고리즘)

이상복;이주신
- Journal of the Korean Institute of Telematics and Electronics S
- /
- v.36S no.5
- /
- pp.70-77
- /
- 1999
본 논문에서는 Wavelet을 이용한 위장 영상의 질환 부위 특징을 추출하여 질환 부위 패턴을 인식할 수 있는 알고리즘을 제안하였다. 전처리 과정으로서 위장 영상이 형태정보는 입력 영상을 DWT(Discrete wavelet transform)에 의해 4레벨 DWT 계수 행렬을 구하고 계수 행렬의 특징에 따라 저주파 계수 행렬로부터 저주파 특징 파라미터 32개, 수평 고주파 계수 행렬로부터 수평 고주파 특징 파라미터 16개, 수직 고주파 계수 행렬로부터 수직 고주파 특징 파라미터 16개, 그리고, 대각 고주파 계수 행렬로부터 대각 고주파 특징 파라미터 32개 등 모두 96개의 특징 파라미터를 추출한 후 각각의 특징 파라미터를 최대 값+0.5로 최소 값을 -0.5로 정규화 하여 신경회로망의 입력 벡터로 사용하였다. 위장 영상 패턴 인식을 위한 신경회로망은 교사 학습을 요구하는 다층 구조의 오차 역전파(Error back propagation)알고리즘으로 하였고 구조적 특성을 이용하여 입력층, 중간층, 출력층의 계층 구조로 설계하였다. 설계된 신경회로망의 학습은 학습계수를 0.2로 모우멘텀을 0.6으로 설정하여 출력층 최대오차가 0.01보다 작을 때까지 수행하였으며 약 8000회 정도 학습한 결과 설정값 보다 작은 결과를 얻었고 질환의 종류나 위치, 크기에 관계없이 100%의 인식률을 얻었다.
PDF

Extension of K-L Dynamic Parameter for Connected Digit Recognition (숫자음 인식을 위한 K-L 동적 특징파라미터의 확장)

김주곤
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.08a
- /
- pp.257-261
- /
- 1998
일반적으로 인식률이 저조한 연속 숫자음의 인식 정도 향상을 위해서 K-L 동적특징의 확장에 대해서 검토한다. 이 검토결과를 4연속 숫자음을 대상으로 하는 인식 실험을 수행하여 숫자음 인식에 있어서 확장된 K-L 동적특징의 유효성을 확인하고자 한다. 이를 위하여 음성자료는 국어공학센터에서 채록한 4연속 숫자음을 사용하며, 확장한 K-L 동적특징의 유효성을 확인하기 위해서는 단일 특징 파라미터로서 멜-켑스트럼과 회귀계수, K-L 동적계수 등과 이들 특징 파라미터를 결합한 경우에 대해서 특징파라미터를 확장하여 K-L 동적 특징을 추출하고, 4연속 숫자음인식 실험을 수행하였다. 이때 인식의 기본 단위로는 48개의 유사음소단위를 음소모델로 사용하였으며, 인식실험에 있어서는 유한 상태 오토마타에 의한 구문제어를 통한 OPDP 법을 이용하였다. 인식 실험 결과, 단일 특징파라미터로서 멜-켑스트럼을 사용한 경우 67.5%, 이를 확장한 K-L 동적계수를 사용한 경우 78.2%를 보였다. 또한 결합한 특징파라미터에 있어서는 멜-켑스트럼과 희귀계수를 사용한 경우 78.4%의 인식률을 보였으며, 이를 K-L 동적계수로 확장한 경우 82.3%의 인식률을 얻어 확장한 K-L 동적특징파라미터의 유효성을 확인하였다.
PDF

Speaker Independent Recognition Algorithm based on Parameter Extraction by MFCC applied Wiener Filter Method (위너필터법이 적용된 MFCC의 파라미터 추출에 기초한 화자독립 인식알고리즘)

Choi, Jae-Seung
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.21 no.6
- /
- pp.1149-1154
- /
- 2017
To obtain good recognition performance of speech recognition system under background noise, it is very important to select appropriate feature parameters of speech. The feature parameter used in this paper is Mel frequency cepstral coefficient (MFCC) with the human auditory characteristics applied to Wiener filter method. That is, the feature parameter proposed in this paper is a new method to extract the parameter of clean speech signal after removing background noise. The proposed method implements the speaker recognition by inputting the proposed modified MFCC feature parameter into a multi-layer perceptron network. In this experiments, the speaker independent recognition experiments were performed using the MFCC feature parameter of the 14th order. The average recognition rates of the speaker independent in the case of the noisy speech added white noise are 94.48%, which is an effective result. Comparing the proposed method with the existing methods, the performance of the proposed speaker recognition is improved by using the modified MFCC feature parameter.
https://doi.org/10.6109/jkiice.2017.21.6.1149 인용 PDF KSCI

Feature Parameter Extraction for Shape Information Analysis of 2-D Moving Object (2-D 이동물체의 형태 정보 분석을 위한 특징 파라미터 추출)

김윤호;이주신
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.16 no.11
- /
- pp.1132-1142
- /
- 1991
This paper proposed a method of feature parameter extraction for shape information analysis of moving object. In the 2-D plane, moving object are extracted by the difference method. Feature parameters of moving object are chosen area, perimeter, a/p ratio, vertex, x/y ratio. We changed brightness variation from the range of 600Lux to the 1400Lux and then determined Permissible Error range of feature parameter due to the brightness variation. So as to verify the validity of proposed method, experiment are performed with a toy car and it's results showed that decision error was less than 6%.
PDF

A Voice Boundary Detection Method Using Dynamic Parameters Based On Neural Network (신경망 기반의 동적 파라미터들을 이용한 음성 경계 추출)

마창수;김계영;최형일
- Proceedings of the Korean Information Science Society Conference
- /
- 2002.10d
- /
- pp.616-618
- /
- 2002
본 논문에서는 음성인식 성능을 높이기 위한 기본적 단계인 음성과 비음성 부분의 경계를 추출하는 음성 경계 추출 방법을 제안한다. 음성경계 추출을 위한 특징들로는 시간영역 분할 파라미터인 ZCR, MA를 사용하고 주파수 영역 분할 파라미터로 주파수 대역 파워 에너지 (Frequency band power energy), 포만트 계수 (Formant coefficient)를 사용하였고 각 파라미터들을 이용하여 음성 경계를 결정할 때 경험에 의해 임계치를 결정하는 단점을 보안하기 위해서 신경망을 이용한다. 신경망의 가중치와 임계치들은 지도 학습을 통해 최적화 되고, 학습을 통해 구성된 망을 음성과 비음성의 경계치 구분에 사용한다.
PDF

Covariance Model Based on Multi-Band for Speaker Verification in Noise (잡음 환경에서 화자 확인을 위한 다중대역에 기반한 공분산 방법)

Choi Min Jung;Lee Ki Yong
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.127-130
- /
- 2004
기존의 전대역(Full-Band)에서 특징 파라미터를 추출하는 화자 확인(Speaker Verification) 시스템은 저대역이나 고대역에서 화자 정보의 특징이 제거되기 쉽다. 또한, 주파수 스펙트럼에 부분적으로 오염이 되는 경우, 특징 파라미터를 왜곡시켜 화자 확인 시스템의 성능을 저하시킨다. 본 논문에서는 이러한 문제점을 해결하기 위해 다중대역 공분산 모델(Covariance Model)을 제안한다. 제안한 방법은 주파수 영역에서 전대역을 여러 개의 부대역(Sub-Band)으로 분할하고, 부대역별로 독립적으로 특징 파라미터를 추출하여 공분산 모델을 구한다. 제안된 방법의 성능 확인을 위하여 공분산 모델 간의 거리를 측정하는 화자 확인 실험을 하였다. 잡음 환경에서 기존의 방법인 전대역에 기반한 공분산 모델과 제안한 방법을 비교 분석한 결과, 제안한 방법이 기존 방법보다 $2\%$정도 성능이 향상되었다. 또한, 제안된 방법은 전대역에 기반한 파라미터 차원 수를 다중대역의 개수로 분할하여 사용하므로 계산량의 감소와 저장 공간면에서 효율적이다.
PDF

Search Result 224, Processing Time 0.038 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)