Search | Korea Science

Extraction of MFCC feature parameters based on the PCA-optimized filter bank and Korean connected 4-digit telephone speech recognition (PCA-optimized 필터뱅크 기반의 MFCC 특징파라미터 추출 및 한국어 4연숫자 전화음성에 대한 인식실험)

정성윤;김민성;손종목;배건성
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.41 no.6
- /
- pp.279-283
- /
- 2004
In general, triangular shape filters are used in the filter bank when we extract MFCC feature parameters from the spectrum of the speech signal. A different approach, which uses specific filter shapes in the filter bank that are optimized to the spectrum of training speech data, is proposed by Lee et al. to improve the recognition rate. A principal component analysis method is used to get the optimized filter coefficients. Using a large amount of 4-digit telephone speech database, in this paper, we get the MFCCs based on the PCA-optimized filter bank and compare the recognition performance with conventional MFCCs and direct weighted filter bank based MFCCs. Experimental results have shown that the MFCC based on the PCA-optimized filter bank give slight improvement in recognition rate compared to the conventional MFCCs but fail to achieve better performance than the MFCCs based on the direct weighted filter bank analysis. Experimental results are discussed with our findings.
PDF KSCI

Semi-auto Calibration Method Using Circular Sample Pixel and Homography Estimation (원형 샘플 화소와 호모그래피 예측을 이용한 반자동 카메라 캘리브레이션 방법)

Shin, Dong-Won;Ho, Yo-Sung
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2015.11a
- /
- pp.67-70
- /
- 2015
최근 깊이 영상 기반 렌더링 방법을 이용하여 제작된 3차원 컨텐츠가 우리의 눈을 즐겁게 해주고 있다. 이러한 깊이 영상 기반 렌더링에서는 필연적으로 색상 카메라와 깊이 카메라 간의 시점 차이가 발생한다. 따라서 두 시점을 일치시키는 전처리 과정으로서 카메라 파라미터가 중요한 역할을 수행한다. 카메라 파라미터를 획득하는 과정으로 카메라 캘리브레이션이 수행된다. 널리 사용되는 기존의 카메라 캘리브레이션 방법은 평면의 체스보드 패턴을 여러 자세로 촬영한 다음 패턴 특징점을 손으로 직접 선택해야하는 불편함이 따른다. 따라서 본 논문에서는 이 문제를 해결하기 위해 원형 샘플 화소 검사와 호모그래피 예측을 이용한 반자동 카메라 캘리브레이션을 제안한다. 제안하는 방법은 먼저 FAST 코너 검출 알고리즘을 이용하여 패턴 특징점의 후보를 영상으로부터 추출한다. 다음으로 원형 샘플 화소를 검사하여 후보군의 크기를 줄인다. 그리고 호모그래피 예측을 통해 손실된 패턴 특징점을 보완하는 완전한 패턴 특징점군을 획득한다. 마지막으로 화소 정확성 향상을 통해 실수 단위의 정확성을 가지는 패턴 특징점의 위치를 획득한다. 실험을 통해 제안하는 방법이 기존의 방법과 비교하여 카메라 파라미터의 정확성은 유지하고 수작업의 불편함을 해소할 수 있음을 확인했다.
PDF

Feature Parameter Extraction and Speech Recognition Using Matrix Factorization (Matrix Factorization을 이용한 음성 특징 파라미터 추출 및 인식)

Lee Kwang-Seok;Hur Kang-In
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.10 no.7
- /
- pp.1307-1311
- /
- 2006
In this paper, we propose new speech feature parameter using the Matrix Factorization for appearance part-based features of speech spectrum. The proposed parameter represents effective dimensional reduced data from multi-dimensional feature data through matrix factorization procedure under all of the matrix elements are the non-negative constraint. Reduced feature data presents p art-based features of input data. We verify about usefulness of NMF(Non-Negative Matrix Factorization) algorithm for speech feature extraction applying feature parameter that is got using NMF in Mel-scaled filter bank output. According to recognition experiment results, we confirm that proposed feature parameter is superior to MFCC(Mel-Frequency Cepstral Coefficient) in recognition performance that is used generally.
PDF KSCI

Robust Face Alignment using Progressive AAM (점진적 AAM을 이용한 강인한 얼굴 윤곽 검출)

Kim, Dae-Hwan;Kim, Jae-Min;Cho, Seong-Won;Jang, Yong-Suk;Kim, Boo-Gyoun;Chung, Sun-Tae
- The Journal of the Korea Contents Association
- /
- v.7 no.2
- /
- pp.11-20
- /
- 2007
AAM has been successfully applied to face alignment, but its performance is very sensitive to initial values. In this paper, we propose a face alignment method using progressive AAM. The proposed method consists of two stages; modelling and relation derivation stage and fitting stage. Modelling and relation derivation stage first builds two AAM models; the inner face AAM model and the whole face AAM model and then derive the relation matrix between the inner face AAM model parameter vector and the whole face AAM model parameter vector. The fitting stage is processed progressively in two phases. In the first phase, the proposed method finds the feature parameters for the inner facial feature points of a new face, and then in the second phase it localizes the whole facial feature points of the new face using the initial values estimated utilizing the inner feature parameters obtained in the first phase and the relation matrix obtained in the first stage. Through experiments, it is verified that the proposed progressive AAM-based face alignment method is more robust with respect to pose, and face background than the conventional basic AAM-based face alignment.
https://doi.org/10.5392/JKCA.2007.7.2.011 인용 PDF

A Study on Comfortableness Classification using Multi-channel EEG and Neural Network (다중채널 뇌파와 신경회로망을 이용한 쾌적성 분류에 관한 연구)

김흥환;이상한;강동기;김동준;고한우
- Proceedings of the Korean Society for Emotion and Sensibility Conference
- /
- 2002.05a
- /
- pp.215-220
- /
- 2002
본 연구에서는 다중채널 뇌파에서 특징 파라미터로 선형 예측기 계수(Linear predictor coefficients)를 추출하고, 패턴인식기로는 신경회로망을 이용한 쾌적성 분류 알고리즘을 개발하여 다중 템플릿 방법으로 쾌적성 분류 실험을 하고자 하였다. 뇌파 데이터는 대학생 10명으로부터 쾌적한 환경과 불쾌적한 환경에서의 데이터를 수집하였으며, 전극 위치는 Fpl, Fp2, F3, F4, T3, T4, P3, P4, O1, O2를 사용하였다. 수집된 뇌파는 전처리를 거친 후 특징 파라미터를 추출하고 패턴 분류기로 사용된 신경회로망의 입력으로 사용하였다. 쾌적성 분류 방법은 다중템플릿 방법으로 여러 명의 피검자를 각각 학습시켜 이로부터 생성되는 신경회로망의 가중치들을 템플릿에 저장한다. 그리고 테스트를 할 때에는 먼저 처음의 안정 상태의 뇌파를 이용하여 템플릿 검색을 하고 가장 가까운 템플릿을 선택한다. 그리고 선택된 템플릿을 이용하여 다른 감정에 대한 쾌적성 분류 실험을 하게 된다. 쾌적성 분류 실험 결과 평균 인식률이 약 75%의 성능을 나타내었다.
PDF

Content Based Classification of Audio Signal using Discriminant Function (식별함수를 이용한 오디오신호의 내용기반 분류)

Kim, Young-Sub;Lee, Kwang-Seok;Koh, Si-Young;Hur, Kang-In
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2007.06a
- /
- pp.201-204
- /
- 2007
In this paper, we research the content-based analysis and classification according to the composition of the feature parameters pool for the auditory signals to implement the auditory indexing and searching system. Auditory data is classified to the primitive various auditory types. we described the analysis and feature extraction method for the feature parameters available to the auditory data classification. And we compose the feature parameters pool in the indexing group unit, then compare and analysis the auditory data centering around the including level and indexing criterion into the audio categories. Based on this result, we composit feature vectors of audio data according to the classification categories, then experiment the classification using discrimination function.
PDF

A Study on Classification of Four Emotions using EEG (뇌파를 이용한 4가지 감정 분류에 관한 연구)

강동기;김동준;김흥환;고한우
- Proceedings of the Korean Society for Emotion and Sensibility Conference
- /
- 2001.11a
- /
- pp.87-90
- /
- 2001
본 연구에서는 감성 평가 시스템에 가장 적합한 파라미터를 찾기 위하여 3가지 뇌파 파라미터를 이용하여 감정 분류 실험을 하였다. 뇌파 파라미터는 선형예측기계수(linear predictor coefficients)와 FFT 스펙트럼 및 AR 스펙트럼의 밴드별 상호상관계수(cross-correlation coefficients)를 이용하였으며, 감정은 relaxation, joy, sadness, irritation으로 설정하였다. 뇌파 데이터는 대학의 연극동아리 학생 4명을 대상으로 수집하였으며, 전극 위치는 Fp1, Fp2, F3, F4, T3, T4, P3, P4, O1, O2를 사용하였다. 수집된 뇌파 데이터는 전처리를 거친 후 특징 파라미터를 추출하고 패턴 분류기로 사용된 신경회로망(neural network)에 입력하여 감정 분류를 하였다. 감정 분류실험 결과 선형예측기계수를 이용하는 것이 다른 2가지 보다 좋은 성능을 나타내었다.
PDF

Pattern Feature Detection for Camera Calibration using Circular Sample Pixel (원형 샘플 화소를 이용한 카메라 캘리브레이션 패턴 특징점 검출)

Shin, Dong-Won;Ho, Yo-Sung
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2015.07a
- /
- pp.433-434
- /
- 2015
카메라 캘리브레이션은 다시점 카메라 시스템에서 내부와 외부 인자로 이루어진 카메라 파라미터를 획득하는 과정을 의미 한다. 이는 3차원으로 표현되는 장면과 카메라간의 구조를 다루기 위해 중요하다. 그러나 카메라 캘리브레이션은 사람이 직접 손으로 각 영상에서 사각형의 네 점을 정확히 찍어 주어야 하는 과정 때문에 카메라의 수와 패턴 영상의 수가 늘어남에 따라 상당히 번거로운 작업이 된다. 본 논문에서는 카메라 캘리브레이션 과정에서 손으로 수행하는 작업을 줄이기 위해 자동으로 패턴 특징점을 탐색하는 알고리즘을 제안한다. 제안하는 방법은 먼저 영상에서 패턴 특징점의 후보를 찾기 위해 해리스 코너 검출 방법을 사용한다. 그리고 후보 주변의 원형 샘플 화소를 이용하여 유효한 패턴 특징점을 추출한다. 실험 결과는 Matlab 캘리브레이션 툴박스를 이용하여 획득한 카메라 파라미터와 비교해 보았을 때 큰 차이가 없지만 수작업의 번거로움을 상당히 감소시켰음을 확인하였다.
PDF

A Study on Emotion Classification using 4-Channel EEG Signals (4채널 뇌파 신호를 이용한 감정 분류에 관한 연구)

Kim, Dong-Jun;Lee, Hyun-Min
- The Journal of Korea Institute of Information, Electronics, and Communication Technology
- /
- v.2 no.2
- /
- pp.23-28
- /
- 2009
This study describes an emotion classification method using two different feature parameters of four-channel EEG signals. One of the parameters is linear prediction coefficients based on AR modelling. Another one is cross-correlation coefficients on frequencies of ${\theta}$, ${\alpha}$, ${\beta}$ bands of FFT spectra. Using the linear predictor coefficients and the cross-correlation coefficients of frequencies, the emotion classification test for four emotions, such as anger, sad, joy, and relaxation is performed with an artificial neural network. The results of the two parameters showed that the linear prediction coefficients have produced the better results for emotion classification than the cross-correlation coefficients of FFT spectra.
PDF

Adaptive Object Classification using DWT and FI (이산웨이블릿 변환과 퍼지추론을 이용한 적응적 물체 분류)

Kim, Yoon-Ho
- Journal of Advanced Navigation Technology
- /
- v.10 no.3
- /
- pp.219-225
- /
- 2006
This paper presents a method of object classification based on discrete wavelet transform (DWT) and fuzzy inference(FI). It concentrated not only on the design of fuzzy inference algorithm which is suitable for low speed uninhabited transportation such as, conveyor but also on the minimize the number of fuzzy rule. In the preprocess of feature extracting, feature parameters are extracted by using characteristics of the coefficients matrix of DWT. Such feature parameters as area, perimeter and a/p ratio are used obtained from DWT coefficients blocks. Secondly, fuzzy if - then rules that can be able to adapt the variety of surroundings are developed. In order to verify the performance of proposed scheme, In the middle of fuzzy inference, the Mamdani's and the Larsen 's implication operators are utilized. Experimental results showed that proposed scheme can be applied to the variety of surroundings.
PDF

Search Result 225, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)