Search | Korea Science

Speaker Identification Using PCA Fuzzy Mixture Model (PCA 퍼지 혼합 모델을 이용한 화자 식별)

Lee, Ki-Yong
- Speech Sciences
- /
- v.10 no.4
- /
- pp.149-157
- /
- 2003
In this paper, we proposed the principal component analysis (PCA) fuzzy mixture model for speaker identification. A PCA fuzzy mixture model is derived from the combination of the PCA and the fuzzy version of mixture model with diagonal covariance matrices. In this method, the feature vectors are first transformed by each speaker's PCA transformation matrix to reduce the correlation among the elements. Then, the fuzzy mixture model for speaker is obtained from these transformed feature vectors with reduced dimensions. The orthogonal Gaussian Mixture Model (GMM) can be derived as a special case of PCA fuzzy mixture model. In our experiments, with having the number of mixtures equal, the proposed method requires less training time and less storage as well as shows better speaker identification rate compared to the conventional GMM. Also, the proposed one shows equal or better identification performance than the orthogonal GMM does.
PDF

Face Recognition using LDA Mixture Model (LDA 혼합 모형을 이용한 얼굴 인식)

Kim Hyun-Chul;Kim Daijin;Bang Sung-Yang
- Journal of KIISE:Software and Applications
- /
- v.32 no.8
- /
- pp.789-794
- /
- 2005
LDA (Linear Discriminant Analysis) provides the projection that discriminates the data well, and shows a very good performance for face recognition. However, since LDA provides only one transformation matrix over whole data, it is not sufficient to discriminate the complex data consisting of many classes like honan faces. To overcome this weakness, we propose a new face recognition method, called LDA mixture model, that the set of alf classes are partitioned into several clusters and we get a transformation matrix for each cluster. This detailed representation will improve the classification performance greatly. In the simulation of face recognition, LDA mixture model outperforms PCA, LDA, and PCA mixture model in terms of classification performance.
PDF KSCI

An Efficient Model Selection Method for a PCA Mixture Model (PCA 혼합 모형을 위한 효율적인 구조 선택 방법)

김현철;김대진;방승양
- Proceedings of the Korean Information Science Society Conference
- /
- 2001.04b
- /
- pp.538-540
- /
- 2001
PCA는 다변수 데이터 해석법 중 가장 널리 알려진 방법 중 하나로 많은 응용을 가지고 있다. 그런데, PCA는 선형 모델이어서 비선형 구조를 분석하는데 효과적이지 않다. 이를 극복하기 위해서 PCA의 조합을 이용하는 PCA 혼합 모형이 제안되었다. PCA 혼합 모형의 핵심은 구조 선택, 즉 mixture 요소의 수와 PCA 기저의 수의 결정 인데 그의 체계적인 결정 방법이 필요하다. 본 논문에서는 단순화된 PCA 혼합 모형과 이를 위한 효율적인 구조 선택 방법을 제안한다. 각각의 mixture 요소 수에 대해서 모든 PCA 기저를 갖도록 한 상태에서 PCA 혼합 모형의 파라미터를 EM 알고리즘을 써서 결정한다. 최적의 mixture 요소의 수는 오류를 최소로 하는 것으로 결정한다. PCA 기저의 수는 PCA의 정렬성 특성을 이용해서 중요도가 적은 기저부터 하나씩 잘라 내며 오류가 최소로 하는 것으로 결정한다. 제안된 방법은 특히 다차원 데이터의 경우에 EM 학습의 횟수를 많이 줄인다. 인공 데이터에 대한 실험은 제안된 방법이 적절한 모델 구조를 결정한다는 것을 보여준다. 또, 눈 감지에 대한 실험은 제안된 방법이 실용적으로도 유용하다는 것을 보여준다.
PDF

Extensions of LDA by PCA Mixture Model and Class-wise Features (PCA 혼합 모형과 클래스 기반 특징에 의한 LDA의 확장)

Kim Hyun-Chul;Kim Daijin;Bang Sung-Yang
- Journal of KIISE:Software and Applications
- /
- v.32 no.8
- /
- pp.781-788
- /
- 2005
LDA (Linear Discriminant Analysis) is a data discrimination technique that seeks transformation to maximize the ratio of the between-class scatter and the within-class scatter While it has been successfully applied to several applications, it has two limitations, both concerning the underfitting problem. First, it fails to discriminate data with complex distributions since all data in each class are assumed to be distributed in the Gaussian manner; and second, it can lose class-wise information, since it produces only one transformation over the entire range of classes. We propose three extensions of LDA to overcome the above problems. The first extension overcomes the first problem by modeling the within-class scatter using a PCA mixture model that can represent more complex distribution. The second extension overcomes the second problem by taking different transformation for each class in order to provide class-wise features. The third extension combines these two modifications by representing each class in terms of the PCA mixture model and taking different transformation for each mixture component. It is shown that all our proposed extensions of LDA outperform LDA concerning classification errors for handwritten digit recognition and alphabet recognition.
PDF KSCI

Improved Algorithm for Fully-automated Neural Spike Sorting based on Projection Pursuit and Gaussian Mixture Model

Kim, Kyung-Hwan
- International Journal of Control, Automation, and Systems
- /
- v.4 no.6
- /
- pp.705-713
- /
- 2006
For the analysis of multiunit extracellular neural signals as multiple spike trains, neural spike sorting is essential. Existing algorithms for the spike sorting have been unsatisfactory when the signal-to-noise ratio(SNR) is low, especially for implementation of fully-automated systems. We present a novel method that shows satisfactory performance even under low SNR, and compare its performance with a recent method based on principal component analysis(PCA) and fuzzy c-means(FCM) clustering algorithm. Our system consists of a spike detector that shows high performance under low SNR, a feature extractor that utilizes projection pursuit based on negentropy maximization, and an unsupervised classifier based on Gaussian mixture model. It is shown that the proposed feature extractor gives better performance compared to the PCA, and the proposed combination of spike detector, feature extraction, and unsupervised classification yields much better performance than the PCA-FCM, in that the realization of fully-automated unsupervised spike sorting becomes more feasible.
PDF KSCI

Efficient Speaker Identification based on Robust VQ-PCA (강인한 VQ-PCA에 기반한 효율적인 화자 식별)

Lee Ki-Yong
- Journal of Internet Computing and Services
- /
- v.5 no.3
- /
- pp.57-62
- /
- 2004
In this paper, an efficient speaker identification based on robust vector quantizationprincipal component analysis (VQ-PCA) is proposed to solve the problems from outliers and high dimensionality of training feature vectors in speaker identification, Firstly, the proposed method partitions the data space into several disjoint regions by roust VQ based on M-estimation. Secondly, the robust PCA is obtained from the covariance matrix in each region. Finally, our method obtains the Gaussian Mixture model (GMM) for speaker from the transformed feature vectors with reduced dimension by the robust PCA in each region, Compared to the conventional GMM with diagonal covariance matrix, under the same performance, the proposed method gives faster results with less storage and, moreover, shows robust performance to outliers.
PDF

Speaker Identification Using Greedy Kernel PCA (Greedy Kernel PCA를 이용한 화자식별)

Kim, Min-Seok;Yang, Il-Ho;Yu, Ha-Jin
- MALSORI
- /
- no.66
- /
- pp.105-116
- /
- 2008
In this research, we propose a speaker identification system using a kernel method which is expected to model the non-linearity of speech features well. We have been using principal component analysis (PCA) successfully, and extended to kernel PCA, which is used for many pattern recognition tasks such as face recognition. However, we cannot use kernel PCA for speaker identification directly because the storage required for the kernel matrix grows quadratically, and the computational cost grows linearly (computing eigenvector of $l{\times}l$ matrix) with the number of training vectors I. Therefore, we use greedy kernel PCA which can approximate kernel PCA with small representation error. In the experiments, we compare the accuracy of the greedy kernel PCA with the baseline Gaussian mixture models using MFCCs and PCA. As the results with limited enrollment data show, the greedy kernel PCA outperforms conventional methods.
PDF

Global Covariance based Principal Component Analysis for Speaker Identification (화자식별을 위한 전역 공분산에 기반한 주성분분석)

Seo, Chang-Woo;Lim, Young-Hwan
- Phonetics and Speech Sciences
- /
- v.1 no.1
- /
- pp.69-73
- /
- 2009
This paper proposes an efficient global covariance-based principal component analysis (GCPCA) for speaker identification. Principal component analysis (PCA) is a feature extraction method which reduces the dimension of the feature vectors and the correlation among the feature vectors by projecting the original feature space into a small subspace through a transformation. However, it requires a larger amount of training data when performing PCA to find the eigenvalue and eigenvector matrix using the full covariance matrix by each speaker. The proposed method first calculates the global covariance matrix using training data of all speakers. It then finds the eigenvalue matrix and the corresponding eigenvector matrix from the global covariance matrix. Compared to conventional PCA and Gaussian mixture model (GMM) methods, the proposed method shows better performance while requiring less storage space and complexity in speaker identification.
PDF

Smoothed Local PC0A by BYY data smoothing learning

Liu, Zhiyong;Xu, Lei
- 제어로봇시스템학회:학술대회논문집
- /
- 2001.10a
- /
- pp.109.3-109
- /
- 2001
The so-called curse of dimensionality arises when Gaussian mixture is used on high-dimensional small-sample-size data, since the number of free elements that needs to be specied in each covariance matrix of Gaussian mixture increases exponentially with the number of dimension d. In this paper, by constraining the covariance matrix in its decomposed orthonormal form we get a local PCA model so as to reduce the number of free elements needed to be specified. Moreover, to cope with the small sample size problem, we adopt BYY data smoothing learning which is a regularization over maximum likelihood learning obtained from BYY harmony learning to implement this local PCA model.
PDF

A Study on Face Expression Recognition using LDA Mixture Model and Nearest Neighbor Pattern Classification (LDA 융합모델과 최소거리패턴분류법을 이용한 얼굴 표정 인식 연구)

No, Jong-Heun;Baek, Yeong-Hyeon;Mun, Seong-Ryong;Gang, Yeong-Jin
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 2006.11a
- /
- pp.167-170
- /
- 2006
본 논문은 선형분류기인 LDA 융합모델과 최소거리패턴분류법을 이용한 얼굴표정인식 알고리즘 연구에 관한 것이다. 제안된 알고리즘은 얼굴 표정을 인식하기 위해 두 단계의 특징 추출과정과 인식단계를 거치게 된다. 먼저 특징추출 단계에서는 얼굴 표정이 담긴 영상을 PCA를 이용해 고차원에서 저차원의 공간으로 변환한 후, LDA 이용해 특징벡터를 클래스 별로 나누어 분류한다. 다음 단계로 LDA융합모델을 통해 계산된 특징벡터에 최소거리패턴분류법을 적용함으로서 얼굴 표정을 인식한다. 제안된 알고리즘은 6가지 기본 감정(기쁨, 화남, 놀람, 공포, 슬픔, 혐오)으로 구성된 데이터베이스를 이용해 실험한 결과, 기존알고리즘에 비해 향상된 인식률과 특정 표정에 관계없이 고른 인식률을 보임을 확인하였다.
PDF

Search Result 29, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)