Search | Korea Science

Performance Comparison of Deep Feature Based Speaker Verification Systems (깊은 신경망 특징 기반 화자 검증 시스템의 성능 비교)

Kim, Dae Hyun;Seong, Woo Kyeong;Kim, Hong Kook
- Phonetics and Speech Sciences
- /
- v.7 no.4
- /
- pp.9-16
- /
- 2015
In this paper, several experiments are performed according to deep neural network (DNN) based features for the performance comparison of speaker verification (SV) systems. To this end, input features for a DNN, such as mel-frequency cepstral coefficient (MFCC), linear-frequency cepstral coefficient (LFCC), and perceptual linear prediction (PLP), are first compared in a view of the SV performance. After that, the effect of a DNN training method and a structure of hidden layers of DNNs on the SV performance is investigated depending on the type of features. The performance of an SV system is then evaluated on the basis of I-vector or probabilistic linear discriminant analysis (PLDA) scoring method. It is shown from SV experiments that a tandem feature of DNN bottleneck feature and MFCC feature gives the best performance when DNNs are configured using a rectangular type of hidden layers and trained with a supervised training method.
https://doi.org/10.13064/KSSS.2015.7.4.009 인용 PDF KSCI

Linear Feature Extraction from Satellite Imagery using Discontinuity-Based Segmentation Algorithm

Niaraki, Abolghasem Sadeghi;Kim, Kye-Hyun;Shojaei, Asghar
- Proceedings of the KSRS Conference
- /
- v.2
- /
- pp.643-646
- /
- 2006
This paper addresses the approach to extract linear features from satellite imagery using an efficient segmentation method. The extraction of linear features from satellite images has been the main concern of many scientists. There is a need to develop a more capable and cost effective method for the Iranian map revision tasks. The conventional approaches for producing, maintaining, and updating GIS map are time consuming and costly process. Hence, this research is intended to investigate how to obtain linear features from SPOT satellite imagery. This was accomplished using a discontinuity-based segmentation technique that encompasses four stages: low level bottom-up, middle level bottom-up, edge thinning and accuracy assessment. The first step is geometric correction and noise removal using suitable operator. The second step includes choosing the appropriate edge detection method, finding its proper threshold and designing the built-up image. The next step is implementing edge thinning method using mathematical morphology technique. Lastly, the geometric accuracy assessment task for feature extraction as well as an assessment for the built-up result has been carried out. Overall, this approach has been applied successfully for linear feature extraction from SPOT image.
PDF

Speaker Adaptation Using ICA-Based Feature Transformation

Jung, Ho-Young;Park, Man-Soo;Kim, Hoi-Rin;Hahn, Min-Soo
- ETRI Journal
- /
- v.24 no.6
- /
- pp.469-472
- /
- 2002
Speaker adaptation techniques are generally used to reduce speaker differences in speech recognition. In this work, we focus on the features fitted to a linear regression-based speaker adaptation. These are obtained by feature transformation based on independent component analysis (ICA), and the feature transformation matrices are estimated from the training data and adaptation data. Since the adaptation data is not sufficient to reliably estimate the ICA-based feature transformation matrix, it is necessary to adjust the ICA-based feature transformation matrix estimated from a new speaker utterance. To cope with this problem, we propose a smoothing method through a linear interpolation between the speaker-independent (SI) feature transformation matrix and the speaker-dependent (SD) feature transformation matrix. From our experiments, we observed that the proposed method is more effective in the mismatched case. In the mismatched case, the adaptation performance is improved because the smoothed feature transformation matrix makes speaker adaptation using noisy speech more robust.
PDF

A Study on Feature Projection Methods for a Real-Time EMG Pattern Recognition (실시간 근전도 패턴인식을 위한 특징투영 기법에 관한 연구)

Chu, Jun-Uk;Kim, Shin-Ki;Mun, Mu-Seong;Moon, In-Hyuk
- Journal of Institute of Control, Robotics and Systems
- /
- v.12 no.9
- /
- pp.935-944
- /
- 2006
EMG pattern recognition is essential for the control of a multifunction myoelectric hand. The main goal of this study is to develop an efficient feature projection method for EMC pattern recognition. To this end, we propose a linear supervised feature projection that utilizes linear discriminant analysis (LDA). We first perform wavelet packet transform (WPT) to extract the feature vector from four channel EMC signals. For dimensionality reduction and clustering of the WPT features, the LDA incorporates class information into the learning procedure, and finds a linear matrix to maximize the class separability for the projected features. Finally, the multilayer perceptron classifies the LDA-reduced features into nine hand motions. To evaluate the performance of LDA for the WPT features, we compare LDA with three other feature projection methods. From a visualization and quantitative comparison, we show that LDA has better performance for the class separability, and the LDA-projected features improve the classification accuracy with a short processing time. We implemented a real-time pattern recognition system for a multifunction myoelectric hand. In experiment, we show that the proposed method achieves 97.2% recognition accuracy, and that all processes, including the generation of control commands for myoelectric hand, are completed within 97 msec. These results confirm that our method is applicable to real-time EMG pattern recognition far myoelectric hand control.
https://doi.org/10.5302/J.ICROS.2006.12.9.935 인용 PDF KSCI

A Study on the Extraction of Linear Features from Satellite Images and Automatic GCP Filing (위성영상의 선형특징 추출과 이를 이용한 자동 GCP 화일링에 관한 연구)

김정기;강치우;박래홍;이쾌희
- Korean Journal of Remote Sensing
- /
- v.5 no.2
- /
- pp.133-145
- /
- 1989
This paper describes an implementation of linear feature extraction algorithms for satellite images and a method of automatic GCP(Ground Control Point) filing using the extracted linear feature. We propose a new linear feature extraction algorithm which uses magnitude and direction information of edges. The result of applying the proposed algorithm to satellite images are presented and compared with those of the other algorithms. By using the proposed algorithm, automatic GCP filing was successfully performed.
https://doi.org/10.7780/kjrs.1989.5.2.133 인용 PDF

An Improved method of Two Stage Linear Discriminant Analysis

Chen, Yarui;Tao, Xin;Xiong, Congcong;Yang, Jucheng
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.12 no.3
- /
- pp.1243-1263
- /
- 2018
The two-stage linear discrimination analysis (TSLDA) is a feature extraction technique to solve the small size sample problem in the field of image recognition. The TSLDA has retained all subspace information of the between-class scatter and within-class scatter. However, the feature information in the four subspaces may not be entirely beneficial for classification, and the regularization procedure for eliminating singular metrics in TSLDA has higher time complexity. In order to address these drawbacks, this paper proposes an improved two-stage linear discriminant analysis (Improved TSLDA). The Improved TSLDA proposes a selection and compression method to extract superior feature information from the four subspaces to constitute optimal projection space, where it defines a single Fisher criterion to measure the importance of single feature vector. Meanwhile, Improved TSLDA also applies an approximation matrix method to eliminate the singular matrices and reduce its time complexity. This paper presents comparative experiments on five face databases and one handwritten digit database to validate the effectiveness of the Improved TSLDA.
https://doi.org/10.3837/tiis.2018.03.015 인용 PDF KSCI

The Duration Feature of Acoustic Signals and Korean Speakers' Perception of English Stops (구간 신호 길이 자질과 한국인의 영어 파열음 지각)

Kim, Mun-Hyong;Jun, Jong-Sup
- Phonetics and Speech Sciences
- /
- v.1 no.3
- /
- pp.19-28
- /
- 2009
This paper reports experimental findings about the duration feature of the acoustic components of English stops in Korean speakers' voicing perception. In our experiment, 35 participants discriminated between recorded stimuli and digitally transformed stimuli with different duration features from the original stimuli. 72 sets of paired stimuli are generated to test the effects of the duration feature in various phonetic contexts. The result of our experiment is a complicated cross-tabulation with 540 cells defined by five categorical independent variables plus one response variable. To find a meaningful generalization out of this complex frequency table, we ran logit log-linear regression analyses. Surprisingly, we have found that there is no single effect of the duration feature in all phonetic contexts on Korean speakers' perception of the voicing contrasts of English stops. Instead, the logit log-linear analyses reveal that there are interaction effects among phonetic contexts (=C), the places of articulation of stops (=P), and the voicing contrast (=V), and among duration (=T), phonetic contexts, and the places of articulation. To put it in mathematical terms, the distribution of the data can be explained by a simple log-linear equation, logF=${\mu}+{\lambda}CPV+{\lambda}TCP$.
PDF

Effective Combination of Temporal Information and Linear Transformation of Feature Vector in Speaker Verification (화자확인에서 특징벡터의 순시 정보와 선형 변환의 효과적인 적용)

Seo, Chang-Woo;Zhao, Mei-Hua;Lim, Young-Hwan;Jeon, Sung-Chae
- Phonetics and Speech Sciences
- /
- v.1 no.4
- /
- pp.127-132
- /
- 2009
The feature vectors which are used in conventional speaker recognition (SR) systems may have many correlations between their neighbors. To improve the performance of the SR, many researchers adopted linear transformation method like principal component analysis (PCA). In general, the linear transformation of the feature vectors is based on concatenated form of the static features and their dynamic features. However, the linear transformation which based on both the static features and their dynamic features is more complex than that based on the static features alone due to the high order of the features. To overcome these problems, we propose an efficient method that applies linear transformation and temporal information of the features to reduce complexity and improve the performance in speaker verification (SV). The proposed method first performs a linear transformation by PCA coefficients. The delta parameters for temporal information are then obtained from the transformed features. The proposed method only requires 1/4 in the size of the covariance matrix compared with adding the static and their dynamic features for PCA coefficients. Also, the delta parameters are extracted from the linearly transformed features after the reduction of dimension in the static features. Compared with the PCA and conventional methods in terms of equal error rate (EER) in SV, the proposed method shows better performance while requiring less storage space and complexity.
PDF

Performance Simulation of Various Feature-Initialization Algorithms for Forward-Viewing Mono-Camera-Based SLAM (전방 모노카메라 기반 SLAM 을 위한 다양한 특징점 초기화 알고리즘의 성능 시뮬레이션)

Lee, Hun;Kim, Chul Hong;Lee, Tae-Jae;Cho, Dong-Il Dan
- Journal of Institute of Control, Robotics and Systems
- /
- v.22 no.10
- /
- pp.833-838
- /
- 2016
This paper presents a performance evaluation of various feature-initialization algorithms for forward-viewing mono-camera based simultaneous localization and mapping (SLAM), specifically in indoor environments. For mono-camera based SLAM, the position of feature points cannot be known from a single view; therefore, it should be estimated from a feature initialization method using multiple viewpoint measurements. The accuracy of the feature initialization method directly affects the accuracy of the SLAM system. In this study, four different feature initialization algorithms are evaluated in simulations, including linear triangulation; depth parameterized, linear triangulation; weighted nearest point triangulation; and particle filter based depth estimation algorithms. In the simulation, the virtual feature positions are estimated when the virtual robot, containing a virtual forward-viewing mono-camera, moves forward. The results show that the linear triangulation method provides the best results in terms of feature-position estimation accuracy and computational speed.
https://doi.org/10.5302/J.ICROS.2016.16.0096 인용 PDF KSCI

Classification of High Impedance Fault Patterns by Recognition of Linear Prediction coefficients (선형 예측 계수의 인식에 의한 고저항 지락사고 유형의 분류)

Lee, Ho-Seob;Kong, Seong-Gon
- Proceedings of the KIEE Conference
- /
- 1996.07b
- /
- pp.1353-1355
- /
- 1996
This paper presents classification of high impedance fault pattern using linear prediction coefficients. A feature of neutral phase current is extracted by the linear predictive coding. This feature is classified into faults by a multilayer perceptron neural network. Neural network successfully classifies test data into three faults and one normal state.
PDF

Search Result 784, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)