Search | Korea Science

Audio Fingerprint Retrieval Method Based on Feature Dimension Reduction and Feature Combination

Zhang, Qiu-yu;Xu, Fu-jiu;Bai, Jian
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.15 no.2
- /
- pp.522-539
- /
- 2021
In order to solve the problems of the existing audio fingerprint method when extracting audio fingerprints from long speech segments, such as too large fingerprint dimension, poor robustness, and low retrieval accuracy and efficiency, a robust audio fingerprint retrieval method based on feature dimension reduction and feature combination is proposed. Firstly, the Mel-frequency cepstral coefficient (MFCC) and linear prediction cepstrum coefficient (LPCC) of the original speech are extracted respectively, and the MFCC feature matrix and LPCC feature matrix are combined. Secondly, the feature dimension reduction method based on information entropy is used for column dimension reduction, and the feature matrix after dimension reduction is used for row dimension reduction based on energy feature dimension reduction method. Finally, the audio fingerprint is constructed by using the feature combination matrix after dimension reduction. When speech's user retrieval, the normalized Hamming distance algorithm is used for matching retrieval. Experiment results show that the proposed method has smaller audio fingerprint dimension and better robustness for long speech segments, and has higher retrieval efficiency while maintaining a higher recall rate and precision rate.
https://doi.org/10.3837/tiis.2021.02.008 인용 PDF KSCI HTML

Dimension Reduction Method of Speech Feature Vector for Real-Time Adaptation of Voice Activity Detection (음성구간 검출기의 실시간 적응화를 위한 음성 특징벡터의 차원 축소 방법)

Park Jin-Young;Lee Kwang-Seok;Hur Kang-In
- Journal of the Institute of Convergence Signal Processing
- /
- v.7 no.3
- /
- pp.116-121
- /
- 2006
In this paper, we propose the dimension reduction method of multi-dimension speech feature vector for real-time adaptation procedure in various noisy environments. This method which reduces dimensions non-linearly to map the likelihood of speech feature vector and noise feature vector. The LRT(Likelihood Ratio Test) is used for classifying speech and non-speech. The results of implementation are similar to multi-dimensional speech feature vector. The results of speech recognition implementation of detected speech data are also similar to multi-dimensional(10-order dimensional MFCC(Mel-Frequency Cepstral Coefficient)) speech feature vector.
PDF

3D Data Dimension Reduction for Efficient Feature Extraction in Posture Recognition (포즈 인식에서 효율적 특징 추출을 위한 3차원 데이터의 차원 축소)

Kyoung, Dong-Wuk;Lee, Yun-Li;Jung, Kee-Chul
- The KIPS Transactions:PartB
- /
- v.15B no.5
- /
- pp.435-448
- /
- 2008
3D posture recognition is a solution to overcome the limitation of 2D posture recognition. There are many researches carried out for 3D posture recognition using 3D data. The 3D data consist of massive surface points which are rich of information. However, it is difficult to extract the important features for posture recognition purpose. Meanwhile, it also consumes lots of processing time. In this paper, we introduced a dimension reduction method that transform 3D surface points of an object to 2D data representation in order to overcome the issues of feature extraction and time complexity of 3D posture recognition. For a better feature extraction and matching process, a cylindrical boundary is introduced in meshless parameterization, its offer a fast processing speed of dimension reduction process and the output result is applicable for recognition purpose. The proposed approach is applied to hand and human posture recognition in order to verify the efficiency of the feature extraction.
https://doi.org/10.3745/KIPSTB.2008.15-B.5.435 인용 PDF KSCI

Action Recognition with deep network features and dimension reduction

Li, Lijun;Dai, Shuling
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.13 no.2
- /
- pp.832-854
- /
- 2019
Action recognition has been studied in computer vision field for years. We present an effective approach to recognize actions using a dimension reduction method, which is applied as a crucial step to reduce the dimensionality of feature descriptors after extracting features. We propose to use sparse matrix and randomized kd-tree to modify it and then propose modified Local Fisher Discriminant Analysis (mLFDA) method which greatly reduces the required memory and accelerate the standard Local Fisher Discriminant Analysis. For feature encoding, we propose a useful encoding method called mix encoding which combines Fisher vector encoding and locality-constrained linear coding to get the final video representations. In order to add more meaningful features to the process of action recognition, the convolutional neural network is utilized and combined with mix encoding to produce the deep network feature. Experimental results show that our algorithm is a competitive method on KTH dataset, HMDB51 dataset and UCF101 dataset when combining all these methods.
https://doi.org/10.3837/tiis.2019.02.019 인용 PDF KSCI HTML

An investigation of subband decomposition and feature-dimension reduction for musical genre classification (음악 장르 분류를 위한 부밴드 분해와 특징 차수 축소에 관한 연구)

Seo, Jin Soo;Kim, Junghyun;Park, Jihyun
- The Journal of the Acoustical Society of Korea
- /
- v.36 no.2
- /
- pp.144-150
- /
- 2017
Musical genre is indispensible in constructing music information retrieval system, such as music search and classification. In general, the spectral characteristics of a music signal are obtained based on a subband decomposition to represent the relative distribution of the harmonic and the non-harmonic components. In this paper, we investigate the subband decomposition parameters in extracting features, which improves musical genre classification accuracy. In addition, the linear projection methods are studied to reduce the resulting feature dimension. Experiments on the widely used music datasets confirmed that the subband decomposition finer than the widely-adopted octave scale is conducive in improving genre-classification accuracy and showed that the feature-dimension reduction is effective reducing a classifier's computational complexity.
https://doi.org/10.7776/ASK.2017.36.2.144 인용 PDF KSCI

Discriminative Manifold Learning Network using Adversarial Examples for Image Classification

Zhang, Yuan;Shi, Biming
- Journal of Electrical Engineering and Technology
- /
- v.13 no.5
- /
- pp.2099-2106
- /
- 2018
This study presents a novel approach of discriminative feature vectors based on manifold learning using nonlinear dimension reduction (DR) technique to improve loss function, and combine with the Adversarial examples to regularize the object function for image classification. The traditional convolutional neural networks (CNN) with many new regularization approach has been successfully used for image classification tasks, and it achieved good results, hence it costs a lot of Calculated spacing and timing. Significantly, distrinct from traditional CNN, we discriminate the feature vectors for objects without empirically-tuned parameter, these Discriminative features intend to remain the lower-dimensional relationship corresponding high-dimension manifold after projecting the image feature vectors from high-dimension to lower-dimension, and we optimize the constrains of the preserving local features based on manifold, which narrow the mapped feature information from the same class and push different class away. Using Adversarial examples, improved loss function with additional regularization term intends to boost the Robustness and generalization of neural network. experimental results indicate that the approach based on discriminative feature of manifold learning is not only valid, but also more efficient in image classification tasks. Furthermore, the proposed approach achieves competitive classification performances for three benchmark datasets : MNIST, CIFAR-10, SVHN.
https://doi.org/10.5370/JEET.2018.13.5.2099 인용 PDF KSCI

A Classification Method Using Data Reduction

Uhm, Daiho;Jun, Sung-Hae;Lee, Seung-Joo
- International Journal of Fuzzy Logic and Intelligent Systems
- /
- v.12 no.1
- /
- pp.1-5
- /
- 2012
Data reduction has been used widely in data mining for convenient analysis. Principal component analysis (PCA) and factor analysis (FA) methods are popular techniques. The PCA and FA reduce the number of variables to avoid the curse of dimensionality. The curse of dimensionality is to increase the computing time exponentially in proportion to the number of variables. So, many methods have been published for dimension reduction. Also, data augmentation is another approach to analyze data efficiently. Support vector machine (SVM) algorithm is a representative technique for dimension augmentation. The SVM maps original data to a feature space with high dimension to get the optimal decision plane. Both data reduction and augmentation have been used to solve diverse problems in data analysis. In this paper, we compare the strengths and weaknesses of dimension reduction and augmentation for classification and propose a classification method using data reduction for classification. We will carry out experiments for comparative studies to verify the performance of this research.
https://doi.org/10.5391/IJFIS.2012.12.1.1 인용 PDF KSCI

Comparative Analysis of Dimensionality Reduction Techniques for Advanced Ransomware Detection with Machine Learning (기계학습 기반 랜섬웨어 공격 탐지를 위한 효과적인 특성 추출기법 비교분석)

Kim Han Seok;Lee Soo Jin
- Convergence Security Journal
- /
- v.23 no.1
- /
- pp.117-123
- /
- 2023
To detect advanced ransomware attacks with machine learning-based models, the classification model must train learning data with high-dimensional feature space. And in this case, a 'curse of dimension' phenomenon is likely to occur. Therefore, dimensionality reduction of features must be preceded in order to increase the accuracy of the learning model and improve the execution speed while avoiding the 'curse of dimension' phenomenon. In this paper, we conducted classification of ransomware by applying three machine learning models and two feature extraction techniques to two datasets with extremely different dimensions of feature space. As a result of the experiment, the feature dimensionality reduction techniques did not significantly affect the performance improvement in binary classification, and it was the same even when the dimension of featurespace was small in multi-class clasification. However, when the dataset had high-dimensional feature space, LDA(Linear Discriminant Analysis) showed quite excellent performance.
https://doi.org/10.33778/kcsa.2023.23.1.117 인용 PDF

PCA-SVM Based Vehicle Color Recognition (PCA-SVM 기법을 이용한 차량의 색상 인식)

Park, Sun-Mi;Kim, Ku-Jin
- The KIPS Transactions:PartB
- /
- v.15B no.4
- /
- pp.285-292
- /
- 2008
Color histograms have been used as feature vectors to characterize the color features of given images, but they have a limitation in efficiency by generating high-dimensional feature vectors. In this paper, we present a method to reduce the dimension of the feature vectors by applying PCA (principal components analysis) to the color histogram of a given vehicle image. With SVM (support vector machine) method, the dimension-reduced feature vectors are used to recognize the colors of vehicles. After reducing the dimension of the feature vector by a factor of 32, the successful recognition rate is reduced only 1.42% compared to the case when we use original feature vectors. Moreover, the computation time for the color recognition is reduced by a factor of 31, so we could recognize the colors efficiently.
https://doi.org/10.3745/KIPSTB.2008.15-B.4.285 인용 PDF KSCI

Music Mood Classification based on a New Feature Reduction Method and Modular Neural Network (단위 신경망과 특징벡터 차원 축소 기반의 음악 분위기 자동판별)

Song, Min Kyun;Kim, HyunSoo;Moon, Chang-Bae;Kim, Byeong Man;Oh, Dukhwan
- Journal of Korea Society of Industrial Information Systems
- /
- v.18 no.4
- /
- pp.25-35
- /
- 2013
This paper focuses on building a generalized mood classification model with many mood classes instead of a personalized one with few mood classes. Two methods are adopted to improve the performance of mood classification. The one of them is feature reduction based on standard deviation of feature values, which is designed to solve the problem of lowered performance when all 391 features provided by MIR toolbox used to extract features of music. The experiments show that the feature reduction methods suggested in this paper have better performance than that of the conventional dimension reduction methods, R-Square and PCA. As performance improvement by feature reduction only is subject to limit, modular neural network is used as another method to improve the performance. The experiments show that the method also improves performance effectively.
https://doi.org/10.9723/jksiis.2013.18.4.025 인용 PDF KSCI

Search Result 106, Processing Time 0.018 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)