Search | Korea Science

Music Genre Classification Based on Timbral Texture and Rhythmic Content Features

Baniya, Babu Kaji;Ghimire, Deepak;Lee, Joonwhon
- Proceedings of the Korea Information Processing Society Conference
- /
- 2013.05a
- /
- pp.204-207
- /
- 2013
Music genre classification is an essential component for music information retrieval system. There are two important components to be considered for better genre classification, which are audio feature extraction and classifier. This paper incorporates two different kinds of features for genre classification, timbral texture and rhythmic content features. Timbral texture contains several spectral and Mel-frequency Cepstral Coefficient (MFCC) features. Before choosing a timbral feature we explore which feature contributes less significant role on genre discrimination. This facilitates the reduction of feature dimension. For the timbral features up to the 4-th order central moments and the covariance components of mutual features are considered to improve the overall classification result. For the rhythmic content the features extracted from beat histogram are selected. In the paper Extreme Learning Machine (ELM) with bagging is used as classifier for classifying the genres. Based on the proposed feature sets and classifier, experiment is performed with well-known datasets: GTZAN databases with ten different music genres, respectively. The proposed method acquires the better classification accuracy than the existing approaches.
https://doi.org/10.3745/PKIPS.y2013m05a.204 인용 PDF

China Dust-storm Monitoring Using Meteorological Satellite

Xiuqing, Hu;Naimeng, Lu;Peng, Zhang;Qian, Huang
- Proceedings of the KSRS Conference
- /
- 2003.11a
- /
- pp.1224-1226
- /
- 2003
Dust-storm is one of the heaviest hazardous weather which frequently affects most part of northern China in spring. Satellite multi-spectral observations can provide significant information for detecting and quantitative determining the property of dust-storm . An algorithm to monitor dust-storm automatically was developed based on satellite observation. The algorithm utilizes split widows technique and spectral classification technique and also developed a new dust remote sensing product Infra -red Difference Dust Index (IDDI) proxy dust-loading dataset using GMS-5.
PDF

Robot Control based on Steady-State Visual Evoked Potential using Arduino and Emotiv Epoc (아두이노와 Emotiv Epoc을 이용한 정상상태시각유발전위 (SSVEP) 기반의 로봇 제어)

Yu, Je-Hun;Sim, Kwee-Bo
- Journal of the Korean Institute of Intelligent Systems
- /
- v.25 no.3
- /
- pp.254-259
- /
- 2015
In this paper, The wireless robot control system was proposed using Brain-computer interface(BCI) systems based on the steady-state visual evoked potential(SSVEP). Cross Power Spectral Density(CPSD) was used for analysis of electroencephalogram(EEG) and extraction of feature data. And Linear Discriminant Analysis(LDA) and Support Vector Machine(SVM) was used for patterns classification. We obtained the average classification rates of about 70% of each subject. Robot control was implemented using the results of classification of EEG and commanded using bluetooth communication for robot moving.
https://doi.org/10.5391/JKIIS.2015.25.3.254 인용 PDF KSCI

Car Noise Cancellation by Using Spectral Subtraction Method Based on a New Speech/nonspeech Classification Function (새로운 음성/비음성 분류함수에 기반한 스펙트럼 차감법에 의한 차량잡음제거)

박영식;이준재;이응주;하영호
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.19 no.6
- /
- pp.994-1003
- /
- 1994
In this paper, a scheme of noise cancellation using spectral subreaction method with single input in an autombile noise environment is proposed. In order to remove the changing automonile noise components form the noisy speech signal, the noise of various states is analyzed and its characteristics are presented. For the decision of speech/nonspeech and the estimation of noise spectrum, a classification function is proposed on the basis of noise analysis. This function presents the precise decision of speech/nonspeech and the optimal estimation of noise spectrum with less computation. As the result of the estimation of noise spectrum by the proposed classification function, the clean speech signal is extracted from the noisy speech signal with high signal-to-ratio.
PDF

GENERATION OF AN IMPERVIOUS MAP BY APPLYING TASSELED-CAP ENHANCEMENT USING KOMPSAT-2 IMAGE

Koh, Chang-Hwan;Ha, Sung-Ryong
- Proceedings of the KSRS Conference
- /
- 2008.10a
- /
- pp.378-381
- /
- 2008
The regulating and relaxing targets in the Land Use Regulation and Total Maximum Daily Loads are influenced by Land cover information. For the providing more accurate land information, this study attempted to generate an impervious surface map using KOMPSAT-2 image which a Korea manufactured high resolution satellite image. The classification progress of this study carried out by tasseled-cap spectral enhancement through each class extraction technique neither existing classification method. KOMPSAT-2 image of this study is enhanced by Soil Brightness Index(SBI), Green vegetation Index(GVI), None-Such wetness Index(NWI). Then ranges of extracted each index in enhanced image are determined. And then, Confidence Interval of classes was determined through the calculating Non-exceedance Probability. Spectral distributions of each class are changed according to changing of Control coefficient(${\alpha}$) at the calculated Non-exceedance Probability. Previously, Land cover classification map was generated based on established ranges of classes, and then, pervious and impervious surface was reclassified. Finally, impervious ratio of reclassified impervious surface map was calculated with blocks in the study area.
PDF

AN APPROACH TO THE TRAINING OF A SUPPORT VECTOR MACHINE (SVM) CLASSIFIER USING SMALL MIXED PIXELS

Yu, Byeong-Hyeok;Chi, Kwang-Hoon
- Proceedings of the KSRS Conference
- /
- 2008.10a
- /
- pp.386-389
- /
- 2008
It is important that the training stage of a supervised classification is designed to provide the spectral information. On the design of the training stage of a classification typically calls for the use of a large sample of randomly selected pure pixels in order to characterize the classes. Such guidance is generally made without regard to the specific nature of the application in-hand, including the classifier to be used. An approach to the training of a support vector machine (SVM) classifier that is the opposite of that generally promoted for training set design is suggested. This approach uses a small sample of mixed spectral responses drawn from purposefully selected locations (geographical boundaries) in training. A sample of such data should, however, be easier and cheaper to acquire than that suggested by traditional approaches. In this research, we evaluated them against traditional approaches with high-resolution satellite data. The results proved that it can be used small mixed pixels to derive a classification with similar accuracy using a large number of pure pixels. The approach can also reduce substantial costs in training data acquisition because the sampling locations used are commonly easy to observe.
PDF

Object-oriented Classification and QuickBird Multi-spectral Imagery in Forest Density Mapping

Jayakumar, S.;Ramachandran, A.;Lee, Jung-Bin;Heo, Joon
- Korean Journal of Remote Sensing
- /
- v.23 no.3
- /
- pp.153-160
- /
- 2007
Forest cover density studies using high resolution satellite data and object oriented classification are limited in India. This article focuses on the potential use of QuickBird satellite data and object oriented classification in forest density mapping. In this study, the high-resolution satellite data was classified based on NDVI/pixel based and object oriented classification methods and results were compared. The QuickBird satellite data was found to be suitable in forest density mapping. Object oriented classification was superior than the NDVI/pixel based classification. The Object oriented classification method classified all the density classes of forest (dense, open, degraded and bare soil) with higher producer and user accuracies and with more kappa statistics value compared to pixel based method. The overall classification accuracy and Kappa statistics values of the object oriented classification were 83.33% and 0.77 respectively, which were higher than the pixel based classification (68%, 0.56 respectively). According to the Z statistics, the results of these two classifications were significantly different at 95% confidence level.
https://doi.org/10.7780/kjrs.2007.23.3.153 인용 PDF KSCI

Two-stage Deep Learning Model with LSTM-based Autoencoder and CNN for Crop Classification Using Multi-temporal Remote Sensing Images

Kwak, Geun-Ho;Park, No-Wook
- Korean Journal of Remote Sensing
- /
- v.37 no.4
- /
- pp.719-731
- /
- 2021
This study proposes a two-stage hybrid classification model for crop classification using multi-temporal remote sensing images; the model combines feature embedding by using an autoencoder (AE) with a convolutional neural network (CNN) classifier to fully utilize features including informative temporal and spatial signatures. Long short-term memory (LSTM)-based AE (LAE) is fine-tuned using class label information to extract latent features that contain less noise and useful temporal signatures. The CNN classifier is then applied to effectively account for the spatial characteristics of the extracted latent features. A crop classification experiment with multi-temporal unmanned aerial vehicle images is conducted to illustrate the potential application of the proposed hybrid model. The classification performance of the proposed model is compared with various combinations of conventional deep learning models (CNN, LSTM, and convolutional LSTM) and different inputs (original multi-temporal images and features from stacked AE). From the crop classification experiment, the best classification accuracy was achieved by the proposed model that utilized the latent features by fine-tuned LAE as input for the CNN classifier. The latent features that contain useful temporal signatures and are less noisy could increase the class separability between crops with similar spectral signatures, thereby leading to superior classification accuracy. The experimental results demonstrate the importance of effective feature extraction and the potential of the proposed classification model for crop classification using multi-temporal remote sensing images.
https://doi.org/10.7780/kjrs.2021.37.4.4 인용 PDF KSCI HTML

A Comparison of Pixel- and Segment-based Classification for Tree Species Classification using QuickBird Imagery (QuickBird 위성영상을 이용한 수종분류에서 픽셀과 분할기반 분류방법의 정확도 비교)

Chung, Sang Young;Yim, Jong Su;Shin, Man Yong
- Journal of Korean Society of Forest Science
- /
- v.100 no.4
- /
- pp.540-547
- /
- 2011
This study was conducted to compare classification accuracy by tree species using QuickBird imagery for pixel- and segment-based classifications that have been mostly applied to classify land covers. A total of 398 points was used as training and reference data. Based on this points, the points were classified into fourteen land cover classes: four coniferous and seven deciduous tree species in forest classes, and three non-forested classes. In pixel-based classification, three images obtained by using raw spectral values, three tasseled indices, and three components from principal component analysis were produced. For the both classification processes, the maximum likelihood method was applied. In the pixel-based classification, it was resulted that the classification accuracy with raw spectral values was better than those by the other band combinations. As resulted that, the segment-based classification with a scale factor of 50% provided the most accurate classification (overall accuracy:76% and ${\hat{k}}$ value:0.74) compared to the other scale factors and pixel-based classification.
https://doi.org/10.14578/jkfs.2011.100.4.2 인용 PDF KSCI

Speech/Music Signal Classification Based on Spectrum Flux and MFCC For Audio Coder (오디오 부호화기를 위한 스펙트럼 변화 및 MFCC 기반 음성/음악 신호 분류)

Sangkil Lee;In-Sung Lee
- The Journal of Korea Institute of Information, Electronics, and Communication Technology
- /
- v.16 no.5
- /
- pp.239-246
- /
- 2023
In this paper, we propose an open-loop algorithm to classify speech and music signals using the spectral flux parameters and Mel Frequency Cepstral Coefficients(MFCC) parameters for the audio coder. To increase responsiveness, the MFCC was used as a short-term feature parameter and spectral fluxes were used as a long-term feature parameters to improve accuracy. The overall voice/music signal classification decision is made by combining the short-term classification method and the long-term classification method. The Gaussian Mixed Model (GMM) was used for pattern recognition and the optimal GMM parameters were extracted using the Expectation Maximization (EM) algorithm. The proposed long-term and short-term combined speech/music signal classification method showed an average classification error rate of 1.5% on various audio sound sources, and improved the classification error rate by 0.9% compared to the short-term single classification method and 0.6% compared to the long-term single classification method. The proposed speech/music signal classification method was able to improve the classification error rate performance by 9.1% in percussion music signals with attacks and 5.8% in voice signals compared to the Unified Speech Audio Coding (USAC) audio classification method.
https://doi.org/10.17661/jkiiect.2023.16.5.239 인용 PDF HTML

Search Result 469, Processing Time 0.032 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)