• Title/Summary/Keyword: music genre classification

53 search results

Emotion Transition Model based Music Classification Scheme for Music Recommendation (음악 추천을 위한 감정 전이 모델 기반의 음악 분류 기법)

  • Han, Byeong-Jun;Hwang, Een-Jun
    • Journal of IKEEE / v.13 no.2 / pp.159-166 / 2009
  • So far, much research has been done on retrieving music information using static classification descriptors such as genre and mood. Since static classification descriptors are based on diverse content-based musical features, they are effective for retrieving music that is similar in terms of those features. However, the human emotion or mood transitions triggered by music enable more effective and sophisticated queries in music retrieval, and so far few works have evaluated this effect. With a formal representation of such mood transitions, personalized services can be provided more effectively in new applications such as music recommendation. In this paper, we first propose an Emotion State Transition Model (ESTM) for describing human mood transitions induced by music, and then describe a music classification and recommendation scheme based on the ESTM. In the experiment, diverse content-based features were extracted from music clips, dimensionally reduced by NMF (Non-negative Matrix Factorization), and classified by an SVM (Support Vector Machine). In the performance analysis, we achieved an average accuracy of 67.54% and a maximum accuracy of 87.78%.

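The NMF-then-SVM pipeline described in the abstract above can be sketched as follows. The multiplicative-update NMF below is a generic implementation, not the authors' code, and the feature dimension, sample count, and rank are placeholder values; a classifier such as an SVM would consume the columns of `H`.

```python
import numpy as np

def nmf(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Non-negative Matrix Factorization via multiplicative updates.

    Factorizes V (features x samples) into W (features x rank) and
    H (rank x samples), reducing the Frobenius reconstruction error.
    The columns of H are the reduced-dimension representations that a
    downstream classifier (e.g. an SVM) would consume.
    """
    rng = np.random.default_rng(seed)
    n, m = V.shape
    W = rng.random((n, rank)) + eps
    H = rng.random((rank, m)) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)   # update codes
        W *= (V @ H.T) / (W @ H @ H.T + eps)   # update basis
    return W, H

# Toy demo: 20-dim non-negative "audio features" for 30 clips,
# reduced to rank 5 before classification.
rng = np.random.default_rng(1)
V = rng.random((20, 30))
W, H = nmf(V, rank=5)
err = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
print(f"relative reconstruction error: {err:.3f}")
```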

Multistage Feature-based Classification Model (다단계 특징벡터 기반의 분류기 모델)

  • Song, Young-Soo;Park, Dong-Chul
    • Journal of the Institute of Electronics Engineers of Korea CI / v.46 no.1 / pp.121-127 / 2009
  • A Multistage Feature-based Classification Model (MFCM) is proposed in this paper. Rather than classifying each datum with the whole feature vector extracted from the original data at once, MFCM classifies each group of related feature vectors separately. In the training stage, a contribution rate is derived for each feature-vector group from that group's classification accuracy; in the testing stage, the final classification result is obtained by weighting each group's result by its contribution rate. In this paper, the proposed MFCM algorithm is applied to the problem of music genre classification. The results demonstrate that the proposed MFCM outperforms conventional algorithms by 7%-13% on average in terms of classification accuracy.
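The weighting scheme above can be sketched as a simple score combination: each feature-vector group contributes its class scores weighted by a contribution rate, here taken proportional to the group's training accuracy. That proportionality is an assumption for illustration; the paper's exact rate computation may differ.

```python
import numpy as np

def mfcm_combine(group_scores, group_accuracies):
    """Combine per-feature-group class scores in the spirit of MFCM:
    each group's vote is weighted by a contribution rate, assumed here
    to be proportional to the group's training accuracy.

    group_scores: list of (n_classes,) score arrays, one per group.
    group_accuracies: per-group training accuracy in [0, 1].
    Returns the index of the winning class.
    """
    acc = np.asarray(group_accuracies, dtype=float)
    weights = acc / acc.sum()                       # contribution rates
    combined = sum(w * s for w, s in zip(weights, group_scores))
    return int(np.argmax(combined))

# Two feature groups disagree; the more accurate group dominates.
scores_timbre = np.array([0.7, 0.2, 0.1])   # votes for class 0
scores_rhythm = np.array([0.1, 0.6, 0.3])   # votes for class 1
print(mfcm_combine([scores_timbre, scores_rhythm], [0.9, 0.5]))  # 0
```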

Classification of Music Data using Fuzzy c-Means with Divergence Kernel (분산커널 기반의 퍼지 c-평균을 이용한 음악 데이터의 장르 분류)

  • Park, Dong-Chul
    • Journal of the Institute of Electronics Engineers of Korea CI / v.46 no.3 / pp.1-7 / 2009
  • An approach to the classification of music genres using Fuzzy c-Means (FcM) with a divergence-based kernel is proposed in this paper. The proposed model utilizes the mean and covariance information of feature vectors extracted from music data and modelled by a Gaussian Probability Density Function (GPDF). Furthermore, since the classifier utilizes a kernel method that can convert a complicated nonlinear classification boundary into a simpler linear one, it can improve its classification accuracy over conventional algorithms. Experiments on collected music data sets demonstrate that the proposed classification scheme outperforms conventional algorithms, including FcM and SOM, by 17.73%-21.84% on average in terms of classification accuracy.
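The clustering machinery underlying the abstract above can be illustrated with plain Fuzzy c-Means. Note that the Euclidean distance below is a simplified stand-in for the paper's divergence-based kernel, which measures distance between Gaussian PDFs rather than raw vectors; the toy data and parameters are placeholders.

```python
import numpy as np

def fuzzy_c_means(X, c, m=2.0, n_iter=100, seed=0):
    """Plain Fuzzy c-Means with Euclidean distance.

    X: (n_samples, n_features). Returns the membership matrix
    U (n_samples, c) and the cluster centers (c, n_features).
    """
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], c))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        Um = U ** m
        centers = (Um.T @ X) / Um.sum(axis=0)[:, None]
        # distances of every sample to every center
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2) + 1e-12
        U = 1.0 / (d ** (2.0 / (m - 1.0)))          # fuzzy membership update
        U /= U.sum(axis=1, keepdims=True)
    return U, centers

# Two well-separated toy "genre" clusters.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0, 0.3, (20, 2)), rng.normal(5, 0.3, (20, 2))])
U, centers = fuzzy_c_means(X, c=2)
labels = U.argmax(axis=1)
print(labels[:20].min() == labels[:20].max())  # first cluster is pure
```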

Music Genre Classification using Time Delay Neural Network (시간 지연 신경망을 이용한 음악 장르 분류)

  • 이재원;조찬윤;김상균
    • Journal of Korea Multimedia Society / v.4 no.5 / pp.414-422 / 2001
  • This paper proposes a music genre classifier using a time delay neural network (TDNN) for audio data retrieval systems. The classifier covers eight genres: Blues, Country, Hard Core, Hard Rock, Jazz, R&B (Soul), Techno, and Thrash Metal. The unit compared to classify genres is the melody between bars. The melody pattern is extracted based on the snare drum sound, which effectively represents the periodicity of the rhythm. The classifier is built on the TDNN and uses the Fourier-transformed feature vector of the melody as its input pattern. We evaluated the classifier on eighty training samples (ten pieces of music per genre) and forty test samples (five pieces per genre), obtaining correct classification rates of 92.5% and 60%, respectively.

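The time-delay architecture above can be illustrated with a minimal sketch: a TDNN layer computes each output frame from a sliding window of input frames, i.e. a 1-D convolution over time. The ReLU activation, context size, and dimensions below are illustrative assumptions, not the paper's configuration.

```python
import numpy as np

def tdnn_layer(frames, kernel, bias):
    """One time-delay layer: each output frame depends on a sliding
    window (context) of input frames -- a 1-D convolution over time.

    frames: (T, d_in) sequence of spectral feature frames.
    kernel: (context, d_in, d_out); bias: (d_out,).
    Returns (T - context + 1, d_out) ReLU activations.
    """
    context, d_in, d_out = kernel.shape
    T = frames.shape[0]
    out = np.empty((T - context + 1, d_out))
    for t in range(T - context + 1):
        window = frames[t:t + context]                       # (context, d_in)
        out[t] = np.tensordot(window, kernel,
                              axes=([0, 1], [0, 1])) + bias  # (d_out,)
    return np.maximum(out, 0.0)                              # ReLU

# Toy run: 10 frames of 8-dim Fourier features through a 3-frame-context layer.
rng = np.random.default_rng(0)
frames = rng.random((10, 8))
k = rng.normal(0, 0.1, (3, 8, 4))
h = tdnn_layer(frames, k, np.zeros(4))
print(h.shape)  # (8, 4)
```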

Detection of Music Mood for Context-aware Music Recommendation (상황인지 음악추천을 위한 음악 분위기 검출)

  • Lee, Jong-In;Yeo, Dong-Gyu;Kim, Byeong-Man
    • The KIPS Transactions:PartB / v.17B no.4 / pp.263-274 / 2010
  • To provide a context-aware music recommendation service, we first need to identify the music mood that a user prefers in a given situation or context. Among various music characteristics, mood has a close relation with people's emotions. Based on this relationship, some researchers have studied music mood detection by manually selecting a representative segment of a piece of music and classifying its mood. Although such approaches perform well on music mood classification, the manual intervention makes them difficult to apply to new music, and detection is made harder still because the mood usually varies over time. To cope with these problems, this paper presents an automatic method for classifying music mood. First, a whole piece of music is segmented by structural information into several groups with similar characteristics. Then the mood of each segment is detected, where each individual's mood preference is modelled by regression based on Thayer's two-dimensional mood model. Experimental results show that the proposed method achieves 80% or higher accuracy.
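The regression step above, which maps segment features onto Thayer's two-dimensional (valence, arousal) plane, can be sketched as follows. The least-squares fit, the toy features, and the four quadrant labels are generic illustrations, not the paper's model.

```python
import numpy as np

def fit_linear(X, Y):
    """Least-squares linear regression with a bias term: maps segment
    features X (n, d) to Thayer coordinates Y (n, 2) = (valence, arousal)."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    W, *_ = np.linalg.lstsq(Xb, Y, rcond=None)
    return W

def thayer_quadrant(valence, arousal):
    """Map a point in Thayer's valence-arousal plane to one of four
    coarse mood labels (the label names are illustrative)."""
    if arousal >= 0:
        return "exuberant" if valence >= 0 else "anxious"
    return "contented" if valence >= 0 else "depressed"

# Toy data: features already linearly related to mood coordinates.
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 4))
true_W = rng.normal(size=(4, 2))
Y = X @ true_W
W = fit_linear(X, Y)
v, a = np.hstack([X[0], 1.0]) @ W   # predict the first segment's mood
print(thayer_quadrant(v, a))
```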

Automatic Video Genre Identification Method in MPEG compressed domain

  • Kim, Tae-Hee;Lee, Woong-Hee;Jeong, Dong-Seok
    • Proceedings of the IEEK Conference / 2002.07c / pp.1527-1530 / 2002
  • A video summary is one of the tools that can provide fast and effective browsing of a lengthy video. A video summary consists of many key-frames, which could be defined differently depending on the video genre. Consequently, a video summary constructed in a uniform manner might lead to an inadequate result, so identifying the video genre is an important first step in generating a meaningful video summary. We propose a new method that can classify the genre of video data in the MPEG compressed bit-stream domain. Since the proposed method operates directly on the compressed bit-stream without decoding the frames, it has the merits of simple calculation and short processing time. In the proposed method, only visual information is utilized, through spatial-temporal analysis, to classify the video genre. Experiments cover six genres of video: Cartoon, Commercial, Music Video, News, Sports, and Talk Show. Experimental results show more than 90% accuracy in genre classification for well-structured video data such as Talk Show and Sports.


A New Tempo Feature Extraction Based on Modulation Spectrum Analysis for Music Information Retrieval Tasks

  • Kim, Hyoung-Gook
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.6 no.2 / pp.95-106 / 2007
  • This paper proposes an effective tempo feature extraction method for music information retrieval. The tempo information is modeled by the narrow-band temporal modulation components, which are decomposed into a modulation spectrum via joint frequency analysis. In our implementation, the tempo feature is extracted directly from the modified discrete cosine transform coefficients output by a partial MP3 (MPEG-1 Layer 3) decoder. Different features are then extracted from the amplitudes of the modulation spectrum and applied to different music information retrieval tasks. The logarithmic-scale modulation frequency coefficients are employed in automatic music emotion classification and music genre classification, significantly improving the classification precision of both systems. The bit vectors derived from the adaptive modulation spectrum are used in an audio fingerprinting task, where they prove able to achieve high robustness. The experimental results in these tasks validate the effectiveness of the proposed tempo feature.

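The core idea above, reading tempo off a modulation spectrum obtained by joint frequency analysis, can be illustrated on a synthetic amplitude envelope. The envelope source (a plain sinusoid here, rather than MDCT coefficients from a partial MP3 decoder) and the 30-240 BPM search band are simplifying assumptions.

```python
import numpy as np

def tempo_from_envelope(envelope, frame_rate):
    """Estimate tempo from a subband amplitude envelope: the FFT of the
    envelope is a modulation spectrum, and the dominant modulation
    frequency in the 0.5-4 Hz band (30-240 BPM) gives the tempo."""
    env = envelope - envelope.mean()                 # drop the DC component
    spectrum = np.abs(np.fft.rfft(env))
    freqs = np.fft.rfftfreq(len(env), d=1.0 / frame_rate)
    band = (freqs >= 0.5) & (freqs <= 4.0)
    mod_freq = freqs[band][np.argmax(spectrum[band])]
    return mod_freq * 60.0                           # beats per minute

# Synthetic envelope pulsing at 2 Hz (120 BPM), sampled at 50 frames/s.
t = np.arange(0, 20, 1.0 / 50)
env = 1.0 + np.sin(2 * np.pi * 2.0 * t)
print(round(tempo_from_envelope(env, 50)))  # 120
```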

The Performance Analysis of On-line Audio Genre Classification (온라인 오디오 장르 분류의 성능 분석)

  • Yun, Ho-Won;Jang, Woo-Jin;Shin, Seong-Hyeon;Park, Ho-Chong
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2016.11a / pp.23-24 / 2016
  • This paper compares and analyzes the performance of on-line audio genre classification. For on-line operation, one-second audio signals are taken as input and classified into one of three genres: music, speech, or effect. GMMs and deep neural networks are used as the learning methods, and four kinds of feature vectors, including MFCC and spectrogram, are used as features. By comparing the performance of each combination, we identify the learning method and feature vector best suited to genre classification.

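The GMM branch of the comparison above can be sketched with a minimal stand-in: one diagonal Gaussian per class (a 1-component GMM) scoring each one-second feature vector by likelihood. The 5-dimensional toy features below are placeholders for the MFCC or spectrogram features the paper uses.

```python
import numpy as np

class GaussianClassifier:
    """One diagonal Gaussian per class -- a 1-component stand-in for the
    per-class GMMs used for the 1-second music/speech/effect decision."""

    def fit(self, X, y):
        self.classes_ = np.unique(y)
        self.mu = {c: X[y == c].mean(axis=0) for c in self.classes_}
        self.var = {c: X[y == c].var(axis=0) + 1e-6 for c in self.classes_}
        return self

    def log_likelihood(self, x, c):
        mu, var = self.mu[c], self.var[c]
        return -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mu) ** 2 / var)

    def predict(self, x):
        """Classify one 1-second feature vector by maximum likelihood."""
        return max(self.classes_, key=lambda c: self.log_likelihood(x, c))

# Toy features for the three audio classes, centered at 0, 1, and 2.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(m, 0.2, (30, 5)) for m in (0.0, 1.0, 2.0)])
y = np.array(["music"] * 30 + ["speech"] * 30 + ["effect"] * 30)
clf = GaussianClassifier().fit(X, y)
print(clf.predict(np.full(5, 1.1)))  # speech
```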

Performance Analysis of Automatic Music Genre Classification with Different Genre Data (음악 장르 분류법에 따른 자동판별 성능분석)

  • Song, Min-Kyun;Moon, Chang-Bae;Kim, Hyun-Soo;Kim, Byeong-Man
    • Proceedings of the Korean Information Science Society Conference / 2011.06c / pp.288-291 / 2011
  • Existing work on music genre classification has focused on musical feature extraction or machine learning. However, the genre data needed for automatic classification differ from one music web site to another, and each site's genre labels describe the album rather than the individual track. Better automatic classification requires consistent genre data; as a step in that direction, this paper analyzes classification performance with genre data collected from several web sites. The analysis shows that neural-network training and classification performance vary greatly depending on the genre classification scheme.

Content-based Music Information Retrieval using Pitch Histogram (Pitch 히스토그램을 이용한 내용기반 음악 정보 검색)

  • 박만수;박철의;김회린;강경옥
    • Journal of Broadcast Engineering / v.9 no.1 / pp.2-7 / 2004
  • In this paper, we propose a content-based music information retrieval technique using several MPEG-7 low-level descriptors. In particular, pitch information and timbral features can be applied to music genre classification, music retrieval, and QBH (Query By Humming) because they can model the stochastic pattern or timbral information of a music signal. In this work, we restricted the music domain to the soundtracks (O.S.T.) of movies and soap operas, with broadcasting systems in mind: when background music reaches the user's ear, the user can retrieve information about the unknown music using only an audio clip of a few seconds extracted from the video content. We propose an audio feature set organized from MPEG-7 descriptors and distance functions based on vector distance or ratio computation. We observed that the feature set organized from pitch information is superior to the timbral spectral feature set, and that IFCR (Intra-Feature Component Ratio) is a better distance function than ED (Euclidean Distance). A k-NN classifier is used to evaluate music recognition.
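The pitch-histogram retrieval idea above can be sketched as follows. The abstract does not give the exact IFCR formula, so the component-wise min/max ratio below is only an assumed reading of "ratio computation", and the 12-bin pitch-class folding is likewise an illustrative choice.

```python
import numpy as np

def pitch_histogram(pitches, n_bins=12):
    """Fold detected pitches (MIDI note numbers) into a normalized
    pitch-class histogram."""
    hist = np.bincount(np.asarray(pitches) % n_bins, minlength=n_bins)
    return hist / max(hist.sum(), 1)

def ifcr_similarity(a, b):
    """Component-wise ratio similarity -- an assumed reading of IFCR
    (Intra-Feature Component Ratio): per bin, the ratio of the smaller
    to the larger value, averaged over bins; 1.0 means identical."""
    hi = np.maximum(a, b)
    lo = np.minimum(a, b)
    ratio = np.where(hi > 0, lo / np.where(hi > 0, hi, 1.0), 1.0)
    return float(ratio.mean())

# A query melody matches an octave-shifted copy (same pitch classes)
# more closely than an unrelated melody.
query = pitch_histogram([60, 64, 67, 60, 64, 67])
same  = pitch_histogram([72, 76, 79])            # same classes, octave up
other = pitch_histogram([61, 66, 70, 61])
print(ifcr_similarity(query, same) > ifcr_similarity(query, other))  # True
```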