• Title/Summary/Keyword: music information retrieval

A Scheme for Content-based Music Element Retrieval Using Probabilistic Latent Component Analysis and Acoustic Descriptor (확률적 은닉 성분 분석 및 음향 기술자를 사용한 내용 기반 음악 요소 검색 방법)

  • Han, Byeong-Jun;Lee, Kyo-Gu;Rho, Seung-Min;Hwang, Een-Jun
    • Proceedings of the Korea Information Processing Society Conference / 2011.04a / pp.475-478 / 2011
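  • Various content-based music retrieval and comparison methods have been proposed for music information retrieval. However, existing studies propose different methods as the query type and retrieval category change, which limits how well music retrieval methods can be integrated. To address this problem, this paper proposes a content-based music element retrieval (CBMER) method that generalizes content-based music retrieval. The proposed method decomposes the audio source using probabilistic latent component analysis (PLCA) and extracts audio features from each decomposed component. To show that the method supports content-based music element retrieval across various query types and retrieval categories, we generated queries from male and female voices, ran an experiment that retrieves music by vocal gender, and analyzed the results.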

Retrieval System Adopting Statistical Feature of MPEG Video (MPEG 비디오의 통계적 특성을 이용한 검색 시스템)

  • Yu, Young-Dal;Kang, Dae-Seong;Kim, Dai-Jin
    • Journal of the Institute of Electronics Engineers of Korea CI / v.38 no.5 / pp.58-64 / 2001
  • Recently, a great deal of information is transmitted and stored as video data, and its volume is increasing rapidly with the spread of high-performance computers and the Internet. In this paper, to retrieve video data, shots are found through analysis of the video stream and a method for detecting key frames is studied, so that users can retrieve video efficiently. This paper suggests a new feature that is robust to object movement within a shot and insensitive to color changes when detecting shot boundaries, and proposes a characterizing value that reflects the kind of video (movie, drama, news, music video, etc.). Key frames are extracted from the many frames by using the local minima and maxima of the differential of this value. After the original frames (not DC images) are reconstructed for the key frames, indexing is performed by computing parameters. Key frames that are similar to the user's query image are then retrieved by computing parameters. Experiments show that the proposed methods outperform the conventional method and achieve a high retrieval accuracy rate.
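
The key-frame step in the abstract above can be illustrated with a minimal sketch. It assumes a per-frame characterizing value is already available as a list of numbers; the function name `key_frame_indices` and the choice to report the frame reached by each extreme change are illustrative assumptions, not details from the paper.

```python
def key_frame_indices(values):
    """Return frame indices where the first difference of `values`
    has a strict local maximum or minimum (hypothetical helper)."""
    # diff[i] is the change in the characterizing value from frame i to frame i + 1.
    diff = [b - a for a, b in zip(values, values[1:])]
    keys = []
    for i in range(1, len(diff) - 1):
        is_max = diff[i] > diff[i - 1] and diff[i] > diff[i + 1]
        is_min = diff[i] < diff[i - 1] and diff[i] < diff[i + 1]
        if is_max or is_min:
            keys.append(i + 1)  # frame reached by the extreme change
    return keys


# Toy series: the value jumps up at frame 2 and drops at frame 5.
print(key_frame_indices([0, 0, 5, 5, 5, 1, 1]))
```

Strict comparisons are used so that flat stretches of the differential do not produce key frames; a real system would likely smooth the series first.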

Efficient Similarity Search in Multi-attribute Time Series Databases (다중속성 시계열 데이타베이스의 효율적인 유사 검색)

  • Lee, Sang-Jun
    • The KIPS Transactions: Part D / v.14D no.7 / pp.727-732 / 2007
  • Most previous work on indexing and searching time series has focused on similarity matching and retrieval of one-attribute time series. However, multimedia databases such as music and video need to handle similarity search in multi-attribute time series. The limitation of current similarity models for multi-attribute sequences is that they do not consider the attributes' sequences. A multi-attribute sequence is composed of several attributes' sequences. Since users may want to find similar patterns with respect to the attributes' sequences, it is more appropriate to define the similarity between two multi-attribute sequences from the viewpoint of the attributes' sequences. In this paper, we propose a similarity search method based on attributes' sequences in multi-attribute time series databases. The proposed method efficiently reduces the search space and guarantees no false dismissals. In addition, we give preliminary experimental results to show the effectiveness of the proposed method.

A Study of Developing a Musician Retrieval System Using Topic Maps (토픽맵기반의 뮤지션 검색시스템 구축)

  • Kwon, Jin-Man;Chung, Myung-Bum;Sung, Bo-Kyung;Kim, Jung-Soo;Ko, Il-Ju
    • 한국HCI학회:학술대회논문집 / 2008.02a / pp.760-765 / 2008
  • The purpose of this paper is to propose a powerful alternative for designing knowledge portals using Topic Maps (TM). All information is organized around topics: associations are defined between topics to compose the flow of information, and occurrences are added to each topic through an OpenAPI. The system also provides a graphical UI for intuitive representation and uses JavaScript for cross-browser support. Instead of XTM, the standard Topic Maps syntax, it uses JSON for a simpler data representation. The result is a more intuitive UI that extends the limits of existing musician search systems in displaying new information. The approach is expected to benefit musician-based music search in the future.

Named Entity Recognition and Dictionary Construction for Korean Title: Books, Movies, Music and TV Programs (한국어 제목 개체명 인식 및 사전 구축: 도서, 영화, 음악, TV프로그램)

  • Park, Yongmin;Lee, Jae Sung
    • KIPS Transactions on Software and Data Engineering / v.3 no.7 / pp.285-292 / 2014
  • Named entity recognition is used to improve the performance of information retrieval systems, question answering systems, machine translation systems and so on. The targets of named entity recognition are usually PLOs (persons, locations and organizations). They are usually proper nouns or unregistered words, and traditional named entity recognizers use these characteristics to find named entity candidates. The titles of books, movies and TV programs have different characteristics from PLO entities. They sometimes consist of multiple phrases, a whole sentence, or special characters, which makes it difficult to find named entity candidates. In this paper we propose a method to quickly extract title named entities from news articles and automatically build a named entity dictionary for the titles. For candidate identification, word phrases enclosed in special symbols in a sentence are first extracted and then verified by an SVM using feature words and their distances. For classification of the extracted title candidates, an SVM is used with the mutual information of word contexts.

Feature Selection for Multi-Class Genre Classification using Gaussian Mixture Model (Gaussian Mixture Model을 이용한 다중 범주 분류를 위한 특징벡터 선택 알고리즘)

  • Moon, Sun-Kuk;Choi, Tack-Sung;Park, Young-Cheol;Youn, Dae-Hee
    • The Journal of Korean Institute of Communications and Information Sciences / v.32 no.10C / pp.965-974 / 2007
  • In this paper, we propose a feature selection algorithm for multi-class genre classification. We develop a GMM separation score, based on the Gaussian mixture model, to measure the separability between two genres. We also improve a feature subset selection algorithm based on sequential forward selection for multi-class genre classification: instead of using the separability over all genres as the criterion, we use the worst genre separability as the criterion at each sequential selection step. To assess the performance of the proposed algorithm, we extracted various features that represent characteristics such as timbre, rhythm and pitch. We then investigated classification performance with a GMM classifier and a k-NN classifier for features selected by the conventional and the proposed algorithms. The proposed algorithm improved classification accuracy by up to 10 percent, especially in experiments with low-dimensional feature vectors.
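
The worst-case selection criterion described in this abstract can be sketched as follows. This is a rough illustration only: the abstract's GMM separation score is not specified here, so `pair_score` is a simple stand-in (a Fisher-like ratio summed over features), and the names `pair_score` and `worst_case_sfs` are hypothetical.

```python
from itertools import combinations
from statistics import mean, pvariance


def pair_score(a, b, feats):
    """Stand-in separability of two classes over the feature subset `feats`.
    (The paper uses a GMM-based score; this ratio is only a placeholder.)"""
    total = 0.0
    for f in feats:
        xa = [row[f] for row in a]
        xb = [row[f] for row in b]
        total += (mean(xa) - mean(xb)) ** 2 / (pvariance(xa) + pvariance(xb) + 1e-9)
    return total


def worst_case_sfs(data, n_features, k):
    """Sequential forward selection of k features.
    data: dict mapping genre name -> list of feature rows.
    At each step, add the feature that maximizes the score of the
    worst-separated genre pair."""
    selected = []
    remaining = list(range(n_features))
    while len(selected) < k and remaining:
        def worst(f):
            feats = selected + [f]
            return min(pair_score(data[g], data[h], feats)
                       for g, h in combinations(data, 2))
        best = max(remaining, key=worst)
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data: feature 0 separates A from B and C but not B from C;
# feature 1 separates every pair; feature 2 separates nothing.
data = {
    "A": [[0.0, 0.0, 0.0], [1.0, 1.0, 1.0]],
    "B": [[10.0, 3.0, 1.0], [11.0, 4.0, 0.0]],
    "C": [[10.0, 6.0, 0.0], [11.0, 7.0, 1.0]],
}
print(worst_case_sfs(data, n_features=3, k=1))
```

In the toy data, a criterion that sums separability over all pairs would favor feature 0, while the worst-pair criterion picks feature 1 because it is the only one that separates B from C.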

Blind Rhythmic Source Separation (블라인드 방식의 리듬 음원 분리)

  • Kim, Min-Je;Yoo, Ji-Ho;Kang, Kyeong-Ok;Choi, Seung-Jin
    • The Journal of the Acoustical Society of Korea / v.28 no.8 / pp.697-705 / 2009
  • An unsupervised (blind) method is proposed for extracting rhythmic sources from commercial polyphonic music whose number of channels is limited to one. Commercial music signals are not usually provided with more than two channels, while they often contain multiple instruments including singing voice. Therefore, instead of using conventional modeling of mixing environments or statistical characteristics, we should introduce other source-specific characteristics for separating or extracting sources in underdetermined environments. In this paper, we concentrate on extracting rhythmic sources from a mixture with other harmonic sources. An extension of nonnegative matrix factorization (NMF), called nonnegative matrix partial co-factorization (NMPCF), is used to analyze multiple relationships between spectral and temporal properties in the given input matrices. Moreover, the temporal repeatability of rhythmic sound sources is used as a common rhythmic property among segments of an input mixture signal. The proposed method shows acceptable, though not superior, separation quality compared with the referenced drum source separation systems that rely on prior knowledge, but it has better applicability because it is blind, for example when there is no prior information or the target rhythmic source is irregular.
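
For readers unfamiliar with the NMF baseline that NMPCF extends, here is a minimal sketch of plain NMF using the standard Lee-Seung multiplicative updates for the Euclidean cost. It is not the NMPCF method of the paper, and the helper names (`matmul`, `transpose`, `nmf`, `frobenius_error`) and default parameters are assumptions made for illustration.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def nmf(v, rank, iters=200, seed=0, eps=1e-9):
    """Factor a nonnegative matrix v (m x n) into w (m x rank) and
    h (rank x n) with multiplicative updates; plain NMF, not NMPCF."""
    rng = random.Random(seed)
    m, n = len(v), len(v[0])
    w = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(m)]
    h = [[rng.random() + 0.1 for _ in range(n)] for _ in range(rank)]
    for _ in range(iters):
        # h <- h * (w^T v) / (w^T w h), element-wise
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(n)]
             for i in range(rank)]
        # w <- w * (v h^T) / (w h h^T), element-wise
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][j] * num[i][j] / (den[i][j] + eps) for j in range(rank)]
             for i in range(m)]
    return w, h


def frobenius_error(v, w, h):
    """Frobenius norm of v - w h."""
    approx = matmul(w, h)
    return sum((a - b) ** 2 for ra, rb in zip(v, approx) for a, b in zip(ra, rb)) ** 0.5


# Toy "spectrogram": rows are frequency bins, columns are time frames.
v = [[1.0, 0.0, 1.0, 0.0],
     [2.0, 0.0, 2.0, 0.0],
     [0.0, 3.0, 0.0, 3.0]]
w, h = nmf(v, rank=2)
print(frobenius_error(v, w, h))
```

In a source separation setting, the columns of `w` play the role of spectral templates and the rows of `h` their activations over time; NMPCF, as the abstract describes it, additionally shares part of the factorization across several input matrices.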