• Title/Summary/Keyword: 음악 정보 추출

Search Result 150, Processing Time 0.028 seconds

Representative Melodies Retrieval using Waveform and FFT Analysis of Audio (오디오의 파형과 FFT 분석을 이용한 대표 선율 검색)

  • Chung, Myoung-Bum;Ko, Il-Ju
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.12
    • /
    • pp.1037-1044
    • /
    • 2007
  • Recently, we extract the representative melody of the music and index the music to reduce searching time at the content-based music retrieval system. The existing study has used MIDI data to extract a representative melody but it has a weak point that can use only MIDI data. Therefore, this paper proposes a representative melody retrieval method that can be use at all audio file format and uses digital signal processing. First, we use Fast Fourier Transform (FFT) and find the tempo and node for the representative melody retrieval. And we measure the frequency of high value that appears from PCM Data of each node. The point which the high value is gathering most is the starting point of a representative melody and an eight node from the starting point is a representative melody section of the audio data. To verity the performance of the method, we chose a thousand of the song and did the experiment to extract a representative melody from the song. In result, the accuracy of the extractive representative melody was 79.5% among the 737 songs which was found tempo.

Contents prediction method applying automatically extracted user groups based on users' consuming logs about contents (자동 추출된 사용자 그룹을 이용한 콘텐츠 및 사용자 히스토리 기반의 사용자 별 콘텐츠 추천 방법)

  • Shin, Saim;Yang, Chang-Mo;Jang, Se-Jin;Lee, Seok-Pil
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2012.07a
    • /
    • pp.55-58
    • /
    • 2012
  • 본 논문은 사용자의 각종 멀티미디어 콘텐츠 소비 히스토리를 수집하여 체계화 및 패턴 분석을 수행하고, 이를 바탕으로 사용자가 선호할 것으로 예측되는 멀티미디어 콘텐츠들을 추출하여 제공하는 콘텐츠 추천 시스템에 관한 연구이다. 본 논문에서는 콘텐츠 소비와 연관된 사용자 로그와 엔진에서 자동 추출한 사용자 그룹을 통하여 콘텐츠 추천을 수행한다. 각 사용자들의 선호정보 데이터를 분석하여 선호정보 패턴이 유사한 사용자들을 사용자 그룹으로 정의하고, 각 사용자들이 속한 사용자 그룹의 사용자 로그를 활용하여 사용자별 선호 콘텐츠를 예측한다. 본 시스템은 웹 또는 모바일 환경에서 음악, 방송, 광고, 기사 등의 방대하고 다양한 콘텐츠를 복합적으로 사용자들에게 선별하여 제공해 주며, 이들의 연관성과 사용자의 콘텐츠 선호패턴을 반영한 개인 맞춤형 콘텐츠 추천 엔진은 사용자가 선호할만한 콘텐츠들을 추천하여 사용자의 콘텐츠 소비 시의 만족도를 높여줄 수 있다.

  • PDF

Efficient Multiplex Audio Monitoring System in Digital Broadcasting (디지털 방송에서 효율적인 다중 오디오 모니터링 시스템)

  • Kim, Yoo-Won;Sohn, Surg-Won;Jo, Geun-Sik
    • Journal of the Korea Society of Computer and Information
    • /
    • v.13 no.7
    • /
    • pp.91-98
    • /
    • 2008
  • In digital broadcasting, it is possible to multiplex maximum one hundred audio or music programs into MPEG-2 transport stream, which is suitable for transmitting through one channel. In order to check if multiplex music programs are transmitted well, we need a multiplex audio monitoring system that monitors the programs in real-time. In analog broadcasting, we have used hardware-based audio monitoring system for a small number music programs. However, the effectiveness of hardware-based audio monitoring system from the cost and function viewpoint is so low that a new system is needed for digital broadcasting. In this paper, we have designed and implemented a software-based audio monitoring system to satisfy these requirements. In this implementation, only one PC is used without other hardware facilities, and the system monitors digital broadcasting music programs effectively. Transmitted digital broadcasting streams are demultiplexed into many music programs and the realtime value of audio level and packet error information for these programs are displayed in the screen. Thus, the system detects and shows the abnormal transmitting programs automatically. Simulation results show that effective realtime multiplex audio monitoring is possible for digital broadcasting music programs.

  • PDF

A Study on the Efficient Search of an Audio Database using Musical Interval Contour (음정 곡선을 이용한 효율적인 오디오 데이터베이스 탐색에 관한 연구)

  • 지정규;오해석
    • The Journal of Information Technology and Database
    • /
    • v.4 no.2
    • /
    • pp.97-104
    • /
    • 1998
  • 본 논문은 디지털 오디오 도서관에 대규모 선율 데이터베이스로부터 임의의 곡을 효율적으로 탐색하기 위하여 음정곡선을 색인키로 사용하는 방법에 대해 기술했다. 사용자가 검색하고자 하는 음악의 일부 선율을 노래하면 입력된 음신호를 인식하여 음높이 정보를 추출한다. 그리고, 음표간의 음정을 계산하여 음표순으로 배열함으로써 음정 곡선을 만든다. 제안한 은표열 탐색 알고리즘에 생성된 음정 곡선을 탐색 패턴으로 입력하여 선율 데이터베이스의 음표열을 비교 조사한다. 그러면 근사 음정 곡선을 가진 후보곡을 탐색할 수 있다. 제안한 음표열 탐색 알고리즘은 실험을 통해 동적 프로그래밍 및 상태 대조 알고리즘과 비교한 결과 탐색 시간이 2배이상 향상되었다.

Development of Music Recommendation System based on Customer Sentiment Analysis (소비자 감성 분석 기반의 음악 추천 알고리즘 개발)

  • Lee, Seung Jun;Seo, Bong-Goon;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.197-217
    • /
    • 2018
  • Music is one of the most creative act that can express human sentiment with sound. Also, since music invoke people's sentiment to get empathized with it easily, it can either encourage or discourage people's sentiment with music what they are listening. Thus, sentiment is the primary factor when it comes to searching or recommending music to people. Regard to the music recommendation system, there are still lack of recommendation systems that are based on customer sentiment. An algorithm's that were used in previous music recommendation systems are mostly user based, for example, user's play history and playlists etc. Based on play history or playlists between multiple users, distance between music were calculated refer to basic information such as genre, singer, beat etc. It can filter out similar music to the users as a recommendation system. However those methodology have limitations like filter bubble. For example, if user listen to rock music only, it would be hard to get hip-hop or R&B music which have similar sentiment as a recommendation. In this study, we have focused on sentiment of music itself, and finally developed methodology of defining new index for music recommendation system. Concretely, we are proposing "SWEMS" index and using this index, we also extracted "Sentiment Pattern" for each music which was used for this research. Using this "SWEMS" index and "Sentiment Pattern", we expect that it can be used for a variety of purposes not only the music recommendation system but also as an algorithm which used for buildup predicting model etc. In this study, we had to develop the music recommendation system based on emotional adjectives which people generally feel when they listening to music. For that reason, it was necessary to collect a large amount of emotional adjectives as we can. Emotional adjectives were collected via previous study which is related to them. Also more emotional adjectives has collected via social metrics and qualitative interview. Finally, we could collect 134 individual adjectives. Through several steps, the collected adjectives were selected as the final 60 adjectives. Based on the final adjectives, music survey has taken as each item to evaluated the sentiment of a song. Surveys were taken by expert panels who like to listen to music. During the survey, all survey questions were based on emotional adjectives, no other information were collected. The music which evaluated from the previous step is divided into popular and unpopular songs, and the most relevant variables were derived from the popularity of music. The derived variables were reclassified through factor analysis and assigned a weight to the adjectives which belongs to the factor. We define the extracted factors as "SWEMS" index, which describes sentiment score of music in numeric value. In this study, we attempted to apply Case Based Reasoning method to implement an algorithm. Compare to other methodology, we used Case Based Reasoning because it shows similar problem solving method as what human do. Using "SWEMS" index of each music, an algorithm will be implemented based on the Euclidean distance to recommend a song similar to the emotion value which given by the factor for each music. Also, using "SWEMS" index, we can also draw "Sentiment Pattern" for each song. In this study, we found that the song which gives a similar emotion shows similar "Sentiment Pattern" each other. Through "Sentiment Pattern", we could also suggest a new group of music, which is different from the previous format of genre. This research would help people to quantify qualitative data. Also the algorithms can be used to quantify the content itself, which would help users to search the similar content more quickly.

Melody Note - Music Score Editor and Play System (악보작성 및 재생 시스템)

  • Kim, Tae-Ki;Lee, Dae-jeong;Park, Mi-Ra;Min, Jun-Ki
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2009.10a
    • /
    • pp.1059-1062
    • /
    • 2009
  • As the electronic processing of music gradually is developed, there has been growing interest in automatical input of music. As a result, various researches which input music in the computer has been studied. However, previous studies have drawbacks that only the experts can do it. In other words, if beginners would like to use traditional production program of music scores than prior knowledge is required. To resolve this, we propose system painting music scores automatically using a bandwidth of soundsource, after extracting the voice sounds created by amateurs. The System provides amateurs with convenience so that they can compose. As well as, the System provides the ability to play music that produced by the computer. By using the system, amateurs can compose using voice and simple system handling. And, they can make a music that plays desired instruments.

  • PDF

Classification of Music Data using Fuzzy c-Means with Divergence Kernel (분산커널 기반의 퍼지 c-평균을 이용한 음악 데이터의 장르 분류)

  • Park, Dong-Chul
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.46 no.3
    • /
    • pp.1-7
    • /
    • 2009
  • An approach for the classification of music genres using a Fuzzy c-Means(FcM) with divergence-based kernel is proposed and presented in this paper. The proposed model utilizes the mean and covariance information of feature vectors extracted from music data and modelled by Gaussian Probability Density Function (GPDF). Furthermore, since the classifier utilizes a kernel method that can convert a complicated nonlinear classification boundary to a simpler linear one, he classifier can improve its classification accuracy over conventional algorithms. Experiments and results on collected music data sets demonstrate hat the proposed classification scheme outperforms conventional algorithms including FcM and SOM 17.73%-21.84% on average in terms of classification accuracy.

Audio fingerprint matching based on a power weight (파워 가중치를 이용한 오디오 핑거프린트 정합)

  • Seo, Jin Soo;Kim, Junghyun;Kim, Hyemi
    • The Journal of the Acoustical Society of Korea
    • /
    • v.38 no.6
    • /
    • pp.716-723
    • /
    • 2019
  • Fingerprint matching accuracy is essential in deploying a music search service. This paper deals with a method to improve fingerprint matching accuracy by utilizing an auxiliary information which is called power weight. Power weight is an expected robustness of each hash bit. While the previous power mask binarizes the expected robustness into strong and weak bits, the proposed method utilizes a real-valued function of the expected robustness as weights for fingerprint matching. As a countermeasure to the increased storage cost, we propose a compression method for the power weight which has strong temporal correlation. Experiments on the publicly-available music datasets confirmed that the proposed power weight is effective in improving fingerprint matching performance.

A Study on Vocal Removal Scheme of SAOC Using Harmonic Information (하모닉 정보를 이용한 SAOC의 보컬 신호 제거 방법에 관한 연구)

  • Park, Ji-Hoon;Jang, Dae-Geun;Hahn, Min-Soo
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.10
    • /
    • pp.1171-1179
    • /
    • 2013
  • Interactive audio service provide with audio generating and editing functionality according to user's preference. A spatial audio object coding (SAOC) scheme is audio coding technology that can support the interactive audio service with relatively low bit-rate. However, when the SAOC scheme remove the specific one object such as vocal object signal for Karaoke mode, the scheme support poor quality because the removed vocal object remain in the SAOC-decoded background music. Thus, we propose a new SAOC vocal harmonic extranction and elimination technique to improve the background music quality in the Karaoke service. Namely, utilizing the harmonic information of the vocal object, we removed the harmonics of the vocal object remaining in the background music. As harmonic parameters, we utilize the pitch, MVF(maximum voiced frequency), and harmonic amplitude. To evaluate the performance of the proposed scheme, we perform the objective and subjective evaluation. As our experimental results, we can confirm that the background music quality is improved by the proposed scheme comparing with the SAOC scheme.

An Audio watermarking method robust against time- and frequency- scaling (피치 및 시간 스케일링에 강인한 오디오 워터마킹 기법)

  • Park Changmok;Byun Youngbae;Kim Jongweon;Choi Jonguk
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.335-338
    • /
    • 2002
  • 본 연구에서는 주파수 영역에서의 확산 스펙트럼 방식을 이용한 오디오 워터마킹 기법을 사용하고 있다. 워터마크 삽입은 오디오 신호를 MCLT(Modulated Complex Lapped Transform)로 분석한 후, 특정 주파수 영역의 진폭에 삽입되며 추출은 상관도를 이용하여 추출하게 된다. 워터마크 삽입은 44.1 kHz의 음악에 80 bits의 정보가 4초 단위로 반복적으로 삽입되며, 추출에서는 무작위로 추출된 8초 분량의 오디오 신호로부터 80 bits 비트 열과의 상관도를 계산하여 선정된 문턱 값을 초과하게 되면 워터마크가 존재하는 것으로 판단하게 된다 피치 스케일에 대응하기 위하여 120개 정도의 탐색을 수행하며, 시간 스케일에 대응하기 위하여 상관도의 지역 최대 점을 추출하고, 이러한 지역 최대 점들로부터 추출된 비트 열과 실제 비트 열과의 상관도를 계산하게 된다. 그러나 추출된 비트 열은 삽입 에러와 삭제 에러를 가질 수 있기 때문에 이러한 비트 열과의 최대 상관도를 구하기 위하여 본 연구에서는 동적계획법에 의한 최대 상관도 추출 알고리즘을 제시한다. 제안된 방법은 피치 및 시간 스케일링 변환 뿐만 아니라, 오디오 압축에도 견고함을 보인다.

  • PDF