• Title/Summary/Keyword: Music Algorithm

Search Result 344, Processing Time 0.028 seconds

An Efficient Algorithm for Localizing 3D Narrowband Multiple Sources (다중표적의 효과적인 3차원 위치추정 알고리듬)

  • Lee Chul-Mok;Lee Jong-Hwan;Lee Su-Hyung;Yun Kyung-Sik;Lee Kyun-Kyung
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.228-231
    • /
    • 1999
  • 3차원 공간상의 표적의 위치는 방위각, 고각, 거리의 세가지요소로 나타내어 질 수 있다. 이 논문에서는 등각적 선배열 센서로 이루어진 3개의 부분센서배열을 이용한 3차원 표적의 위치추정 알고리듬을 제안하였다. 원거리 표적의 방위각 추정 알고리듬으로 근거리 표적의 방위각을 추정하면 추정된 방위각은 실제 근거리 표적의 방위각과 고각과 거리의 비선형 대수적 관계식으로 주어진다. 제안한 알고리듬은 3개의 부분센서배열에서 각각 표적을 원거리에 있다고 가정하고 원거리입체각을 추정하여 위의 대수적 관계식을 얻은 후 이들 관계식을 연립하여 실제 근거리 표적의 위치를 추정하였다. 다중표적의 경우 각각의 부분센서배열에서 추정한 원거리입체각이 어떤 표적에 대한 추정치인지 연관시켜주는 알고리듬이 필요하다. 이 논문에서는 추정한 원거리입체각의 모든 조합으로부터 3차원 MUSIC 스펙트럼값을 비교하여 그 중 표적의 개수만큼을 선별하여 다중표적의 위치를 추정하였다.

  • PDF

Classification of TV Program Scenes Based on Audio Information

  • Lee, Kang-Kyu;Yoon, Won-Jung;Park, Kyu-Sik
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.3E
    • /
    • pp.91-97
    • /
    • 2004
  • In this paper, we propose a classification system of TV program scenes based on audio information. The system classifies the video scene into six categories of commercials, basketball games, football games, news reports, weather forecasts and music videos. Two type of audio feature set are extracted from each audio frame-timbral features and coefficient domain features which result in 58-dimensional feature vector. In order to reduce the computational complexity of the system, 58-dimensional feature set is further optimized to yield l0-dimensional features through Sequential Forward Selection (SFS) method. This down-sized feature set is finally used to train and classify the given TV program scenes using κ -NN, Gaussian pattern matching algorithm. The classification result of 91.6% reported here shows the promising performance of the video scene classification based on the audio information. Finally, the system stability problem corresponding to different query length is investigated.

Bearing Estimation of Multiple Wide Band Signals using Modified Algorithms in Multipath Environment (다경로인 경우 개선된 알고리듬을 이용한 다수의 광대역 신호의 입사각 추정)

  • Cho, Jeong-Kwon;Park, Young-Chul;Cha, Il-Whan;Youn, Dae-Hee
    • Proceedings of the KIEE Conference
    • /
    • 1988.07a
    • /
    • pp.3-6
    • /
    • 1988
  • The UCERSS algorithm is an extended MUSIC which is used to estimate incident angles of multiple wide band signals. The purpose of this paper is to extend the UCERSS in order to estimate the direction of arrivals of multiple wide band signals in multipath environment. The modifications of the UCERSS result in the wide band spatial smoothing and the UNSS approaches. Computer simulation results indicate that the performances of the UNSS are superior to those of the UCERSS and the wide band spatial smoothing method.

  • PDF

Design and Implementation of an Algorithm for Adjusting Playing Speed of Music according to Pace Speed (걸음 속도를 이용한 음악재생 속도 조절 알고리즘 설계 및 구현)

  • Seo, Sang-Hyun;Jang, Si-Woong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2015.10a
    • /
    • pp.220-223
    • /
    • 2015
  • 기존의 제품에서는 음악을 들으며 운동하는 과정에서 음악속도에 맞게 운동을 하는 방법을 사용하였다. 사용자가 운동중에 직접 음악 재생을 조절하는 방법은 좀 더 격한 음악을 재생하고자 했을 때 음악을 검색중 사용자의 운동의 흐름이 끊어지는 경우가 빈번하게 발생한다. 그러나 사용자의 걸음 속도를 이용해 음악 재생속도를 더 빠르게 한다면 효율적인 운동이 될 것이다. 본 논문에서는 신발의 깔창에 장착된 1개 이상의 압전센서를 활용하여 아두이노에서 걸음 수를 수집하고 수집된 압전센서 데이터를 이용하여 걸음 수를 이용한 걸음 속도의 측정 과정을 안드로이드 스마트폰에서 처리한다. 이렇게 측정된 걸음 속도를 통해 안드로이드 스마트폰에서 음악 재생 속도를 결정하여 사용자에게 쾌적한 운동 환경을 제공하는 것을 목적으로 한다.

  • PDF

Player of Song by Face Recognition (표정인식에 의한 노래 플레이어)

  • Nam, Soo-Tai;Shin, Seong-Yoon;Lee, Hyun-chang;Jin, Chan-Yong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.10a
    • /
    • pp.184-185
    • /
    • 2018
  • Face Song Player, which is a system that recognizes the facial expression of an individual and plays music that is appropriate for such person, is presented. It studies information on the facial contour lines and extracts an average, and acquires the facial shape information. MUCT DB was used as the DB for learning. For the recognition of facial expression, an algorithm was designed by using the differences in the characteristics of each of the expressions on the basis of expressionless images.

  • PDF

Considerations for Design and Implementation of a RF Emitter Localization System with Array Antennas

  • Lim, Deok Won;Lim, Soon;Chun, Sebum;Heo, Moon Beom
    • Journal of Positioning, Navigation, and Timing
    • /
    • v.5 no.1
    • /
    • pp.37-45
    • /
    • 2016
  • In this paper, design and implementation issues for a network-oriented RF emitter localization system with array antenna are discussed. For hardware, the problem of array mismatch and RF/IF channel mismatch are introduced and the calibration schemes for solving those problems are also provided. For software, it is explained how to overcome the drawback of conventional MUltiple Signal Identification and Classification (MUSIC) algorithm in a point of identifying the number of received signals and problems such as Data Association Problem and Ghost Node Problem in regard to multiple emitter localization are presented with some approaches for getting around those problems. Finally, for implementation, a criterion for arranging each of sensors and a requirement for alignment of array antenna' orientation are also given.

Separation of Single Channel Mixture Using Time-domain Basis Functions

  • 장길진;오영환
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.4
    • /
    • pp.146-146
    • /
    • 2002
  • We present a new technique for achieving source separation when given only a single channel recording. The main idea is based on exploiting the inherent time structure of sound sources by learning a priori sets of time-domain basis functions that encode the sources in a statistically efficient manner. We derive a learning algorithm using a maximum likelihood approach given the observed single channel data and sets of basis functions. For each time point we infer the source parameters and their contribution factors. This inference is possible due to the prior knowledge of the basis functions and the associated coefficient densities. A flexible model for density estimation allows accurate modeling of the observation, and our experimental results exhibit a high level of separation performance for simulated mixtures as well as real environment recordings employing mixtures of two different sources. We show separation results of two music signals as well as the separation of two voice signals.

Development of Music Information Retrieval System Using Differentiation of Frequency and Cosine Similarity Algorithm (음원의 주파수 변화율과 코사인 유사도 알고리즘을 이용한 음악 검색 시스템 개발)

  • Song, Ji Won;Lim, Eun Joo;Ha, Seong Yoon;Woo, Gyun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2014.11a
    • /
    • pp.1027-1030
    • /
    • 2014
  • 대중음악과 스마트폰 기술이 발달하면서 사용자가 직접 음악을 검색할 수 있는 내용 기반 음악 검색 기술이 연구되었다. 그 결과 허밍을 사용하여 음악을 검색할 수 있는 음악 검색 시스템이 개발되었지만, 검색 속도가 느리고 검색 결과가 부정확한 시스템이 많다. 본 논문에서는 음원의 주파수 변화율을 측정하고 이를 코사인 유사도 알고리즘을 이용하여 유사도를 측정하는 음악 검색 시스템을 설계하였고, 각 설계요소를 설명한다. 새로 설계한 음악 검색 시스템을 기반으로 한 실험을 통하여 기존의 음악 검색 시스템과 유사한 성능이 나오는 것을 확인하였으며 본 논문에서 제시한 새로운 음악 검색 시스템은 기존 음악 검색 시스템보다 구조가 단순하면서도 유사한 결과를 내고 있다.

Antipersonnel Landmine Detection Using Ground Penetrating Radar

  • Shrestha, Shanker-Man;Arai, Ikuo;Tomizawa, Yoshiyuki;Gotoh, Shinji
    • Proceedings of the KSRS Conference
    • /
    • 2003.11a
    • /
    • pp.1064-1066
    • /
    • 2003
  • In this paper, ground penetrating radar (GPR), which has the capability to detect non metal and plastic mines, is proposed to detect and discriminate antipersonnel (AP) landmines. The time domain GPR - Impulse radar and frequency domain GPR - SFCW (Stepped Frequency Continuous Wave) radar is utilized for metal and non-metal landmine detection and its performance is investigated. Since signal processing is vital for target reorganization and clutter rejection, we implemented the MUSIC (Multiple Signal Classification) algorithm for the signal processing of SFCW radar data and SAR (Synthetic Aperture Radar) processing method for the signal processing of Impulse radar data.

  • PDF

Song Player by Distance Measurement from Face (얼굴에서 거리 측정에 의한 노래 플레이어)

  • Shin, Seong-Yoon;Lee, Min-Hye;Shin, Kwang-Seong;Lee, Hyun-Chang
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.667-669
    • /
    • 2022
  • In this paper, Face Song Player, which is a system that recognizes the facial expression of an individual and plays music that is appropriate for such person, is presented. It studies information on the facial contour lines and extracts an average, and acquires the facial shape information. MUCT DB was used as the DB for learning. For the recognition of facial expression, an algorithm was designed by using the differences in the characteristics of each of the expressions on the basis of expressionless images.

  • PDF