• Title/Summary/Keyword: Speech and music discrimination

Search Result 24, Processing Time 0.019 seconds

A Study of Automatic Detection of Music Signal from Broadcasting Audio Signal (방송 오디오 신호로부터 음악 신호 검출에 관한 연구)

  • Yoon, Won-Jung;Park, Kyu-Sik
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.47 no.5
    • /
    • pp.81-88
    • /
    • 2010
  • In this paper, we proposed an automatic music/non-music signal discrimination system from broadcasting audio signal as a preliminary study of building a sound source monitoring system in real broadcasting environment. By reflecting human speech articulation characteristics, we used three simple time-domain features such as energy standard deviation, log energy standard deviation and log energy mean. Based on the experimental threshold values of each feature, we developed a rule-based algorithm to classify music portion of the input audio signal. For the verification of the proposed algorithm, actual FM broadcasting signal was recorded for 24 hours and used as source input audio signal. From the experimental results, the proposed system can effectively recognize music section with the accuracy of 96% and non-music section with that of 87%, where the performance is good enough to be used as a pre-process module for the a sound source monitoring system.

A Study on Real-time Discrimination of FM Radio Broadcast Speech/Music (실시간 FM 방송중 음악/음성 검출에 관한 연구)

  • 황진만;강동욱;김기두
    • Proceedings of the IEEK Conference
    • /
    • 2003.07e
    • /
    • pp.2136-2139
    • /
    • 2003
  • 본 논문은 FM 라디오 방송중의 오디오 신호를 블록단위로 음악 및 음성을 검출하는 알고리즘에 대한 것으로, 이를 기반으로 방송중의 노래(가요, 팝, 클래식‥‥)만을 자동으로 인식하여 녹음하는 알고리즘을 개발한다. 본 논문에서는 기존에 제안되었던 것[1-4]과 같이 단지 음악과 음성을 구분함과 동시에 음악구간의 논리적 조합으로 이루어진 노래를 자동으로 인식하여 녹음하는 것을 알고리즘의 최종 목표로 한다. 알고리즘의 접근 역시 기존의 음소단위의 모델링을 거치는 GMM 기반의 접근이 아니기 때문에 모델링에 대한 훈련과정이 필요 없고, 시간영역에서의 오디오신호가 가지고 있는 직관적인 특징을 분석함으로써 비교적 적은 연산으로 실시간 구현이 가능하다.

  • PDF

The Relationship Between Perception of Prosody, Pitch Discrimination, and Melodic Contour Identification in Cochlear Implants Recipients (인공와우이식 난청인의 말소리 운율변화에 따른 구어 이해와 음도 변별, 선율윤곽 확인 간 관련성)

  • Kim, Eun Yeon;Moon, Il Joon;Cho, Yang-sun;Chung, Won-ho;Hong, Sung Hwa
    • Journal of Music and Human Behavior
    • /
    • v.14 no.2
    • /
    • pp.1-18
    • /
    • 2017
  • The relationships between the ability to understand changes in meaning depending on the prosody of spoken words and the ability to perceive pitch and melodic contour in cochlear implants (CI) recipients were examined. Fifteen postlingual CI recipients were measured in terms of speech prosody perception, speech perception, pitch discrimination (PD), and melody contour identification (MCI). The speech prosody perception test consists of words with positive (PW) and neutral meaning (NW). Participants were asked to identify the meaning of words depending on the conditions of positive and negative prosody. The MCI consists of subtests 1 and 2 with different chance levels to choose. Then, the relationships between speech prosody perception, speech perception, PD, and MCI performance were analyzed. There was a significant difference in identifying the meaning of words expressed in a different prosody between the PW and NW conditions. Speech prosody perception showed a significant correlation with MCI 1 while there was no significant relationship with speech perception. Although speech perception may be possible after CI, limited spoken word comprehension due to decreased sensitivity for prosodic changes may persist in CI recipients. In addition, there was a limitation in perception of melodic contour change compared to pitch discrimination, which is related to speech prosody perception.