• Title/Summary/Keyword: Audio compensation

Search Result 24, Processing Time 0.022 seconds

Development of Audio Melody Extraction and Matching Engine for MIREX 2011 tasks

  • Song, Chai-Jong;Jang, Dalwon;Lee, Seok-Pil;Park, Hochong
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2012.07a
    • /
    • pp.164-166
    • /
    • 2012
  • In this paper, we proposed a method for extracting predominant melody of polyphonic music based on harmonic structure. Harmonic structure is an important feature parameter of monophonic signal that has spectral peaks at the integer multiples of its fundamental frequency. We extract all fundamental frequency candidates contained in the polyphonic signal by verifying the required condition of harmonic structure. Then, we combine those harmonic peaks corresponding to each extracted fundamental frequency and assign a rank to each after calculating its harmonic average energy. We run pitch tracking based on the rank of extracted fundamental frequency and continuity of fundamental frequency, and determine the predominant melody. For the query by singing/humming (QbSH) task, we proposed Dynamic Time Warping (DTW) based matching engine. Our system reduces false alarm by combining the distances of multiple DTW processes. To improve the performance, we introduced the asymmetric sense, pitch level compensation, and distance intransitiveness to DTW algorithm.

  • PDF

A Study on Analysis Electrical Characteristics of Cable Lenght change about area Boundary of UM71C Audio Frequency Track Circuit (고속철도 AF궤도회로경계구간 케이블길이 변화에 따른 전기특성 분석연구)

  • Choi, Jae Sik;Kim, Hie Sik;Park, Ju Hun;Kim, Bum Gon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.16 no.7
    • /
    • pp.4849-4854
    • /
    • 2015
  • It has been often occurred for the outside components(BU, SVaC, DB) of UM71c AF track circuits to be broken down caused by some pieces of falling ice in the winter time or by infrastructure repairing equipments while facility maintenance works since 2004, opening of Kyeongbu High Speed Rail Express. In this paper, we proposed that we could move the outside components of UM71c track circuit out of wayside from present place. Then we can assure that the life time of those components would be extended. So we simulated the electrical characteristics by changing cable length using MATLAB Simulinks and we designed the compensation capacitor. Also, we obtained the same results as those of simulation by field demonstration test on site. The design specifications obtained from this field verification test could be applied in the absent section of track circuit, if only have a little more intensified research to compensate changed electrical characteristics and to redesign inner impedance of the track circuit.

Improvement of Lipreading Performance Using Gabor Filter for Ship Environment (선박 환경에서 Gabor 여파기를 적용한 입술 읽기 성능향상)

  • Shin, Do-Sung;Lee, Seong-Ro;Kwon, Jang-Woo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.35 no.7C
    • /
    • pp.598-603
    • /
    • 2010
  • In this paper, we work for Lipreading using visual information for ship environment. Lipreading is studied for using image information including lips of a speaker at the existing speech recognition system. This technique is a compensation method to increase recognition rate decreasing remarkably in noisy circumstances. Proposed way improved the rate of recognition improving methode of preprocessing using the Gabor Filter for Ship Environment. The experiment were carried out under changing of light with time in the ship environment with lip image. For Comparing with recognition, make a compare with between method of lip region of interest (ROI) before Gabor filtering and after Gabor filtering. In the case of using method of lip ROI before Gabor filtering, the result of the experiments applying to the proposed ways recognition resulting in 44% of recognition.

The Development of a Speech Recognition Method Robust to Channel Distortions and Noisy Environments for an Audio Response System(ARS) (잡음환경및 채널왜곡에 강인한 ARS용 전화음성인식 방식 연구)

  • Ahn, Jung-Mo;Yim, Kye-Jong;Kay, Young-Chul;Koo, Myoung-Wan
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.2
    • /
    • pp.41-48
    • /
    • 1997
  • This paper proposes the methods for improving the recognition rate of theARS, especially equipped with the speech recognition capability. Telephone speech, which is the input to the ARS, is usually affected by the announcements from the system, channel noise, and channel distortion, thus directly applying the recognition algorithm developed for clean speech to the noisy telephone speech will bring the significant performance degradation. To cope with this problem, this paper proposes three methods: 1)the accurate detection of the inputting instant of the speech in order to immediately turn off the announcements from the system at that instant, 2)the effective end-point detection of the noisy telephone speech on the basis of Teager energy, and 3)the SDCN-based compensation of the channel distortion. Experiments on speaker-independent, noisy telephone speech reveal that the combination of the above three proposed methods provides great improvements on the recognition rate over the conventional method, showing about 77% in contrast to only 23%.

  • PDF