Search | Korea Science

Song, Chai-Jong;Jang, Dalwon;Lee, Seok-Pil;Park, Hochong
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2012.07a
- /
- pp.164-166
- /
- 2012
In this paper, we proposed a method for extracting predominant melody of polyphonic music based on harmonic structure. Harmonic structure is an important feature parameter of monophonic signal that has spectral peaks at the integer multiples of its fundamental frequency. We extract all fundamental frequency candidates contained in the polyphonic signal by verifying the required condition of harmonic structure. Then, we combine those harmonic peaks corresponding to each extracted fundamental frequency and assign a rank to each after calculating its harmonic average energy. We run pitch tracking based on the rank of extracted fundamental frequency and continuity of fundamental frequency, and determine the predominant melody. For the query by singing/humming (QbSH) task, we proposed Dynamic Time Warping (DTW) based matching engine. Our system reduces false alarm by combining the distances of multiple DTW processes. To improve the performance, we introduced the asymmetric sense, pitch level compensation, and distance intransitiveness to DTW algorithm.
PDF

Choi, Jae Sik;Kim, Hie Sik;Park, Ju Hun;Kim, Bum Gon
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.16 no.7
- /
- pp.4849-4854
- /
- 2015
It has been often occurred for the outside components(BU, SVaC, DB) of UM71c AF track circuits to be broken down caused by some pieces of falling ice in the winter time or by infrastructure repairing equipments while facility maintenance works since 2004, opening of Kyeongbu High Speed Rail Express. In this paper, we proposed that we could move the outside components of UM71c track circuit out of wayside from present place. Then we can assure that the life time of those components would be extended. So we simulated the electrical characteristics by changing cable length using MATLAB Simulinks and we designed the compensation capacitor. Also, we obtained the same results as those of simulation by field demonstration test on site. The design specifications obtained from this field verification test could be applied in the absent section of track circuit, if only have a little more intensified research to compensate changed electrical characteristics and to redesign inner impedance of the track circuit.
https://doi.org/10.5762/KAIS.2015.16.7.4849 인용 PDF KSCI

Shin, Do-Sung;Lee, Seong-Ro;Kwon, Jang-Woo
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.35 no.7C
- /
- pp.598-603
- /
- 2010
In this paper, we work for Lipreading using visual information for ship environment. Lipreading is studied for using image information including lips of a speaker at the existing speech recognition system. This technique is a compensation method to increase recognition rate decreasing remarkably in noisy circumstances. Proposed way improved the rate of recognition improving methode of preprocessing using the Gabor Filter for Ship Environment. The experiment were carried out under changing of light with time in the ship environment with lip image. For Comparing with recognition, make a compare with between method of lip region of interest (ROI) before Gabor filtering and after Gabor filtering. In the case of using method of lip ROI before Gabor filtering, the result of the experiments applying to the proposed ways recognition resulting in 44% of recognition.
PDF KSCI

Ahn, Jung-Mo;Yim, Kye-Jong;Kay, Young-Chul;Koo, Myoung-Wan
- The Journal of the Acoustical Society of Korea
- /
- v.16 no.2
- /
- pp.41-48
- /
- 1997
This paper proposes the methods for improving the recognition rate of theARS, especially equipped with the speech recognition capability. Telephone speech, which is the input to the ARS, is usually affected by the announcements from the system, channel noise, and channel distortion, thus directly applying the recognition algorithm developed for clean speech to the noisy telephone speech will bring the significant performance degradation. To cope with this problem, this paper proposes three methods: 1)the accurate detection of the inputting instant of the speech in order to immediately turn off the announcements from the system at that instant, 2)the effective end-point detection of the noisy telephone speech on the basis of Teager energy, and 3)the SDCN-based compensation of the channel distortion. Experiments on speaker-independent, noisy telephone speech reveal that the combination of the above three proposed methods provides great improvements on the recognition rate over the conventional method, showing about 77% in contrast to only 23%.
PDF