Browse > Article
http://dx.doi.org/10.13067/JKIECS.2013.8.11.1793

Evaluation of a signal segregation by FDBM  

Lee, Chai-Bong (동서대학교 정보시스템공학부)
Publication Information
The Journal of the Korea institute of electronic communication sciences / v.8, no.12, 2013 , pp. 1793-1802 More about this Journal
Abstract
Various approaches for sound source segregation have been proposed. Among these approaches, frequency domain binaural model(FDBM) has the advantages of low computational load and effective howling cancellation. A binaural hearing assistance system based on FDBM has been proposed. This system can enhance desired signal based on the directivity information. Although FDBM has been evaluated in terms of signal-to-noise ratio (SNR) and coherence function, the evaluation results do not always agree with the human impressions. These evaluation methods provide physical measures, and do not take account of perceptual aspect of human being. Considering a binaural hearing assistance system as a one of major applications, the quality of segregated sound should keep level enough. In the paper, signal segregation performance by means of FDBM is evaluated by three objective methods, i.e., SNR, coherence and Perceptual Evaluation of Speech Quality(PESQ), to discuss the characteristic of FDBM on the sound source segregation performance. The simulation's evaluation results show that FDBM improves the quality of the left and right channel signals to an equivalent level. And the results suggest the possibility that PESQ provides a more useful measure than SNR and coherence in terms of the segregation performance of FDBM. The evaluation results by PESQ show the effects from segregation parameters and indicate appropriate parameters under the conditions. In the paper, signal segregation performance by means of FDBM is evaluated by three objective methods, i.e., SNR, coherence and PESQ, to discuss the characteristic of FDBM on the sound source segregation performance. The simulation's evaluation results show that FDBM improves the quality of the left and right channel signals to an equivalent level. And the results suggest the possibility that PESQ provides a more useful measure than SNR and coherence in terms of the segregation performance of FDBM. The evaluation results by PESQ show the effects from segregation parameters and indicate appropriate parameters under the conditions.
Keywords
FDBM; SNR; Coherence Function; PESQ; Sound Source Segregation;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Tsuyoshi Usagawa, Hirokazu Shimada, Yoshiaki Sawada, Yoshifumi Chisaki and Masanao Ebata, "A microphone array system using iterative echo suppression method as inverse filtering", Acoustical Science & Technology, Vol. 22, No. 4, pp. 315-317, 2001.   DOI   ScienceOn
2 Shoji Makino, Shoko Araki, Ryo Mukai, Hiroshi Sawada and Hiroshi Saruwatari, "ICA-based blind source separation of sounds", Proceedings of the Japan-China Joint Conference on Acoustics 2002, pp. 83-86, 2002.
3 Tomoya Takatani, Tsuyoki Nishikawa, Hiroshi Saruwatari, "Blind Source Separation based on Binaural ICA", Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 5, pp. 321-324, 200
4 Hidetoshi Nakashima, Yoshifumi Chisaki, Tsuyoshi Usagawa and Masanao Ebata, "Frequency domain binaural model based on interaural phase and level difference", Acoustical Science & Technology, Vol. 24, No. 4, pp. 172-178. 2003.   DOI   ScienceOn
5 Yoshifumi Chisaki, Kotaro Matsuo and Tsuyoshi Usagawa, "Howling canceler using interaural level difference for binaural hearing assistant system", Acoustical Science & Technology, Vol. 28, No. 2, pp. 90-97, 2007.   DOI   ScienceOn
6 Markus Bodden, "Modeling human soundsource localization and the cocktailpartyeffect", Acta Acoustica, Vol. 1, pp. 43-45, 1993.
7 Takashi Nakanishi, Norifumi Sato, Hidetoshi Nakashima, Yoshifumi Chisaki, Tsuyoshi Usagawa and Masanao Ebata, "Sound Source Segregation under reverberant condition using Frequency Domain Binaural Model", Proceedings of Kyushu-Youngnam Joint Conference on Acoustics 2003, pp. 129-132, 2003.
8 Tsuyoshi Usagawa, Rika Matsuo, Takashi Nakanishi, Hidetoshi Nakashima and Yoshifumi Chisaki, "Concurrent Speech Segregation based on DOA Information using Frequency Domain Binaural Model -An application for hearing aid-", Proceedings of International Congress on Acoustics 2004, Vol. 5, pp. 3655-3658, 2004.
9 Chai-bong Lee, "The effect of leading tone and following tone with single frequency on sound lateralization", The Journal of the Korea Institute of Electronic Communication Sciences, Vol. 5, No. 3, pp. 251-255, 2010.
10 ITU-T Recommendation, "Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs", p. 862, 2001.
11 Chai-bong Lee, "The effect of a temporal masking on the sound laterlization", The Journal of the Korea Institute of Electronic Communication Sciences, Vol. 5, No. 4, pp. 352-356, 2010.
12 Chai-bong Lee, "A study on the simplification of HRTF within low frequency region," The Journal of the Korea Institute of Electronic Communication Sciences, Vol. 5, No. 6, pp. 581-587, 2010.   과학기술학회마을
13 Bill Gardner and Keith martin, "HRTF measurements to a KEMAR dummy head microphone," MIT Media lab Perceptual Computing Technical Report#280, 1994.
14 Eberhard Zwicker, "Subdivision od the audible frequendy rang into critical bands", Journal of the Acoustical Society of America, Vol. 33, No. 2, pp. 248, 1961.   DOI
15 Eberhard Zwicker, Hugo Fastl, Psychoacoustics : Facts and Models, Spring-Verlag, Berlin, 1990.
16 The Acoustical Society of Japan, "A serial speech data base for research purpose", The Journal of the acoustical society of Japan, Vol. 48, No. 12, pp. 888-893, 1992.