Browse > Article
http://dx.doi.org/10.7776/ASK.2014.33.1.054

CASA Based Approach to Estimate Acoustic Transfer Function Ratios  

Shin, Minkyu (고려대학교 전자전기전파공학부)
Ko, Hanseok (고려대학교 전자전기전파공학부)
Abstract
Identification of RTF (Relative Transfer Function) between sensors is essential to multichannel speech enhancement system. In this paper, we present an approach for estimating the relative transfer function of speech signal. This method adapts a CASA (Computational Auditory Scene Analysis) technique to the conventional OM-LSA (Optimally-Modified Log-Spectral Amplitude) based approach. Evaluation of the proposed approach is performed under simulated stationary and nonstationary WGN (White Gaussian Noise). Experimental results confirm advantages of the proposed approach.
Keywords
System identification; Relative transfer function; Multi-microphone speech enhancement;
Citations & Related Records
연도 인용수 순위
  • Reference
1 L. Griffiths and C. Jim, "An alternative approach to linearly constrained adaptive beamforming," IEEE Trans Antennas Propag, 30, 27-34 (1982).   DOI
2 A. Krueger, E. Warsitz, and R. Haeb-Umbach, "Speech enhancement with a GSC-like structure employing eigenvector-based transfer function ratios estimation," IEEE Trans Audio Speech Lang Processing, 19, 206-219 (2011).   DOI
3 S. Gannot, D. Burshtein, and E. Weinstein, "Signal enhancement using beamforming and nonstationarity with applications to speech," IEEE Trans Signal Processing, 49, 1614-1626 (2001).   DOI   ScienceOn
4 O. Shalvi and E. Weinstein, "System identification using nonstationary signals," IEEE Trans Signal Processing, 44, 2055-2063 (1996).   DOI   ScienceOn
5 I. Cohen, "Relative transfer function identification using speech signals," IEEE Trans Speech Audio Process, 12, 451-459 (2004).   DOI   ScienceOn
6 R. Talmon, I. Cohen, and S. Gannot, "Relative transfer function identification using convolutive transfer function approximation," IEEE Trans Audio Speech Lang Processing, 17, 546-555 (2009).   DOI
7 I. Cohen and B. Berdugo, "Speech enhancement for nonstationary noise environments," Signal processing, 81, 2403-2418 (2001).   DOI   ScienceOn
8 D. Wang and G. J. Brown, Computational auditory scene analysis: Principles, algorithms, and applications (Wiley-IEEE Press, New York, 2006), pp. 81-114.
9 G. Hu and D. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans Neural Netw, 15, 1135-1150 (2004).   DOI   ScienceOn
10 "DARPA Resource Management Continuous Speech Database (RM1)," NIST Speech Disc 2-5.1 (1996).