Acknowledgement
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2022R1G1A1008798).