템플릿 기반 음향 신호 분리 기술 연구

  • 권기수 (서울대학교 전기정보공학부) ;
  • 김남수 (서울대학교 전기정보공학부)
  • Published : 2016.05.25

Abstract

Keywords

References

  1. R. Larsen and R. M. Aarts. Audio bandwidth extension: application of psychoacoustics, signal processing and loudspeaker design. John Wiley & Sons, 2005.
  2. K. Kwon, J. W. Shin, and N. S. Kim, "NMF-based speech enhancement using bases update," IEEE Signal Process. Lett., vol. 22, no. 4, pp. 450-454, Apr. 2015. https://doi.org/10.1109/LSP.2014.2362556
  3. K. Kwon, J. W. Shin, H. Y. Kim, and N. S. Kim, "Discriminative nonnegative matrix factorization using cross-reconstruction error for source separation," Proc. of ISCA Interspeech, 2015.
  4. Naik, Ganesh R., and Wenwu Wang, eds. Blind Source Separation: Advances in Theory, Algorithms and Applications. Springer, 2014.
  5. T. Virtanen, J. F. Gemmeke, B. Raj, and P. Smaragdis, "Compositional model for audio processing," SPM, Mar. 2015.
  6. A. Cichocki, R. Zdunek, A. H. Phan, and S. Amari, Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-way Data Analysis and Blind Source Separation. Hoboken, NJ: Wiley, 2009.
  7. J. Gemmeke, T. Virtanen, and A. Hurmalainen, "Exemplarbased sparse representations for noise robust automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Processing, vol. 19, no. 7, pp. 2067-2080, 2011. https://doi.org/10.1109/TASL.2011.2112350
  8. Y.-C. Cho and S. Choi, "Nonnegative features of spectrotemporal sounds for classification," Pattern Recognit. Lett., vol. 26, no. 9, pp. 1327-1336, 2005. https://doi.org/10.1016/j.patrec.2004.11.026
  9. A. Ozerov and C. Fevotte, "Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation," IEEE Trans. Audio, Speech, Lang. Processing, vol. 18, no. 3, pp. 550-563, 2010. https://doi.org/10.1109/TASL.2009.2031510
  10. M. D. Plumbley, T. Blumensath, L. Daudet, R. Gribonval, and M. E. Davies,"Sparse representations in audio & music: From coding to source separation," Proc. IEEE, vol. 98, no. 6, pp. 995-1005, 2009. https://doi.org/10.1109/JPROC.2009.2030345
  11. R. Takashima, T. Takiguchi, and Y. Ariki, "Exemplar-based voice conversion in noisy environment," in Proc. IEEE Spoken Language Technology Workshop, 2012, pp. 313-317.
  12. D. D. Lee and H. S. Seung, "Learning the parts of objects by nonnegative matrix factorization," Nature, vol. 401, pp. 788-791, 1999. https://doi.org/10.1038/44565
  13. P. Smaragdis, "Convolutive speech bases and their application to supervised speech separation," IEEE Trans. on Audio, Speech, and Language process., vol. 15, no. 1, Jan. 2007.
  14. K. Kwon, J. W. Shin, and N. S. Kim, "Target source separation based on discriminative nonnegative matrix factorization incorporating cross-reconstruction error," IEICE Transactions on Information and Systems, Vol. E98.D, No. 11, pp. 2017-2020, 2015 https://doi.org/10.1587/transinf.2015EDL8114
  15. C. Boutsidis and E. Gallopoulos, "SVD based initialization: A head start for nonnegative matrix factorization," Pattern Recognition, vol. 41, no. 4, pp. 1350-1362, 2008. https://doi.org/10.1016/j.patcog.2007.09.010