Enhancement of Speech/Music Classification for 3GPP2 SMV Codec Employing Discriminative Weight Training

  • Published: 2008.08.31

Abstract

This paper proposes a method for improving the real-time speech/music classification of the 3GPP2 Selectable Mode Vocoder (SMV) using discriminative weight training based on the minimum classification error (MCE) algorithm. We first analyze the feature vectors and the classification method used in the SMV's real-time speech/music classification algorithm. Building on that analysis, we introduce MCE training to assign a separate weight to each feature and present a speech/music decision rule: only the features already used by the SMV classifier are selected, each is weighted, and the geometric mean of the weighted values is compared with a threshold in real time. The proposed method is compared against the original SMV classification algorithm across a variety of music genres, and with the weights applied it yields better speech/music classification performance than the conventional SMV scheme.
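The decision rule described in the abstract (weight each feature, take the geometric mean, compare with a threshold) can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation. The function names, the requirement that features be positive, the normalization by the sum of the weights, the "above threshold means music" convention, and the sigmoid-smoothed loss are all assumptions made for the example, and the actual SMV features are not reproduced here.

```python
import math


def weighted_geometric_mean(features, weights):
    """Weighted geometric mean: exp(sum_i w_i * ln(x_i) / sum_i w_i).

    Assumption: every feature value is strictly positive and the
    weights are non-negative with a positive sum.
    """
    if len(features) != len(weights):
        raise ValueError("features and weights must have the same length")
    if any(x <= 0 for x in features):
        raise ValueError("features must be strictly positive")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must have a positive sum")
    log_mean = sum(w * math.log(x) for x, w in zip(features, weights)) / total
    return math.exp(log_mean)


def classify_frame(features, weights, threshold):
    """Label one frame by comparing the weighted score with a threshold.

    Assumption: scores above the threshold indicate music.
    """
    score = weighted_geometric_mean(features, weights)
    return "music" if score > threshold else "speech"


def mce_loss(score, threshold, is_music, slope=1.0):
    """Sigmoid-smoothed classification error, in the spirit of MCE training.

    The misclassification measure d is positive when the frame falls on
    the wrong side of the threshold, so the loss exceeds 0.5 for errors.
    """
    margin = math.log(score) - math.log(threshold)
    d = -margin if is_music else margin
    return 1.0 / (1.0 + math.exp(-slope * d))


if __name__ == "__main__":
    frame = [4.0, 1.0]  # hypothetical feature values for one frame
    print(classify_frame(frame, [1.0, 1.0], threshold=1.5))
    print(classify_frame(frame, [0.0, 1.0], threshold=1.5))
```

In a training loop, the weights would be adjusted to reduce the average of `mce_loss` over labeled frames, for example by gradient descent. That step is omitted because the abstract does not specify the update rule.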
