Enhancement of Speech/Music Classification for 3GPP2 SMV Codec Employing Discriminative Weight Training

Kang, Sang-Ick;Chang, Joon-Hyuk;Lee, Seong-Ro;

doi:10.7776/ASK.2008.27.6.319

한국음향학회지 (The Journal of the Acoustical Society of Korea)

제27권6호
/
Pages.319-324
/
2008
/
1225-4428(pISSN)
/
2287-3775(eISSN)

한국음향학회 (The Acoustical Society of Korea)

DOI QR Code

변별적 가중치 학습을 이용한 3GPP2 SVM의 실시간 음성/음악 분류 성능 향상

Enhancement of Speech/Music Classification for 3GPP2 SMV Codec Employing Discriminative Weight Training

강상익 (인하대학교 전자공학부) ;
장준혁 (인하대학교 전자공학부) ;
이성로 (목포대학교 정보공학부)

발행 : 2008.08.31

https://doi.org/10.7776/ASK.2008.27.6.319 인용 PDF KSCI

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

본 논문에서는 변별적 가중치 학습 (discriminative weight training) 기반의 3GPP2 Selectable Mode Vocoder (SMV) 실시간 음성/음악 분류 성능을 향상 시키는 방법을 제안한다. SMV의 음성/음악 실시간 분류 알고리즘에서 사용된 특징벡터와 분류방법을 분석하고, 이를 기반으로 분류성능향상을 위해 MCE (minimum classification error)방법을 도입하여, 각 특징 백터별로 다른 가중치를 적용하는 음성/음악 결정법 (decision rule)을 제시한다. 구체적으로 SMV의 음성/음악 분류알고리즘에서 사용되어진 특징벡터만을 선택적으로 사용하여 가중치를 적용한 값을 기하 평균한 값을 문턱값과 비교하는 실시간 분류기법이 제시되었다. SMV의 음성/음악 분류에 제안한 방법의 성능 평가를 위해 SMV 원래의 분류알고리즘과 비교하였으며, 다양한 음악장르에 대해 시스템의 성능을 평가한 결과 가중치를 적용하였을 때 기존의 SMV의 방법보다 우수한 음성/음악 분류 성능을 보였다.

In this paper, we propose a novel approach to improve the performance of speech/music classification for the selectable mode vocoder (SMV) of 3GPP2 using the discriminative weight training which is based on the minimum classification error (MCE) algorithm. We first present an effective analysis of the features and the classification method adopted in the conventional SMV. And then proposed the speech/music decision rule is expressed as the geometric mean of optimally weighted features which are selected from the SMV. The performance of the proposed algorithm is evaluated under various conditions and yields better results compared with the conventional scheme of the SMV.

키워드

참고문헌

Y. Gao, E. Shlomot, A. Benyassine, J. Thyssen, H.-Y. Su, and C. Murgia, "The SMV algorithm selected by TIA and 3GPP2 for CDMA Applications," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, 2, 709 -712, May 2001
3GPP2 Spec., "Source-controlled variable-rate multimedia wideband speech codec (VMR-WB), service option 62 and 63 for spread spectrum systems," 3GPP2-C.S0052-A, v.1.0, Apr. 2005
J. Saunders, "Real-time discrimination of broadcast speech /music," Proc. IEEE International Conference on Acoustics, Speech, and Processing, 2, 993-996, May 1996
W. Q. Wang, W. Gao, and D. W. Ying, "A fast and robust speech/music discrimination approach," Proc. International Conference on Information, Communications and Signal Processing, 3, 1325-1329, Dec. 2003
금지수, 임성길, 이현수, "스펙트럼 분석과 신경망을 이용한 음성/음악 분류", 한국음향학회지, 26(5), 207-213, Jul. 2007
J. Makinen, P. Ojala, and H. Toukomaa, "Performance comparison of source controlled GSM AMR and SMV vocoders," Proc. International Symposium on Intelligent Signal Processing and Communication Systems, 51-154, Nov. 2004
C. V. Goudar, P. Rabha, M. Deshpande, and A. Rao, "SMVLite: Reduced Complexity Selectable Mode Vocoder," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, 1, 701-704, May 2006
3GPP2 Spec., "Selectable mode vocoder (SMV) service option for wideband spread spectrum communication systems," 3GPP2 -C.S0030-0, v3.0, Jan. 2004
S. Craig Greer, and A. Dejaco, "Standardization of the selectable mode vocoder," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, 2, 953-956, May 2001
P. Vary and R. Martin, Digital Speech Transmission : enhancement, coding and error concealment, pp.182-187, 2006
P. Kabal, R. Prakash and Ramachandran, "The computation of line spectral frequencies using Chebyshey polynomials," IEEE Trans. Acoustics, speech and signal processing, ASSP -34(6), 1419-1426, Dec. 1986 https://doi.org/10.1109/TASSP.1986.1164983
B.-H. Juang, W. Chou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Processing, 5(3), 257-265, May 1997 https://doi.org/10.1109/89.568732
S.-I. Kang, Q.-H. Jo, J.-H. Chang, "Discriminative weight training for a statistical model-based voice activity detection," IEEE Signal Processing Letters, 15, 170-173, Feb. 2008 https://doi.org/10.1109/LSP.2007.913595
W. M. Fisher, G. R. Doddington and K. M. Goudie-Marshall, "The DARPA speech recognition research database: Specifi-cations and status," Proc. DARPA Workshop Speech Recognition, pp.93-99, Feb. 1986

한국음향학회지 (The Journal of the Acoustical Society of Korea)

변별적 가중치 학습을 이용한 3GPP2 SVM의 실시간 음성/음악 분류 성능 향상

Enhancement of Speech/Music Classification for 3GPP2 SMV Codec Employing Discriminative Weight Training

초록

키워드

참고문헌

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)