Highband Coding Method Using Matching Pusuit Estimation and CELP Coding for Wideband Speech Coder

Jeong Gyu-Hyeok;Ahn Yeong-Uk;Kim Jong-Hark;Shin Jae-Hyun;Seo Sang-Won;Hwang In-Kwan;Lee In-Sung;

doi:10.7776/ASK.2006.25.1.021

한국음향학회지 (The Journal of the Acoustical Society of Korea)

제25권1호
/
Pages.21-29
/
2006
/
1225-4428(pISSN)
/
2287-3775(eISSN)

한국음향학회 (The Acoustical Society of Korea)

DOI QR Code

광대역 음성부호화기를 위한 매칭퍼슈잇 알고리즘과 CELP 방법을 이용한 고대역 부호화 방법

Highband Coding Method Using Matching Pusuit Estimation and CELP Coding for Wideband Speech Coder

정규혁 (충북대학교 전파공학과) ;
안영욱 (코아로직) ;
김종학 (충북대학교 전파공학과) ;
신재현 (충북대학교 전파공학과) ;
서상원 (충북대학교 전파공학과) ;
황인관 (충북대학교 전파공학과) ;
이인성 (충북대학교 전파공학과)

발행 : 2006.01.01

https://doi.org/10.7776/ASK.2006.25.1.021 인용 PDF KSCI

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

본 논문에서는 대역분활 광대역 음성부호화기와 이를 위한 고대역 부호화 방법과 구조를 제안한다. 제안하는 광대역 음성부호화기는 광대역 입력 음성신호를 저대역 신호 (OHz-4kHz)와 고대역 신호 (4kHz-8kHz)로 나눈다. 저대역 신호는 G.729 Annex E로 부호화하고, 고대역 신호는 4kbps의 전송률로 제안하는 방법으로 압축된다. 고대역 신호는 LPC 분석 후 신호특성에 따라 모드를 분류된다. stationary 모드에서는 매칭퍼슈잇 알고리즘과 CELP 방법으로 부호화하는 다단계 구조의 혼합 여기신호모델이 적용되며, nonstationary 모드에서는 CELP 방법으로 부호화된다. 제안한 광대역 음성부호화기의 성능을 주관적 방법으로 G.722 48kbps SB-ADPCM, G.722.2 12.85kbps ACELP와 비교를 하였다. 제안한 부호화기는 G.722보다 나은 성능을 보이고, G.722.2보다 나쁘지 않은 성능을 가지는 것을 확인하였다.

In this Paper a split bandwidth wideband speech coder and its highband coding method are Proposed. The coder uses a split-band approach. where the wideband input speech signal is split into two equal frequency bands from 0-4kHz and 4-8kHz. The lowband and the highband are coded respectively by the 11.8kb/s G.729 Annex E and the proposed coding method. After the LPC analysis, the highband is divided by two modes according to the properties of signals. In stationary mode. the highband signals are compressed by the mixture excitation model; CELP algorithm and W (Matching Pursuit) algorithm. The others are coded by the only CELP algorithm. We compare the performance of the new wideband speech coder with that of G.722 48kbps SB-ADPCM and G.722.2 12.85kbps in a subjective method. The simulation results show that the Performance of the proposed wideband speech coder has better than that of 48kbps G.722 and no better than that of 12.85kbps G.722.2.

키워드

참고문헌

이미숙, '광대역 코덱의 기술 및 표준화 동향,' TTA Journal, 65-71, Mar. 2004
ITU-T SG16 Q.9, 'Report of 0.9/16 meeting,' Nov. 2004
T. Nomura, M. Iwadare, M. Serizawa and K. Ozawa, 'A bit rate and bandwidth scalable CELP coder,' IEEE International Conference on Acoustics, Speech and Signal Processing, 1, 341-344. May 1998
K. Koishida, V. Cuperman and A. Gersho, 'A 16-kbit/s bandwidth scalable audio coder based on the G.729 standard,' IEEE International Conference on Acoustics, Speech and Signal Processing, 2, 1149-1152. Jun. 2000
Sung-Kyo Jung, Kyung-Tae Kini and Hong-Goo Kang, 'A bit-rate/bandwidth scalable speech coder based on ITU-T G.723.1 standard,' IEEE International Conference on Acoustics, Speech and Signal Processing, 1, 285-288. May 2004
Lajos Hanzo, F. Clare, A. Somerville and Jason P. Woodard, Voice Compression and Communications (John Wiley & Sons Ltd., New York, 2001), 531-564
송재종, 박호종, 김무영, 김도석, 김정수, '광대역 신호 압축기를 위한 주파수 대역 특성에 선택적인 양자화 방법,' 음향학회지 20 (7), 76-82, 2001
이우석, 박호종, 손창용, 이영범, '대역폭 계층 구조의 광대역 음성 부호화기 개발', 음향학회지 23 (6), 481-487, 2004
B. Kovesi, D. Massaloux and A. Sollaud, 'A scalable speech and audio coding scheme with continuous bitrate flexibility,' IEEE International Conference on Acoustics, Speech and Signal Processing, 1, 273-276, May 2004
오연선, 신재현, 이인성, 'MLT 여기신호를 이용한 광대역 음성 부호화기 설계,' 음향학회지 24 (5), 230-237, 2005
ITU- T Recommendation. G.729 Annex E, '11.8kblt/s CS-ACELP speech coding algorithm', Sep. 1998
3GPP C.S0030-0, 'Selectable mode vocoder service option for wideband spread spectrum communication system', Dec. 2001
R. McAulay and T, Quatieri, 'Speech Analysis/ Synthesis Based on a Sinusoidal Representation,' IEEE Transactions on Signal Processing, 34, 744-754, Aug. 1986 https://doi.org/10.1109/TASSP.1986.1164910
S. G. Mallet and Zhifeng Zhang, 'Matching pursuit with time-frequency dictionaries', IEEE Transactions on Signal Processing, 41, 3397-3415, 1993 https://doi.org/10.1109/78.258082
E. B. George and M. J. T. Smith, 'Speech analysis/ synthesis and modification using an analysis-bysynthesis/overlap-add sinusoidal model,' IEEE Transcations on Signal Processing, 5, 389-406, Sep. 1997
안영욱, 정규혁, 김종학, 양용호, 이인성 '정현파 모델 부호화기를 위한 MP(Matching Pursuit)알고리즘과 파라미터 양자화기' 음향학회지 24 (7) 402-409, 2005
Kyung jin Bvun, Hee Bum .Junc, Minsoo Hahn and Kyung Soo Kim, 'A Fast ACELP Code book Search Method,' IEEE International Conference on Acoustics, Speech and Signal Processing, 1, 422-425, Aug. 2002
A. M. Kondoz, Digital Speech(John Wiley & Sons Ltd., New York, 1994), 174-212
ITU-T Recommendation. G.722, '7 kHz audio-coding within 64 kbit/s.' Nov. 1988
ITU-T Recommendation. G.722.2, 'Wideband coding of speech at around 16 kbit/s using Adaptive Multi-Rate Wideband,' Jan. 2001

한국음향학회지 (The Journal of the Acoustical Society of Korea)

광대역 음성부호화기를 위한 매칭퍼슈잇 알고리즘과 CELP 방법을 이용한 고대역 부호화 방법

Highband Coding Method Using Matching Pusuit Estimation and CELP Coding for Wideband Speech Coder

초록

키워드

참고문헌

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)