Speech Quality of a Sinusoidal Model Depending on the Number of Sinusoids

Seo, Jeong-Wook;Kim, Ki-Hong;Seok, Jong-Won;Bae, Keun-Sung;

Speech Sciences (음성과학)

Volume 7 Issue 1
/
Pages.17-29
/
2000
/
1226-5276(pISSN)

Korean Society of Speech Sciences (한국음성학회)

Speech Quality of a Sinusoidal Model Depending on the Number of Sinusoids

Seo, Jeong-Wook (School of Electronic and Electrical Eng., Kyungpook National Univ.) ;
Kim, Ki-Hong (LG Electronics Co.) ;
Seok, Jong-Won (ETRI, Broadcasting Technology Department, Raio & Broadcasting Technology Lab.) ;
Bae, Keun-Sung (School of Electronic and Electrical Eng., Kyungpook National Univ.)

Published : 2000.03.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

The STC(Sinusoidal Transform Coding) is a vocoding technique that uses a sinusoidal speech model to obtain high- quality speech at low data rate. It models and synthesizes the speech signal with fundamental frequency and its harmonic elements in frequency domain. To reduce the data rate, it is necessary to represent the sinusoidal amplitudes and phases with as small number of peaks as possible while maintaining the speech quality. As a basic research to develop a low-rate speech coding algorithm using the sinusoidal model, in this paper, we investigate the speech quality depending on the number of sinusoids. By varying the number of spectral peaks from 5 to 40 speech signals are reconstructed, and then their qualities are evaluated using spectral envelope distortion measure and MOS(Mean Opinion Score). Two approaches are used to obtain the spectral peaks: one is a conventional STFT (Short-Time Fourier Transform), and the other is a multiresolutional analysis method.

Speech Sciences (음성과학)

Speech Quality of a Sinusoidal Model Depending on the Number of Sinusoids

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)