Search | Korea Science

Decomposition of Speech Signal into AM-FM Components Using Varialle Bandwidth Filter (가변 대역폭 필터를 이용한 음성신호의 AM-FM 성분 분리에 관한 연구)

Song, Min;Lee, He-Young
- Speech Sciences
- /
- v.8 no.4
- /
- pp.45-58
- /
- 2001
Modulated components of a speech signal are frequently used for speech coding, speech recognition, and speech synthesis. Time-frequency representation (TFR) reveals some information about instantaneous frequency, instantaneous bandwidth and boundary of each component of the considering speech signal. In many cases, the extraction of AM-FM components corresponding to instantaneous frequencies is difficult since the Fourier spectra of the components with time-varying instantaneous frequency are overlapped each other in Fourier frequency domain. In this paper, an efficient method decomposing speech signal into AM-FM components is proposed. A variable bandwidth filter is developed for the decomposition of speech signals with time-varying instantaneous frequencies. The variable bandwidth filter can extract AM-FM components of a speech signal whose TFRs are not overlapped in timefrequency domain. Also, amplitude and instantaneous frequency of the decomposed components are estimated by using Hilbert transform.
PDF

Designing of efficient super-wide bandwidth extension system using enhanced parameter estimation in time domain (시간 영역에서 개선된 파라미터 추론을 통한 효율적인 초광대역 확장 시스템 설계)

Jeon, Jong-jeon
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2018.10a
- /
- pp.431-433
- /
- 2018
This paper proposes the system that offer super-wideband speech which is made by artificial bandwidth extension technique using wideband speech signal in time-domain. wideband excitation signal and line spectrum pair(LSP) are extracted based on source-filter model in time-domain. Two parameters are extended by each bandwidth extension algorithms, and then, super-wideband speech parameters are estimated. and synthesized. Subjective test shows super-wideband speech is better speech quality than wideband speech signal.
PDF

Sub-Nyquist Nonuniform Sampling and Perfect Reconstruction of Speech Signals (음성신호의 Sub-Nyquist 비균일 표준화 및 완전 복구에 관한 연구)

Lee, He-Young
- Speech Sciences
- /
- v.12 no.2
- /
- pp.153-170
- /
- 2005
The sub-Nyquist nonuniform sampling (SNNS) and the perfect reconstruction (PR) formula are proposed for the development of a systematic method to obtain minimal representation of a speech signal. In the proposed method, the instantaneous sampling frequency (ISF) varies, depending on the least upper boundary of spectral support of a speech signal in time-frequency domain (TFD). The definition of the instantaneous bandwidth (IB), which determines the ISF and is used for generating the set of samples that represent continuous-time signals perfectly, is given. Also, the spectral characteristics of the sampled data generated by the sub-Nyquist nonuniform sampling method is analyzed. The proposed method doesn't generate the redundant samples due to the time-varying property of the instantaneous bandwidth of a speech signal.
PDF

Realization of Variable Bandwidth Filter for Decomposition of Speech Signals into AM-FM Components (음성신호의 AM-FM 성분 분리를 위한 가변대역폭 필터 구현)

이희영;김용태
- Proceedings of the IEEK Conference
- /
- 2003.07e
- /
- pp.2208-2211
- /
- 2003
In this paper, a variable bandwidth filter(VBF) is realized with the purpose of the decomposition of speech signals with time-varying instantaneous of frequencies. The proposed VBF can extract AM-FM components of a speech signal whose time-frequency representations(TFRs) are not overlapped in time-frequency domain
PDF

High-Band Codec for Bandwidth Scalable Wideband Speech Codec (대역폭 계층 구조의 광대역 음성 부호화기를 위한 상위 대역 부호화기 연구)

Kim Youngvo;Jeong Byounghak;Son Chang-Yong;Sung Ho-Sang;Park Hochong
- The Journal of the Acoustical Society of Korea
- /
- v.24 no.7
- /
- pp.395-401
- /
- 2005
In this paper, the high-band codec for bandwidth scalable wideband speech codec is proposed. The wideband input speech signal is separated into low-band signal and high-band signal, and the low-band signal is encoded by the standard narrow-band speech codec and the high-band signal is encoded by the proposed codec. In the high-band codec. the signal is transformed into frequency domain by MLT on a subframe basis, and MLT coefficients are splitted into magnitude and sign for quantization. The magnitudes of MLT coefficients are arranged into several time-frequency bands and each band is quantized in 2D-DCT domain, where the low-band information is utilized for better performance. The sign of MLT coefficient is quantized based on a priority selection process with the weighting measurement. The objective and subjective performance of wideband speech codec including the proposed high-band codec is measured, and it is confirmed that the proposed codec has better performance than 32kbps G.722.1.
PDF KSCI

A Study on the Bandwidth Extension Adopted for 4800 bps CELP Speech Coder (4800bps CELP 음성 부호화기에 적용한 대역폭 확장에 관한 연구)

Park Sin Soo;Kim Hyung Soon
- Proceedings of the KSPS conference
- /
- 2002.11a
- /
- pp.175-178
- /
- 2002
Most existing telephone networks transmit narrowband speech witch has been bandlimited below 4 kHz. Compared with wideband speech up to 8 kHz, narrowband speech shows reduced intelligibility and a muffled quality. Bandwidth extension is a technique to generate wideband speech by reconstructing 4-8 kHz highband speech without any additional information. This paper presents experimental results of the bandwidth extension adopted for 4800 bps CELP speech coder. In this experiment, we examine various methods for reconstruction of wideband spectrum and excitation signal, compare and analyze their performance by performing the subjective preference test and measuring the cepstral distortion.
PDF

A Scalable Audio Coder for High-quality Speech and Audio Services

Lee, Gil-Ho;Lee, Young-Han;Kim, Hong-Kook;Kim, Do-Young;Lee, Mi-Suk
- MALSORI
- /
- no.61
- /
- pp.75-86
- /
- 2007
In this paper, we propose a scalable audio coder, which has a variable bandwidth from the narrowband speech bandwidth to the audio bandwidth and also has a bit-rate from 8 to 320 kbits/s, in order to cope with the quality of service(QoS) according to the network load. First of all, the proposed scalable coder splits bandwidth of the input audio into narrowband up to around 4 kHz and above. Next, the narrowband signals are compressed by a speech coding method compatible to an existing standard speech coder such as G.729, and the other signals whose bandwidth is above the narrowband are compressed on the basis of a psychoacoustic model. It is shown from the objective quality tests using the signal-to-noise ratio(SNR) and the perceptual evaluation of audio quality(PEAQ) that the proposed scalable audio coder provides a comparable quality to the MPEG-1 Layer III (MP3) audio coder.
PDF

Improvement of acoustic feedback stability by bandwidth compression and expansion

염동홍;안수길
- The Journal of the Acoustical Society of Korea
- /
- v.4 no.1
- /
- pp.16-21
- /
- 1985
Both shifiting the input signal's frequencies by a fixed frequency and compressing the input signal's bandwidth have been known to be effective in improving the stability margin of public adress systems operating in reverberant spaces. This paper describes the effect of an alternative approach of improving the acoustic-feedback stability and yet maintaining speech inteligibility by bandwidth compression and expansion. Conditions are derived for this technizue to be realized and an experimental system has been made - up. A series of experiments has been performed in small spaces and the results have shown that more than 5dB improvement can be obtained in the stability margin.
PDF

Bandwidth Scalable Wideband Speech Codec (대역폭 계층 구조의 광대역 음성 부호차기 개발)

이우석;손창용;이영범;박호종
- The Journal of the Acoustical Society of Korea
- /
- v.23 no.6
- /
- pp.481-487
- /
- 2004
In this paper. the structure of bandwidth scalable wideband speech codec and its high-band codec are proposed. In the high-band codec. the signal is divided into frequency bands. and each band is quantized in DCT domain. The DCT coefficients are splitted into magnitude and sign, and each is quantized independently by a specialized method based on its characteristics. In addition. the quantized gain parameter in the low-band codec is utilized in the high-band codec for an enhanced performance. The bandwidth scalable wideband speech codec using G.729E for low-band and the proposed codec for high-band is developed, and it is confirmed that the proposed codec has better subjective performance than 24kbps G.722.1.
PDF KSCI

On Improving Resolution of Time-Frequency Representation of Speech Signals Based on Frequency Modulation Type Kernel (FM변조된 형태의 Kernel을 사용한 음성신호의 시간-주파수 표현 해상도 향상에 관한 연구)

Lee, He-Young;Choi, Seung-Ho
- Speech Sciences
- /
- v.12 no.4
- /
- pp.17-29
- /
- 2005
Time-frequency representation reveals some useful information about instantaneous frequency, instantaneous bandwidth and boundary of each AM-FM component of a speech signal. In many cases, the instantaneous frequency of each component is not constant. The variability of instantaneous frequency causes degradation of resolution in time-frequency representation. This paper presents a method of adaptively adjusting the transform kernel for preventing degradation of resolution due to time-varying instantaneous frequency. The transform kernel is the form of frequency modulated function. The modulation function in the transform kernel is determined by the estimate of instantaneous frequency which is approximated by first order polynomial at each time instance. Also, the window function is modulated by the estimated instantaneous. frequency for mitigation of fringing. effect. In the proposed method, not only the transform kernel but also the shape and the length of. the window function are adaptively adjusted by the instantaneous frequency of a speech signal.
PDF

Search Result 37, Processing Time 0.019 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)