Search | Korea Science

On Speech Digitization and Bandwidth Compression Techniques[II]-Vocoding (음성신호의 디지탈화와 대역폭축소의 방법에 관하여 [II]-Vocoding)

은종관
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.15 no.5
- /
- pp.1-6
- /
- 1978
This paper is a sequel of the previous paperl) on speech digitization and bandwidth compression techniques. Several recently developed vocoding techniques, that is, linear predictive coding (LPC), formant vocoding, residual excited linear prediction (RELP) vocoding, and adaptive predictive coding(APC) are discussed. Throughout the leaper emphasis is placed on the LPC approach that is presently the most promising technique in speech compression. In addition, current problems and possible solutions are discussed.
PDF

A Study of BWE-Prediction-Based Split-Band Coding Scheme (BWE 예측기반 대역분할 부호화기에 대한 연구)

Song, Geun-Bae;Kim, Austin
- The Journal of the Acoustical Society of Korea
- /
- v.27 no.6
- /
- pp.309-318
- /
- 2008
In this paper, we discuss a method for efficiently coding the high-band signal in the split-band coding approach where an input signal is divided into two bands and then each band may be encoded separately. Generally, and especially through the research on the artificial bandwidth extension (BWE), it is well known that there is a correlation between the two bands to some degree. Therefore, some coding gain could be achieved by utilizing the correlation. In the BWE-prediction-based coding approach, using a simple linear BWE function may not yield optimal results because the correlation has a non-linear characteristic. In this paper, we investigate the new coding scheme more in details. A few representative BWE functions including linear and non-linear ones are investigated and compared to find a suitable one for the coding purpose. In addition, it is also discussed whether there are some additional gains in combining the BWE coder with the predictive vector quantizer which exploits the temporal correlation.
PDF KSCI

Block Constrained Trellis Coded Vector Quantization of LSF Parameters for Wideband Speech Codecs

Park, Jung-Eun;Kang, Sang-Won
- ETRI Journal
- /
- v.30 no.5
- /
- pp.738-740
- /
- 2008
In this paper, block constrained trellis coded vector quantization (BC-TCVQ) is presented for quantizing the line spectrum frequency parameters of the wideband speech codec. Both a predictive structure and a safety-net concept are combined into BC-TCVQ to develop the predictive BC-TCVQ. The performance of this quantization is compared with that of the linear predictive coding vector quantizer used in the AMRWB codec, demonstrating reductions in spectral distortion.
PDF

Neural-network-based Driver Drowsiness Detection System Using Linear Predictive Coding Coefficients and Electroencephalographic Changes (선형예측계수와 뇌파의 변화를 이용한 신경회로망 기반 운전자의 졸음 감지 시스템)

Chong, Ui-Pil;Han, Hyung-Seob
- Journal of the Institute of Convergence Signal Processing
- /
- v.13 no.3
- /
- pp.136-141
- /
- 2012
One of the main reasons for serious road accidents is driving while drowsy. For this reason, drowsiness detection and warning system for drivers has recently become a very important issue. Monitoring physiological signals provides the possibility of detecting features of drowsiness and fatigue of drivers. One of the effective signals is to measure electroencephalogram (EEG) signals and electrooculogram (EOG) signals. The aim of this study is to extract drowsiness-related features from a set of EEG signals and to classify the features into three states: alertness, drowsiness, sleepiness. This paper proposes a neural-network-based drowsiness detection system using Linear Predictive Coding (LPC) coefficients as feature vectors and Multi-Layer Perceptron (MLP) as a classifier. Samples of EEG data from each predefined state were used to train the MLP program by using the proposed feature extraction algorithms. The trained MLP program was tested on unclassified EEG data and subsequently reviewed according to manual classification. The classification rate of the proposed system is over 96.5% for only very small number of samples (250ms, 64 samples). Therefore, it can be applied to real driving incident situation that can occur for a split second.
PDF KSCI

Time-Domain Quantization and Interpolation of Pitch Cycle Waveform

Kim, Moo-Young
- The Journal of the Acoustical Society of Korea
- /
- v.27 no.1E
- /
- pp.11-16
- /
- 2008
In this paper, a pitch cycle waveform (PCW) is extracted, quantized, and interpolated in a time domain to synthesize high-quality speech at low bit rates. The pre-alignment technique is proposed for the accurate and efficient PCW extraction, which predicts the current PCW position from the previous PCW position assuming that pitch periods evolve slowly. Since the pitch periods are different frame by frame, the original PCW is converted into the fixed-dimension PCW using the dimension-conversion method, and subsequently quantized by code-excited linear predictive (CELP) coding. The excitation signal for the linear predictive coding (LPC) synthesis filter is generated using the time-domain interpolation and interlink of the quantized PCW's. The coder operates at 4.2 kbit/s and 3.2 kbit/s depending on the pitch period. Informal listening test demonstrates the effectiveness of the proposed coding scheme.
PDF KSCI

Fault Diagnosis System of Rotating Machines Using LPC Residual Signal Energy (LPC 잔여신호의 에너지를 이용한 회전기기의 고장진단 시스템)

Lee, Sung-Sang;Cho, Sang-Jin;Chong, Ui-Pil
- Journal of the Institute of Convergence Signal Processing
- /
- v.6 no.3
- /
- pp.143-147
- /
- 2005
Monitoring and diagnosis of the operating machines are very important for safety operation and maintenance in the industrial fields. These machines are most rotating machines and the diagnosis of the machines has been researched for long time. We can easily see the faulted signal of the rotating machines from the changes of the signals in frequency. The Linear Predictive Coding(LPC) is introduced for signal analysis in frequency domain. In this paper, we propose fault detection and diagnosis method using the Linear Predictive Coding(LPC) and residual signal energy. We applied our method to the induction motors depending on various status of faulted condition and could obtain good results.
PDF

A Study on the Affinity Between Pairs of Korean Vowels Using the Dynamic Paremeters of Vocal Tract (성도의 다이내믹 피라미터에 의한 한글 모음간의 근사도에 관한 연구)

김중규;안수길
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.19 no.1
- /
- pp.1-8
- /
- 1982
Many researches on the parametric representation of speech ,signals using the adaptive linear prediction method have been studied for the past few years. In this paper, we used the LPC(Linear Predictive Coding)method to analyae the parameters of Korean vowels and by using those parameters we studied the affinity between every pair of Korean vowels. As a result of our study, it is found that each pair of Korean vowels that has a greater phonetic affinity also has a greater affinity of vocal tract parameters than other pairs.
PDF

Adaptive Linear Predictive Coding of Time-varying Images Using Multidimensional Recursive Least-squares Ladder Filters

Nam Man K.;Kim Woo Y.
- Journal of the military operations research society of Korea
- /
- v.13 no.1
- /
- pp.1-18
- /
- 1987
This paper presents several adaptive linear predictive coding techniques based upon extension of recursive ladder filters. A 2-D recursive ladder filter is extended to a 3-D case which can adaptively track the variation of both spatial and temporal changes of moving images. Using the 2-D/3-D ladder filter and a previous farme predictor, two types of adaptive predictor-control schemes are proposed in which the prediction error at each pel can be obtained at or close to a minimum level. We also investigate several modifications of the basic encoding methods. Performance of the 2-D/3-D ladder filters, their adaptive control schemes, and variations in coding methods are evaluated by computer simulations on a real sequence and compared to the results of motion compensation and frame differential coders. As a validity test of the ladder filters developed, the error signals for the different predictors are compared and evaluated.
PDF

Improvement of the Linear Predictive Coding with Windowed Autocorrelation (윈도우가 적용된 자기상관에 의한 선형예측부호의 개선)

Lee, Chang-Young;Lee, Chai-Bong
- The Journal of the Korea institute of electronic communication sciences
- /
- v.6 no.2
- /
- pp.186-192
- /
- 2011
In this paper, we propose a new procedure for improvement of the linear predictive coding. To reduce the error power incurred by the coding, we interchanged the order of the two procedures of windowing on the signal and linear prediction. This scheme corresponds to LPC extraction with windowed autocorrelation. The proposed method requires more calculational time because it necessitates matrix inversion on more parameters than the conventional technique where an efficient Levinson-Durbin recursive procedure is applicable with smaller parameters. Experimental test over various speech phonemes showed, however, that our procedure yields about 5 % less power distortion compared to the conventional technique. Consequently, the proposed method in this paper is thought to be preferable to the conventional technique as far as the fidelity is concerned. In a separate study of speaker-dependent speech recognition test for 50 isolated words pronounced by 40 people, our approach yielded better performance too.
https://doi.org/10.13067/JKIECS.2011.6.2.186 인용 PDF KSCI

Quantization of LPC Coefficients Using a Multi-frame AR-model (Multi-frame AR model을 이용한 LPC 계수 양자화)

Jung, Won-Jin;Kim, Moo-Young
- The Journal of the Acoustical Society of Korea
- /
- v.31 no.2
- /
- pp.93-99
- /
- 2012
For speech coding, a vocal tract is modeled using Linear Predictive Coding (LPC) coefficients. The LPC coefficients are typically transformed to Line Spectral Frequency (LSF) parameters which are advantageous for linear interpolation and quantization. If multidimensional LSF data are quantized directly using Vector-Quantization (VQ), high rate-distortion performance can be obtained by fully utilizing intra-frame correlation. In practice, since this direct VQ system cannot be used due to high computational complexity and memory requirement, Split VQ (SVQ) is used where a multidimensional vector is split into multilple sub-vectors for quantization. The LSF parameters also have high inter-frame correlation, and thus Predictive SVQ (PSVQ) is utilized. PSVQ provides better rate-distortion performance than SVQ. In this paper, to implement the optimal predictors in PSVQ for voice storage devices, we propose Multi-Frame AR-model based SVQ (MF-AR-SVQ) that considers the inter-frame correlations with multiple previous frames. Compared with conventional PSVQ, the proposed MF-AR-SVQ provides 1 bit gain in terms of spectral distortion without significant increase in complexity and memory requirement.
https://doi.org/10.7776/ASK.2012.31.2.093 인용 PDF KSCI

Search Result 71, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)