• Title/Summary/Keyword: vocoder

Search Result 151, Processing Time 0.023 seconds

On a Performance Comparison of Pitch Search Algorithms with the Correlation Properties for the CELP Vocoder (상관관계 특성을 이용한 CELP 보코더의 피치검색시간 단축법의 비교)

  • 김대식
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06c
    • /
    • pp.188-194
    • /
    • 1994
  • Code excited linear prediction speech coders exhibit good performance at data rates as low as 4800bps. But the major drawback to CELP type coders is their large computational requirements. Therefore, in this paper a comparative performance study of three pitch searching algorithms for the CELP vocoder was conducted. For each of the algorithms, a standard pitch searching algorithm was used by the full pitch searching algorithm that was implimented in the QCELP vocoder. The algorithms used in this study is to reduce the pitch searching time 1) using the skip table, 2) using the symmetrical property of the autocorrelation , and 3) using the preprocessing autocorrelation, 4) using the positive autocorrelation, 5) using the preliminary pitch. Performance scores are presented for each of the five pitch searching algorithms based on computation speed and on pitch prediction error.

  • PDF

An Algorithm to Reduce the Pitch Computational Complexity Using Modified Delta Searching in G.723.1 Vocoder (CELP 보코더에서 델타 피치 검색 방법 개선에 대한 연구)

  • Min, So-Yeon;Bae, Myung-Jin
    • Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.165-172
    • /
    • 2004
  • In this paper, we propose the computational complexity reduction methods of delta pitch search that is used in G.723.1 vocoder. In order to decrease the computational complexity in delta pitch search the characteristic of proposed algorithms is as the following. First, scheme to reduce the computational complexity in delta pitch search uses NAMDF. Developed the second scheme is the skipping technique of lags in pitch searching by using the threshold value. By doing so, we can reduce the computational amount of pitch searching more than 64% with negligible quality degradation.

  • PDF

Improvement of Bit Rate applying the Speaking Rate and PSOLA Technique of Speech in CELP Vocoder (음성신호의 발성율과 PSOLA기법을 적용한 음성 보코더 전송률 개선에 관한 연구)

  • 장경아;서지호;배명진
    • Proceedings of the IEEK Conference
    • /
    • 2003.11a
    • /
    • pp.45-48
    • /
    • 2003
  • In general, speech coding methods are classified into the following three categories: the waveform coding, the source coding and the hybrid coding. Fast speaking is possible to encode with a few information compared with slow speaking rate. In case of speaking rate, low frequency band is more important than high frequency band while listening. Speech vocoding technique is developing to way with low bit rate and complexity and high sound quality. the CELP type of vocoder support very good sound quality with low bit rate but these vocoders don't consider about the speaking rate. When we consider speaking rate and encode the frame depending on the speaking rate, the bit rate is able to reduce the bit rate than the conventional vocoder. We propose the technique to estimate the speaking rate and applied PSOLA technique in case of the frame of slow speaking rate. As a result of simulation bit rate can be reduced about 300 bps.

  • PDF

On Improving the Quality of RELP Vocoder (RELP Vocoder의 음질 향상에 관한 연구)

  • 오성근;은종관
    • The Journal of the Acoustical Society of Korea
    • /
    • v.5 no.1
    • /
    • pp.11-16
    • /
    • 1986
  • 지금까지 알려진 여러 가지 음성부호화 방식들 중 4.8에서 9.6kbits/s 사이의 전송속도에서 제일 좋은 성능을 갖는 것은 Residual-Excited linear prediction 방식이다. RELP 부호화 방식은 전송속도가 낮을 때 합성음이 거칠거나 금속성의 잡음을 갖는 단점이 있다. 본 논문에서는 이러한 단점을 보완하여 음질을 개선하는 세가지의 방법들을 제안하며, 그들은 다음과 같다. 첫째는 여러개의 baseband를 이용한 spectral folding 방법이고, 둘째는 spectral folding 방법과 pulsed excitation 방법을 조합한 방법이며, 마 지막 방법은 여러개의 baseband를 사용한 spectral folding 방법과 pulsed excitation 방법을 조합한 방법 이다. 이 방법들을 사용하여 RELP vocoder의 음질을 많이 개선할 수 있으며, 9.6kbits/s 근처의 전송속 도에서 사용하기 위한 첫 번째 방법과 세 번째 방법은 spectral fording 이나 nonlinear distortion 방법 에서 문제가 되는 roughness 나 tonal noise를 거의 인지 할 수 없으며, 세 번째 방법이 첫 번째 방법보 다 우수하다. 두 번째 방법은 4.8 kbits/s 근처의 전송속도에 적합하며, 기존의 RELP 방식들에 비해 많 은 음질향상을 가져왔다. 제안한 세가지 방법들을 같은 조건에서 비교할 때 세 번째 방법이 가장 우수 하며, 이 경우 합성음은 원음과 거의 흡사하다.

  • PDF

Voice-Pishing Detection Algorithm Based on 3GPP2 SMV (3GPP2 SMV 기반의 보이스 피싱 검출 알고리즘)

  • Lee, Kye-Hwan;Chang, Joon-Hyuk
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.45 no.4
    • /
    • pp.92-99
    • /
    • 2008
  • We propose an effective voice-pishing detection algorithm based on the 3GPP2 selectable mode vocoder (SMV). The detection of voice pishing is performed based on a Gaussian mixture model (GMM) using decoding parameters of the SMV directly extracted from the decoding process of the transmitted speech information in the mobile phone. The experimental results indicate that SMV decoding parameters are effective in discriminating between general voice and phisher's voice and the performance is significantly acceptable when the proposed technique is applied.

A Study on a Analysis and Comparison of Preprocessing Technique for the Speech Compression (음성압축을 위한 전처리기법의 비교 분석에 관한 연구)

  • Jang, Kyung-A;Min, So-Yeon;Bae, Myung-Jin
    • Speech Sciences
    • /
    • v.10 no.4
    • /
    • pp.125-136
    • /
    • 2003
  • Speech coding techniques have been studied to reduce the complexity and bit rate but also to improve the sound quality. CELP type vocoder, has used as a one of standard, supports the great sound quality even low bit rate. In this paper, the preprocessing of input speech to reduce the bit rate is the different with the conventional vocoder. The different kinds of parameter are used for the preprocessing so this paper is compared with theses parameters for finding the more appropriate parameter for the vocoder. The parameters are used to synthesize the speech not to encode or decode for coding technique so we proposed the simple algorithm not to have the influence on the processing time or the computation time. The parameters in used the preprocessing step are speaking rate, duration and PSOLA technique.

  • PDF

Real-Time Implementation of Speech Vocoder For Video Telephony (화상 전화용 음성 보코더의 실시간 구현)

  • Nam, Il-Ryong;Seo, Sung-Dae;Nam, Hyun-Do
    • Proceedings of the KIEE Conference
    • /
    • 1998.07g
    • /
    • pp.2414-2416
    • /
    • 1998
  • This paper presents real-time implementation of speech vocoder for PSTN video telephony using ITU G.723 16Kbps ADPCM algorithm. The ADPCM encoder accepts 8-bit PCM compressed signals and expends it to a 14-bit-per-sample. The predicted values are subtracted from encoded signals to produce difference signals. Adaptive quantization is performed on the difference signal to produce a 2-bit, output for transmission over the channel. Computer simulations and experiments were performed to evaluate the performance of the speech vocoder.

  • PDF

On a Reduction of Pitch Searching Time by Preliminary Pitch in the CELP Vocoder

  • Bae, Seong-Gyun;Kim, Hyung-Rae;Kim, Dae-Sik;Bae, Myung-Jin
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06a
    • /
    • pp.1104-1111
    • /
    • 1994
  • Code Excited Linear Prediction(CELP) as a speech coder exhibits good performance at data rates below 4.8 kbps. The major drawback to CELP type coders is their large amount of computation. In this paper, we propose a new pitch search method that preserves the quality of the CELP vocoder with reduced complexity. The basic idea is to restrict the pitch searching range by estimating the preliminary pitches. Applying the proposed method to the CELP vocoder, we can get approximately 87% complexity reduction in the pitch search.

  • PDF

Real-time Implementation of Variable Transmission Bit Rate Vocoder Integrating G.729A Vocoder and Reduction of the Computational Amount SOLA-B Algorithm Using the TMS320C5416 (TMS320C5416을 이용한 G.729A 보코더와 계산량 감소된 SOLA-B 알고리즘을 통합한 가변 전송율 보코더의 실시간 구현)

  • 함명규;배명진
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.40 no.6
    • /
    • pp.84-89
    • /
    • 2003
  • In this paper, we real-time implemented to the TMS320C5416 the vocoder of variable bit rate applied the SOLA-B algorithm by Henja to the ITU-T G.729A vocoder of 8kbps transmission rate. This proposed method using the SOLA-B algorithm is that it is reduced the duration of the speech in encoding and is played at the speed of normal by extending the duration of the speech in decoding. At this time, we bandied that the interval of cross correlation function if skipped every 3 sample for decreasing the computational amount of SOLA-B algorithm. The real-time implemented vocoder of C.729A and SOLA-B algorithm is represented the complexity of maximum that is 10.2MIPS in encoder and 2.8MIPS in decoder of 8kbps transmission rate. Also, it is represented the complexity of maximum that is 18.5MIPS in encoder and 13.1MIPS in decoder of 6kbps, it is 18.5MIPS in encoder and 13.1MIPS in decoder of 4kbps. The used memory is about program ROM 9.7kwords, table ROM 4.5kwords, RAM 5.1 kwords. The waveform of output is showed by the result of C simulator and Bit Exact. Also, for evaluation of speech quality of the vocoder of real-time implemented variable bit rate, it is estimated the MOS score of 3.69 in 4kbps.

On a Detection of V-UV Segments of Speech Spectrum for the MBE Coding (MBE 부호화용 스펙트럼 V-UV 구간 검출에 관한 연구)

  • 김을제
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1992.06a
    • /
    • pp.43-48
    • /
    • 1992
  • In the area of speech vocoder systems, the MBE vocoder allows the high quality and low bit rate. In the MBE parameters detection, the dicision methods of V/UV region proposed until now are dependent highly to the other parameters, fundamental frequency and formant information. In this paper, thus, we propose a new V/UV detection method that uses a zero-crossing rate of flatten harmonices spectrum. This method can reduce the influences of the other parameters for the V/UV regions detection.

  • PDF