• Title/Summary/Keyword: Pitch coding

Search Result 67, Processing Time 0.03 seconds

Tandemless Transcoding for AMR and EVRC Speech Coders (AMR과 EVRC 음성 부호화기간의 비탠덤 방식을 이용한 상호 부호화)

  • 이선일;유창동
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.6
    • /
    • pp.531-542
    • /
    • 2002
  • Novel tandemless transcoding method for AMR and EVRC speech coders is proposed in this paper. In contrast to conventional tandem method, the parameters which is used commonly in speech coder where CELP algorithm is adapted are directly transcoded. The proposed algorithm is composed of LSP transcoding, pitch delay transcoding, gains transcoding and fixed codebook vector transcoding Evaluation results show that the novel algorithm achieves better speech quality than tandem method and reduce computational complexity and delay.

On a Waveform Coding Technique Using Pitch Searching and 3-Level Clipping (피치검색기법과 3-Level Clipping을 이용한 음성 파형부호화법에 관한 연구)

  • 김규홍
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.08a
    • /
    • pp.127-130
    • /
    • 1998
  • 피치검색과 3-level clipping을 이용한 새로운 파형부호화법을 제안하고자 한다. 제안한 방법에서는 우선 피치를 검출하여 기준피치파형과 인근피치파형을 검색한다. 그 후 유사도 측정시 기준피치파형과 인근 피치파형에 대해 3-level clippng을 수행한다. 클리핑된 기준피치파형가 인근피치파형간의 유사도를 측정하여 유사성이 크다면 피치정보와 에너지 정보만을 전송하거나 저장하여 압축을 하고, 유사성이 적다면 인근피치파형을 압축을 하지 않고 저장한다. 그 후에 저장된 ?ㅇ을 기준 피치파형으로 재정의 하여 다시 반복적으로 압축을 수행한다. 압축된 음성신호를 다시 복원할 때에는 수신 또는 저장된 음성신호를 이용하여 PSOLA 방식으로 합성을 수행한다. 평균압축율이 약 65%일 경우에도, MOS 값이 4이상을 유지하였다.

  • PDF

The Development of Boiler Combustion Air Control Algorithm for Coal-Fired Power Plant (석탄화력발전소 보일러 연소용 공기 제어알고리즘의 개발)

  • Lim, Gun-Pyo;Lee, Heung-Ho
    • The Transactions of the Korean Institute of Electrical Engineers P
    • /
    • v.61 no.4
    • /
    • pp.153-160
    • /
    • 2012
  • This paper is written for the development of boiler combustion air control algorithm of coal-fired power plant by the steps of design, coding and test. The control algorithms were designed in the shape of cascade control for two parts of air master, forced draft fan pitch blade by standard function blocks. This control algorithms were coded to the control programs of distributed control systems under development. The simulator for coal-fired power plant was used in the test step and automatic control, sequence control and emergency stop tests were performed successfully like the tests of the actual power plant. The reliability will be obtained enough to apply to actual site if the total test has been completed in the state that all algorithms were linked mutually. It is expected that the project result will contribute to the safe operation of domestic power plant and the self-reliance of coal-fired power plant control technique.

A 4 kbps PSI-VSELP Speech Coding Algorithm (4 kbps PSI-VSELP 음성 부호화 알고리듬)

  • Choi, Yong-Soo;Kang, Hong-Goo;Park, Sang-Wook;Youn, Dae-Hee
    • The Journal of the Acoustical Society of Korea
    • /
    • v.15 no.6
    • /
    • pp.59-65
    • /
    • 1996
  • This paper proposes a 4 kbps PSI-VSELP(Pitch Synchronous Innovation-Vector Sum Excited Linear Prediction) speech coder which produces speech equivalent to that of the conventional 4.8 kbps VSELP. Since the 'half-rate' is differently defined from country to country, there may be a need to reduce the bit rate of conventional half-rate coder. To minimize the degradation of speech quality caused by bit-rate reduction, it is desirable to perform bit-allocation based on the carefull consideration of the effect of various transmission parameters. This paper adopts this analytical approach for bit-allocation at 4 kbps. To improve the quality of the VSELP coder at 4 kbps, basis vectors which play the most important role in the performance, are optimized by an iterative closed-loop training process and the PSI technique is employed in the VSELP performance, are optimized by an iterative closed-loop training process and the PSI technique is employed in the VSELP coder. To demonstrate the performance of the proposed speech coder, we peformed experiments under the noiseless and error free conditions. From experimental results, even though the proposed 4 kbps PSI-VSELP coder showed lower scores in the objective measure, higher scores in subjective measure was obtained compared with those of the conventional 4.8 kbps VSELp.

  • PDF

A Study on A Multi-Pulse Linear Predictive Filtering And Likelihood Ratio Test with Adaptive Threshold (멀티 펄스에 의한 선형 예측 필터링과 적응 임계값을 갖는 LRT의 연구)

  • Lee, Ki-Yong;Lee, Joo-Hun;Song, Iick-Ho;Ann, Sou-Guil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.10 no.1
    • /
    • pp.20-29
    • /
    • 1991
  • A fundamental assumption in conventional linear predictive coding (LPC) analysis procedure is that the input to an all-pole vocal tract filter is white process. In the case of periodic inputs, however, a pitch bias error is introduced into the conventional LP coefficient. Multi-pulse (MP) LP analysis can reduce this bias, provided that an estimate of the excitation is available. Since the prediction error of conventional LP analysis can be modeled as the sum of an MP excitation sequence and a random noise sequence, we can view extracting MP sequences from the prediction error as a classical detection and estimation problem. In this paper, we propose an algorithm in which the locations and amplitudes of the MP sequences are first obtained by applying a likelihood ratio test (LRT) to the prediction error, and LP coefficients free of pitch bias are then obtained from the MP sequences. To verify the performance enhancement, we iterate the above procedure with adaptive threshold at each step.

  • PDF

Efficient Harmonic-CELP Based Low Bit Rate Speech Coder (효율적인 하모닉-CELP 구조를 갖는 저 전송률 음성 부호화기)

  • 최용수;김경민;윤대희
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.5
    • /
    • pp.35-47
    • /
    • 2001
  • This paper describes an efficient harmonic-CELP speech coder by taking advantages of harmonic and CELP coders into account. According to frame voicing decision, the proposed harmonic-CELP coder adopts the RP-VSELP coder as a fast CELP in case of an unvoiced frame, or an improved harmonic coder in case of a voiced frame. The proposed coder has main features as follows: simple pitch detection, fast harmonic estimation, variable dimension harmonic vector quantization, perceptual weighting reflecting frequency resolution, fast harmonic synthesis, naturalness control using band voicing, and multi-mode. These features make the proposed coder require very low complexity, compared with HVXC coder To demonstrate the performance of the proposed coder, a 2.4 kbps coder has been implemented and compared with reference coders. From results of informal listening tests, the proposed coder showed good quality while requiring low delay and complexity.

  • PDF

A Study on Vocal Removal Scheme of SAOC Using Harmonic Information (하모닉 정보를 이용한 SAOC의 보컬 신호 제거 방법에 관한 연구)

  • Park, Ji-Hoon;Jang, Dae-Geun;Hahn, Min-Soo
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.10
    • /
    • pp.1171-1179
    • /
    • 2013
  • Interactive audio service provide with audio generating and editing functionality according to user's preference. A spatial audio object coding (SAOC) scheme is audio coding technology that can support the interactive audio service with relatively low bit-rate. However, when the SAOC scheme remove the specific one object such as vocal object signal for Karaoke mode, the scheme support poor quality because the removed vocal object remain in the SAOC-decoded background music. Thus, we propose a new SAOC vocal harmonic extranction and elimination technique to improve the background music quality in the Karaoke service. Namely, utilizing the harmonic information of the vocal object, we removed the harmonics of the vocal object remaining in the background music. As harmonic parameters, we utilize the pitch, MVF(maximum voiced frequency), and harmonic amplitude. To evaluate the performance of the proposed scheme, we perform the objective and subjective evaluation. As our experimental results, we can confirm that the background music quality is improved by the proposed scheme comparing with the SAOC scheme.

Improved Harmonic-CELP Speech Coder with Dual Bit-Rates(2.4/4.0 kbps) (이중 전송률(2.4/4.0 kbps)을 갖는 개선된 하모닉-CELP 음성부호화기)

  • 김경민;윤성완;최용수;박영철;윤대희;강태익
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.28 no.3C
    • /
    • pp.239-247
    • /
    • 2003
  • This paper presents a dual-rate (2.4/4.0 kbps) Improved Harmonic-CELP(IHC) speech coder based on the EHC(Efficient Harmonic-CELP) which was presented by the authors. The proposed IHC employs the harmonic coding for voiced and the CELP for unvoiced segments. In the IHC, an initial voiced/unvoiced estimate is obtained by the pitch gain and energy. Then, the final V/UV mode is decided by using the frame energy contour. A new harmonic estimation combining peak picking and delta adjustment provides a more reliable harmonic estimation than that in the EHC. In addition, a noise mixing scheme in conjunction with an improved band voicing measurement provides the naturalness of the synthesized speech. To demonstrate the performance of the proposed IHC coder, the coder has been implemented and compared with the 2.0/4.0 kbps HVXC(Harmonic excitation Vector Coding) standardized by MPEG-4. Results of subjective evaluation showed that the proposed IHC coder and produce better speech quality than the HVXC, with only 40% complexity of the HVXC.

Enhancement of SBR for Speech Signal Using Adaptive Noise Floor Level (가변 잡음 레벨을 이용한 음성신호에 대한 SBR 성능 항상 기술)

  • Lee, Se-Won;Oh, Seoung-Jun;Ahn, Chang-Beom;Lee, Tae-Jin;Kang, Kyoung-Ok;Park, Ho-Chong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.2
    • /
    • pp.148-154
    • /
    • 2009
  • In audio coding, SBR technology synthesizes the high-bands using patched time-frequency information from low-bands and the correction parameters, Since SBR transmits only correction parameters for high-bands, it provides a low-rate coding of high-bands, and is used as a core module of MPEG-4 HE-AAC, SBR was originally designed for audio signal and its performance for speech signal tends to decrease, and the major reason is an excessive noise floor in high-bands which is caused by incorrect tonality computation, In this paper, a new method to determine noise floor level in an adaptive fashion according to the speech characteristics is proposed in order to solve the problem of SBR for speech signal, The proposed method maintains the compatibility with the standard SBR, and the subjective performance evaluation shows that the proposed method improves the SBR performance especially for male speech signal compared with the standard SBR.

The Development of Boiler Furnace Pressure Control Algorithm and Distributed Control System for Coal-Fired Power Plant (석탄화력발전소 보일러 노내압력 제어알고리즘과 분산제어시스템의 개발)

  • Lim, Gun-Pyo;Hur, Kwang-Bum;Park, Doo-Yong;Lee, Heung-Ho
    • The Transactions of the Korean Institute of Electrical Engineers P
    • /
    • v.62 no.3
    • /
    • pp.117-126
    • /
    • 2013
  • This paper is written for the development and application of boiler furnace pressure control algorithm and distributed control system of coal-fired power plant by the steps of design, coding, simulation test, site installation and site commissioning test. The control algorithms were designed in the shape of cascade control for two parts of furnace pressure control and induced draft fan pitch blade by standard function blocks. This control algorithms were coded to the control programs of distributed control systems. The simulator for coal-fired power plant was used in the test step and automatic control, sequence control and emergency stop tests were performed successfully like the tests of the actual power plant. The reliability was obtained enough to be installed at the actual power plant and all of distributed control systems had been installed at power plant and all signals were connected mutually. Tests for reliability and safety of plant operation were completed successfully and power plant is being operated commercially. It is expected that the project result will contribute to the safe operation of domestic new and retrofit power plants, the self-reliance of coal-fired power plant control technique and overseas business for power plant.