• 제목/요약/키워드: linear predictive coding

검색결과 71건 처리시간 0.03초

NEC 7720 DSP를 이용한 SBC codec의 실시간 구현 (Real-Time Implementation of a SBC Codec Using a NEC 7720 DSP)

  • 오수환;이상욱
    • 대한전자공학회논문지
    • /
    • 제23권4호
    • /
    • pp.429-438
    • /
    • 1986
  • In this paper we have designed and implemented a real-time, full-duplex SBC (sub-band coding) codec at 16kbps using a high speed digital signal processor, NEC 7720. The SBC codec employs a QMF(quadrature mirror filter) filter bank based on the tree structures of two-band analysis-synthesis pairs to partition speech signal into 4 octabe bands. Computer simulation has been done to investigate the effect of fixed-point computation of the NEC 7720. Three different performance measures, the conventional signal-to-noise ratio, the informal listening test, and an LPC(linear predictive coding)distance measure, have been used in this simulation. The necessary parameters have been optimized through the simulation. The developed hardware and software have been tested in real-time operation using a hardware emulator.

  • PDF

전이구간 부호화를 이용한 2.4 kbit/s 다중모드 음성 부호화 방법 (Method of a Multi-mode Low Rate Speech Coder Using a Transient Coding at the Rate of 2.4 kbit/s)

  • 안영욱;김종학;이인성;권오주;배문관
    • 대한전자공학회논문지SP
    • /
    • 제42권2호
    • /
    • pp.131-142
    • /
    • 2005
  • 현재 개발된 4 kbit/s이하의 저 전송율 음성부호화 시스템은 STC(Sinusoidal Transform Coding)나 MBE (Multi-band Excitation Coding)에 바탕을 두고 있다. 이러한 저 전송율 부호화기들은 대표적인 전이구간 신호인 유성음의 시작점과 끝점에서의 혼합신호(onset signal, offset signal), 비주기적인 신호(non-period signal) 등은 정확히 표현하지 못하기 때문에 자연스런 음질을 만들어 내지 못한다. 본 논문에서는 유성음에는 하모닉 모델, 무성음에서는 스토케스틱 모델, 전이구간에는 하모닉 기반의 비주기적인 펄스의 위치를 추적하는 방식을 사용하여 효과적으로 전이구간을 모델링 하는 방법과 2.4 kbit/s 다중모드 부호화방법을 제안한다. 제안한 방법은 원본신호에서 선형예측 부호화 방법으로 추출된 잔여신호를 신호의 성격에 따라 모델을 달리하는 방법이며, 자각의 신호의 성격에 따라 좋은 성능을 나타내는 모델을 사용하였다. 또한 효율적인 전이구간 모델링 방법의 도입으로 저 전송율에서 CELP(Code Excitation Linear Predictive) 부호화 방식에 의해 시간축에서 합성되는 여기신호와 선형위상을 이용한 하모닉 부호화 방식에 의해 주파수축에서 합성되는 여기신호를 효율적으로 결합이 가능하다는 것이 제안된 2.4 kbit/s 다중모드 부호화기의 장점이다. 제안된 방법의 2.4kbit/s 다중모드 부호화기는 미국 연방 표준부호화기인 2.4 kbit/s MELP(Mixed Excitation Linear Prediction) 부호화기보다 더 좋은 성능을 나타낸다.

자기 상관감법에 의한 잡음음성의 개선된 LPC 해석 (Improving LPC Analysis of Noisy Speech by Autocorrelation Subtraction Method)

  • 은종관;최기영
    • 한국음향학회지
    • /
    • 제1권1호
    • /
    • pp.45-53
    • /
    • 1982
  • A robust linear predictive coding method that can be used in noisy as well as quiet environment has been studied. In this method, noise autocorrelation coeffieients are first obtained and updated during nonspeech periods. Then, the effect of additive noise in the input speech is removed by subtracting values of the noise autocorrelation coefficients of corrupted speech in the course of computation of linear prediction coefficients. When signal-to-noise ratio of the input speech ranges from 0 to 10 dB, a performance improvement of about 5 dB can be gained by using this method. The proposed method is computationally very efficient and requires a small storage area.

  • PDF

G.723.1 보코더에서 주파수 간격 정보조절을 통한 계산량 감소에 관한 연구 (A Study on Reduction of Computation Time through Adjustment the Frequency Interval Information in the G.723.1 Vocoder)

  • 민소연;김영규;배명진
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 하계종합학술대회 논문집(4)
    • /
    • pp.405-408
    • /
    • 2002
  • LSP(Line Spectrum Pairs) Parameter is used for speech analysis in vocoders or recognizers since it has advantages of constant spectrum sensitivity. low spectrum distortion and easy linear interpolation. However the method of transforming LPC(Linear Predictive Coding) into LSP is so complex that it takes much time to compute. Among conventional methods, the real root method is considerably simpler than others, but nevertheless, it still suffers from its jndeterministic computation time because the root searching is processed sequentially in frequency region. We suggest a method of reducing the LSP transformation time using voice characteristics The proposed method is to apply search order and interval differently according to the distribution of LSP parameters. in comparison with the conventional real root method, the proposed method results in about 46.5% reduction. And, the total computation time is reduce to about 5% in the G.723.1 vocoder.

  • PDF

음성 부호화기에서 불균등 간격조절을 통한 계산량 단축법 (A Reduction Method of Computational Complexity through Adjustment the Non-Uniform Interval in the Vocoder)

  • 전우진
    • 한국산학기술학회:학술대회논문집
    • /
    • 한국산학기술학회 2010년도 춘계학술발표논문집 1부
    • /
    • pp.277-280
    • /
    • 2010
  • LSP(Line Spectrum Pairs) Parameter is used for speech analysis in vocoders or recognizers since it has advantages of constant spectrum sensitivity, low spectrum distortion and easy linear interpolation. However the method of transforming LPC(Linear Predictive Coding) into LSP is so complex that it takes much time to compute. Among conventional methods, the real root method is considerably simpler than others, but nevertheless, it still suffers from its indeterministic computation time because the root searching is processed sequentially in frequency region. We suggest a method of reducing the LSP transformation time using voice characteristics.

  • PDF

ZCR과 PARCOR 계수를 이용한 숫자음성 인식 (Spoken digit recognition Using the ZCR and PARCOR Coefficient)

  • 김학윤
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1985년도 학술발표회 논문집
    • /
    • pp.75-78
    • /
    • 1985
  • 본 연구는 시간 영역의 parament를 이용하여 한국어 숫자음(영, 일, 이, 삼, 사, 오, 육, 칠, 팔, 구)을 인식했다. 입력 음성 신호 X(n)의 Beginning Point와 Ending point를 ZCR(Zero-crossing Rate), Magnitude, Energy, Autocorrelation을 이용 Beginning point와 Ending point를 구하고 자음부의 인식은 위 계수들을 이용하여 행했다. 또, 유성음 부분에서는 PARCOR(Partial Autocorrelation), LPC(Linear Predictive Coding)를 이용 모음부와 유성자음을 인식하여 모음을 6개 부류(ㅏ, ㅑ, ㅗ, ㅜ, ㅠ, ㅣ)로 구분 인식했다. 이 방법에 의하면 입력 음성 신호 X(n)의 B.P(Beginning Point)와 E.P(Ending Point)를 쉽게 추출 가능하며 또한 각 Parameter를 이용하여 94.4%의 인식율을 얻었다.

  • PDF

자유표면에서의 수중함 심도제어 시스템 성능 개선 (Performance Enhancement of Auto-Depth Control System for Submersed Body in Near Surface Environment)

  • 이석필;윤형식;박상희
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1991년도 한국자동제어학술회의논문집(국내학술편); KOEX, Seoul; 22-24 Oct. 1991
    • /
    • pp.637-641
    • /
    • 1991
  • One of the most difficult problems in depth control for underwater vehicle is the effect of seaway disturbance. When a underwater vehicle operates in a near surface environment, the seaway generates essentially two types of stochastic disturbances that influence the boat notion. One component of the seaway forces is of large magnitude with a relatively narrow-band, first order component. The other component is generally of somewhat smaller magnitude, second order component. Since the magnitude of the first order component is generally such greater than the compensating force that can be generating by the planes, it is undesirable for the controller to generate a control command. In this paper, we used LPC(Linear Predictive Coding) processing to uncontrollable seaway disturbance. This method can be used extensively in sensor signal processing of underwater vehicles.

  • PDF

IC 설계용 집적형 캐드 시스템의 구현 (An Implementation of integrated CAD system of IC design)

  • 공진흥;김성중;김재협
    • 전자공학회논문지A
    • /
    • 제30A권1호
    • /
    • pp.73-85
    • /
    • 1993
  • This paper presents a design and implementation of CAD(Computer-Aided Design) system with tools and design environments for IC(Intergrated Circuits)design. The CAD system can be easily installed in various sites with limited resources, since most CAD tools and design environments are available in the public-domain and Unix & X Window-based PC-386 and Workstation is used for the hardware platform. In order to improve the flexibility of the CAD system, objects are defined in the context of tools and environments` and object tables are programmed to describe the integration of CAD tools and design environments. During the execution, tool-objects deal with intertool communication and round-robin mechanism to incrementally control the execution of CAD tools. The IC design of LPC(Linear Predictive Coding) Speech Synthesizer is carried out to find out improvements and bugs of the CAD system.

  • PDF

An Implementatin of a Multi-Channel Speech Surveillance System Over Telephone Lines

  • Kim, Sung-Soo
    • The Journal of the Acoustical Society of Korea
    • /
    • 제17권4E호
    • /
    • pp.17-21
    • /
    • 1998
  • This paper presents an implementation of a multi-channel speech surveillance system over telephone lines using TMS320C31 DSP chips. The incoming speech into each telephone line are first compressed simultaneously in real-time by the popular vector-sum excited linear predictive (VSELP) speech coding algorithm at the rate of 8 Kbps. The compressed steech bit streams are then multiplexed with those of other users. The multiplexed speech bit streams are transferred to the system storage equipments with some other required information so that a system operator can later monitor the stored speech data whenever it is necessary. The host program runs under Microsoft Windows95 for an efficient man-machine interface and a future upgrade-ability. We have confirmed that the overall 64-channel system operates satisfactorily in realtime. We also have checked approximately up to 2,880 total hours of recording capability of the system on a playback module and two removable backup drives.

  • PDF

Hidden LMS 적응 필터링 알고리즘을 이용한 경쟁학습 화자검증 (Speaker Verification Using Hidden LMS Adaptive Filtering Algorithm and Competitive Learning Neural Network)

  • 조성원;김재민
    • 대한전기학회논문지:시스템및제어부문D
    • /
    • 제51권2호
    • /
    • pp.69-77
    • /
    • 2002
  • Speaker verification can be classified in two categories, text-dependent speaker verification and text-independent speaker verification. In this paper, we discuss text-dependent speaker verification. Text-dependent speaker verification system determines whether the sound characteristics of the speaker are equal to those of the specific person or not. In this paper we obtain the speaker data using a sound card in various noisy conditions, apply a new Hidden LMS (Least Mean Square) adaptive algorithm to it, and extract LPC (Linear Predictive Coding)-cepstrum coefficients as feature vectors. Finally, we use a competitive learning neural network for speaker verification. The proposed hidden LMS adaptive filter using a neural network reduces noise and enhances features in various noisy conditions. We construct a separate neural network for each speaker, which makes it unnecessary to train the whole network for a new added speaker and makes the system expansion easy. We experimentally prove that the proposed method improves the speaker verification performance.