• Title/Summary/Keyword: 오디오 부호화

Search Result 144, Processing Time 0.027 seconds

Enhancement of SBR for Speech Signal Using Adaptive Noise Floor Level (가변 잡음 레벨을 이용한 음성신호에 대한 SBR 성능 항상 기술)

  • Lee, Se-Won;Oh, Seoung-Jun;Ahn, Chang-Beom;Lee, Tae-Jin;Kang, Kyoung-Ok;Park, Ho-Chong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.2
    • /
    • pp.148-154
    • /
    • 2009
  • In audio coding, SBR technology synthesizes the high-bands using patched time-frequency information from low-bands and the correction parameters, Since SBR transmits only correction parameters for high-bands, it provides a low-rate coding of high-bands, and is used as a core module of MPEG-4 HE-AAC, SBR was originally designed for audio signal and its performance for speech signal tends to decrease, and the major reason is an excessive noise floor in high-bands which is caused by incorrect tonality computation, In this paper, a new method to determine noise floor level in an adaptive fashion according to the speech characteristics is proposed in order to solve the problem of SBR for speech signal, The proposed method maintains the compatibility with the standard SBR, and the subjective performance evaluation shows that the proposed method improves the SBR performance especially for male speech signal compared with the standard SBR.

Application of Turbo Code for Digital Audio Broadcasting (DAB) System (디지털 오디오 방송을 위한 터보부호의 응용)

  • 김한종
    • The Journal of Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.13 no.2
    • /
    • pp.176-187
    • /
    • 2002
  • The digital Audio Broadcasting (DAB) system adopts Coded OFDM(COFDM) for channel coding. The COFDM is a combined technique of multicarrier transmission(OFDM) and punctured convolutional coding with viterbi error correction. Because the channel coding is an important topic for OFDM systems, this paper proposes a new turbo coded OFDM system that replaces the existing RCPC codec by a turbo codec without modifying the puncturing procedure and puncturing vectors defined in the standard DAB system for compatibility. The performance of a new system is compared to that of the conventional system under the frequency selective Rician fading channel and the frequency selective Rayleigh fading channel in conjunction with DAB transmission mode I suitable for the terrestrial single frequency network(SFN) broadcasting. The standard system's performance was improved with the aid of turbo codec.

Audio High-Band Coding based on Autoencoder with Side Information (부가 정보를 이용하는 오토 인코더 기반의 오디오 고대역 부호화 기술)

  • Cho, Hyo-Jin;Shin, Seong-Hyeon;Beack, Seung Kwon;Lee, Taejin;Park, Hochong
    • Journal of Broadcast Engineering
    • /
    • v.24 no.3
    • /
    • pp.387-394
    • /
    • 2019
  • In this study, a new method of audio high-band coding based on autoencoder with side information is proposed. The proposed method operates in the MDCT domain, and improves the performance by using additional side information consisting of the previous and current low bands, which is different from the conventional autoencoder that only inputs information to be encoded. Moreover, the side information in a time-frequency domain enables the high-band coder to utilize temporal characteristics of the signal. In the proposed method, the encoder transmits a 4-dimensional latent vector computed by the autoencoder and a gain variable using 12 bits for each frame. The decoder reconstructs the high band by applying the decoded low bands in the previous and current frames and the transmitted information to the autoencoder. Subjective evaluation confirms that the proposed method provides equivalent performance to the SBR at approximately half the bit rate of the SBR.

A Study on the Variable Transmission of xHE-AAC Audio Frame (xHE-AAC 오디오 프레임의 가변 전송에 관한 연구)

  • Lee, Bongho;Yang, Kyutae;Lim, Hyoungsoo;Hur, Namho
    • Journal of Broadcast Engineering
    • /
    • v.21 no.3
    • /
    • pp.357-368
    • /
    • 2016
  • In DAB+, HE-AAC v2 codec is applied for the fixed rate transmission of audio stream. In case that xHE-AAC codec including USAC, a more efficiency is expected when the variable frame is used in a given same bandwidth compared to the fixed frame transmission. For this to be realized, audio streams need to be multiplexed in a sub-channel before transmission, then a method is required to identify the border of each audio frames. In this paper, the toggled sync byte and additional identification field being sequentially placed between AU borders are proposed in order to deal with the AU border identification. In addition, the Reed-Solomon based error correction code which is compliant to DAB+ is proposed.

Quality Assessment and Predistortion Evaluation of the Multi-channel Audio Codec according to the bitrate changing (압축율 변화에 따른 멀티채널 오디오의 품질 및 Predistortion 의 영향 평가)

  • Cha, Kyung-Hwan;Jang, Dae-Young;Kim, Sung-Han;Kim, Chun-Duck
    • The Journal of the Acoustical Society of Korea
    • /
    • v.15 no.2
    • /
    • pp.55-60
    • /
    • 1996
  • This paper describes the subjective assessment of the multi-channel audio quality according to the bitrate changing and evaluates the predistortion effect to avoid the unmasked noise after matrixing/dematrxing process in transmission and regeneration of the multi-channel audio. The simulation is processed by the perceptual coding that is MPEG-2 Audio layer II algorithm. We evaluate the quality improvement about predistortion using or not by 384, 320, 256, 128kbps. As the result of the double blind subjective assessment, 5 Grade-Impairment Scale is scored under minus one to 320kbps and so audio quality is evaluated to be perceptible, but not annoying in 3/2 channel. The effect of the predistortion is improved one level in 128kbps and especially speech test material I better improved than music test materials.

  • PDF

Frequency Band Selection Exited Linear Prediction Wideband Speech/Audio Coding Using SBR (SBR을 이용한 주파수 밴드선택 여기 선형예측 광대역 음성/오디오 부호화)

  • Jang, Sunghoon;Lee, Insung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.32 no.6
    • /
    • pp.556-562
    • /
    • 2013
  • This paper is aimed to improve performance of Band-Selection speech/audio Coder reconstucted band spectrum that is not sent by the comfort noise. To improve the performance, we use the Spectral Band Replication(SBR) technique instead of substitution of Comfort noise. To synthesize SBR signal, the SBR algorithm is referenced in selected signals and the spectrum synthesized by SBR is injected to non-selected band. Each sub-band spectrum has been energy-weighted by real audio signal. We propose the enhanced the Band-Selection Coder that utilizes synthesized SBR signal from selected signal instead of comfort noise.

Audio Coder Using Variable Subband Wavelet Filter (가변 대역분할 웨이블릿필터를 이용한 오디오 부호화기)

  • 김준성;강현철;변윤식
    • The Journal of the Acoustical Society of Korea
    • /
    • v.17 no.5
    • /
    • pp.57-62
    • /
    • 1998
  • 본 논문에서는 입력신호의 시변특성에 따라 분석 필터의 대역을 가변 시키는 필터 뱅크의 구조를 제안한다. 제안된 필터뱅크는 일반적으로 32개의 균일한 대역으로 나누어 임 계대역의 표현을 적절히 표현하지 못하는 Polyphase 필터의 단점을 극복하면서 시스템 설 계에 높은 계산량을 요구하는 QMF-tree 필터의 단점을 보완한다. 본 연구에서는 분할 대역 은 4개에서 26개의 대역으로 가변하고, 웨이블릿 필터중 Daubechies필터를 사용하였다. 제 안된 구조의 부호화기는 128kbps에서 MPEG-a오디오와 비슷한 수준의 CD 음질을 유지하 며, 연산량 비교결과는 PolyPhase filter를 이용한 MPEG보다 부호화, 복호화 과정을 합쳐 다양한 전송률과 음원에서 평균 19%의 감소를 얻었다.

  • PDF

Library Optimization of the MPEG-4 Audio HVXC Coder using TMS320C6701 DSP (TMS320C6701 DSP용 MPEG-4 오디오 HVXC 부호기의 최적화 라이브러리 개발)

  • Na, Hoon;Lee, Ji-Woong;Kang, Kyeong-Ok;Lim, Young-Kwon;Hong, Jin-Woo;Jeong, Dae-Gwon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1999.11b
    • /
    • pp.197-200
    • /
    • 1999
  • MPEG-4 오디오 부호기의 일부인 HVXC(Harmonic and Vector excitation Coding) 부호기는 음성의 무성음 구간에서는 CELP 코덱, 유성음 구간에서는 MBE 코덱을 이용하여 부호화하는 구조로서, 많은 연산량을 필요로 하여 범용DSP를 이용한 실시간 구현의 장애요소로 작용한다. 본 논문에서는 TMS320C6701 DSP를 이용하여 많은 연산 시간을 요하는 함수들에 대한 C언어 및 어셈블리 레벨의 최적화를 수행하여 HVXC 함수들의 실행시간을 단축하고 이를 라이브러리화 하여 실시간 구현에 이용가능 하도록 하였다.

  • PDF

Content-based music retrieval using temporal characteristics (Temporal 특성을 이용한 내용기반 음악 정보 검색)

  • Park Chuleui;Park Mansoo;Kim Sungtak;Kim Hoi-Rin;Kang Kyeongok
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.299-302
    • /
    • 2004
  • 본 논문에서는 내용 기반 음악 정보 검색에 음악의 temporal 특징을 이용한 검색 방법을 제안한다. 방송환경에 적용하기 위해 검색 범위를 드라마나 영화의 배경 음악으로 사용되는 OST 앨범으로 제한하였다. 오디오의 특징 벡터로써 UFCC(Mel Frequency Cepstral Coefficient)를 사용하였으며 이 특징 벡터를 이용하여 VQ(Vector Quantization)로 부호화한 codeword로 오디오 신호의 시변 특성을 표현한다. 본 논문에서는 제안한 음악의 temporal 특성을 반영한 codeword-sequence를 이용하는 방법을 pitch-histogram을 기반으로 하는 방법 및 MFCC codeword-histogram을 기반으로 하는 방법과 비교하고 성능 개선을 보여주었다.

  • PDF

Overview of MPEG Surround (MPEG Surround 표준화 동향 및 기술 분석)

  • Jang In-Seon;Beack Seung-Kwon;Seo Jeong-Il;Jang Dae-Young
    • Journal of Broadcast Engineering
    • /
    • v.11 no.2 s.31
    • /
    • pp.181-190
    • /
    • 2006
  • Technology for compressing low-bitrate multichannel audio coding should be developed owing to the increasing need of consumer for multichannel audio contents and services. To meet this requirement, MPEG has standardized MPEG Surround. In this paper, we introduce status on MPEG Surround standardization and analyze techniques adopted in the current MPEG Surround.