• Title/Summary/Keyword: Codec Model

Search Result 62, Processing Time 0.028 seconds

Transform Coding Based on Source Filter Model in the MDCT Domain

  • Sung, Jongmo;Ko, Yun-Ho
    • ETRI Journal
    • /
    • v.35 no.3
    • /
    • pp.542-545
    • /
    • 2013
  • State-of-the-art voice codecs have been developed to extend the input bandwidth to enhance quality while maintaining interoperability with a legacy codec. Most of them employ a modified discrete cosine transform (MDCT) for coding their extended band. We propose a source filter model-based coding algorithm of MDCT spectral coefficients, apply it to the ITU-T G.711.1 super wideband (SWB) extension codec, and subjectively test it to validate the model. A subjective test shows a better quality over the standardized SWB codec.

Side Information Extrapolation Using Motion-aligned Auto Regressive Model for Compressed Sensing based Wyner-Ziv Codec

  • Li, Ran;Gan, Zongliang;Cui, Ziguan;Wu, Minghu;Zhu, Xiuchang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.7 no.2
    • /
    • pp.366-385
    • /
    • 2013
  • In this paper, we propose a compressed sensing (CS) based Wyner-Ziv (WZ) codec using motion-aligned auto regressive model (MAAR) based side information (SI) extrapolation to improve the compression performance of low-delay distributed video coding (DVC). In the CS based WZ codec, the WZ frame is divided into small blocks and CS measurements of each block are acquired at the encoder, and a specific CS reconstruction algorithm is proposed to correct errors in the SI using CS measurements at the decoder. In order to generate high quality SI, a MAAR model is introduced to improve the inaccurate motion field in auto regressive (AR) model, and the Tikhonov regularization on MAAR coefficients and overlapped block based interpolation are performed to reduce block effects and errors from over-fitting. Simulation experiments show that our proposed CS based WZ codec associated with MAAR based SI generation achieves better results compared to other SI extrapolation methods.

Floating-Poing Quantization Error Analysis in Subband Codes System

  • Park, Kyu-Sik
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.1E
    • /
    • pp.41-48
    • /
    • 1997
  • The very purpose of subband codec is the attainment of data rate compression through the use of quantizer and optimum bit allocation for each decimated signal. Yet the question of floating-point quantization effects in subband codec has received scant attention. There has been no direct focus on the analysis of quantization errors, nor on design with quantization errors embedded explicitly in the criterion. This paper provides a rigorous theory for the modelling, analysis and optimum design of the general M-band subband codec in the presence of the floating-point quantization noise. The floating-point quantizers are embedded into the codec structure by its equivalent multiplicative noise model. We then decompose the analysis and synthesis subband filter banks of the codec into the polyphase form and construct an equivalent time-invariant structure to compute exact expression for the mean square quantization error in the reconstructed an equivalent time-invariant structure to compute exact expression for the mean square quantization error in the reconstructed output. The optimum design criteria of the subband codec is given to the design of the analysis/synthesis filter bank and the floating-point quantizer to minimize the output mean square error. Specific optimum design examples are developed with two types of filter of filter banks-orthonormal and biorthogonal filter bank, along with their perpormance analysis.

  • PDF

Selective Quantization Based on Band Property for Wideband Signal Codec (광대역 신호 압축기를 위한 주파수 대역 특성에 선택적인 양자화 방법)

  • 송재종;박호종;김무영;김도석;김정수
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.7
    • /
    • pp.76-82
    • /
    • 2001
  • In this paper, a novel quantization method for wideband signal codec with 7 kHz bandwidth is proposed. In the transform-based wideband signal codecs, the signal is transformed to frequency domain and the spectral coefficients in each frequency band are quantized based on human perceptual model, followed by Huffman coding. However, the property of each band varies with frequency, and the codec has poor performance when all bands are quantized with the same method. Therefore, a selective quantization method is proposed, which analyzes the band property and selects the quantization domain between frequency domain and time domain based on the quantization efficiency. It is confirmed that the proposed method has better performance than the quantizer of G722.1 codec.

  • PDF

The CODEC Performance Analysis of VoIP for QoS (QoS를 위한 인터넷전화의 CODEC 성능 분석)

  • Rha, Sung-Hun;Yoo, Jae-Duck
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.4 no.2
    • /
    • pp.93-100
    • /
    • 2009
  • As the Internet Protocol be widely/rapidly used in packet communication, common carriers are providing the multimedia service(Both direction real-time voice, video conference, remote educational etc.)on the Internet. Also the 070 VoIP (Voice over IP) service is provided by the carriers on the packer network. In order to offer VoIP service in Korea, common carrier has to acquire the optimum level for QoS(quality of service). In this paper, we study on CODEC quality to get a higher QoS for VoIP.

  • PDF

A 3D Audio Core-Codec Employing an Improved Buffer Control Method (향상된 버퍼 제어 방법을 사용한 3D 오디오 핵심 부호화기)

  • Kim, Rin Chul
    • Journal of Broadcast Engineering
    • /
    • v.25 no.2
    • /
    • pp.233-241
    • /
    • 2020
  • In this paper, a new buffer control method is proposed for improving the performance of the frequency domain part of the 3D audio (3DA) core codec. For the proposed buffer control method, we first combine the 3DA RM9 with the 3GPP AAC buffer control method which includes the psychoacoustic model and rate-distortion control process with the spectral hole avoidance algorithm. Then, we revise the 3GPP buffer control method so as to achieve a faithful bit allocation to the frames with higher activity. With the MUSHRA test, we prove that the proposed buffer control method demonstrates better performance than the 3DA RM9 and 3GPP AAC.

Design of FPGA Camera Module with AVB based Multi-viewer for Bus-safety (AVB 기반의 버스안전용 멀티뷰어의 FPGA 카메라모듈 설계)

  • Kim, Dong-jin;Shin, Wan-soo;Park, Jong-bae;Kang, Min-goo
    • Journal of Internet Computing and Services
    • /
    • v.17 no.4
    • /
    • pp.11-17
    • /
    • 2016
  • In this paper, we proposed a multi-viewer system with multiple HD cameras based AVB(Audio Video Bridge) ethernet cable using IP networking, and FPGA(Xilinx Zynq 702) for bus safety systems. This AVB (IEEE802.1BA) system can be designed for the low latency based on FPGA, and transmit real-time with HD video and audio signals in a vehicle network. The proposed multi-viewer platform can multiplex H.264 video signals from 4 wide-angle HD cameras with existed ethernet 1Gbps. and 2-wire 100Mbps cables. The design of Zynq 702 based low latency to H.264 AVC CODEC was proposed for the minimization of time-delay in the HD video transmission of car area network, too. And the performance of PSNR(Peak Signal-to-noise-ratio) was analyzed with the reference model JM for encoding and decoding results in H.264 AVC CODEC. These PSNR values can be confirmed according the theoretical and HW result from the signal of H.264 AVC CODEC based on Zynq 702 the multi-viewer with multiple cameras. As a result, proposed AVB multi-viewer platform with multiple cameras can be used for the surveillance of audio and video around a bus for the safety due to the low latency of H.264 AVC CODEC design.

The Implementation of Multi-Channel Audio Codec for Real-Time operation (실시간 처리를 위한 멀티채널 오디오 코덱의 구현)

  • Hong, Jin-Woo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.14 no.2E
    • /
    • pp.91-97
    • /
    • 1995
  • This paper describes the implementation of a multi-channel audio codec for HETV. This codec has the features of the 3/2-stereo plus low frequency enhancement, downward compatibility with the smaller number of channels, backward compatibility with the existing 2/0-stereo system(MPEG-1 audio), and multilingual capability. The encoder of this codec consists of 6-channel analog audio input part with the sampling rate of 48 kHz, 4-channel digital audio input part and three TMS320C40 /DSPs. The encoder implements multi-channel audio compression using a human perceptual psychoacoustic model, and has the bit rate reduction to 384 kbit/s without impairment of subjective quality. The decoder consists of 6-channel analog audio output part, 4-channel digital audio output part, and two TMS320C40 DSPs for a decoding procedure. The decoder analyzes the bit stream received with bit rate of 384 kbit/s from the encoder and reproduces the multi-channel audio signals for analog and digital outputs. The multi-processing of this audio codec using multiple DSPs is ensured by high speed transfer of date between DSPs through coordinating communication port activities with DMA coprocessors. Finally, some technical considerations are suggested to realize the problem of real-time operation, which are found out through the implementation of this codec using the MPEG-2 layer II sudio coding algorithm and the use of the hardware architecture with commercial multiple DSPs.

  • PDF

Suboptimal video coding for machines method based on selective activation of in-loop filter

  • Ayoung Kim;Eun-Vin An;Soon-heung Jung;Hyon-Gon Choo;Jeongil Seo;Kwang-deok Seo
    • ETRI Journal
    • /
    • v.46 no.3
    • /
    • pp.538-549
    • /
    • 2024
  • A conventional codec aims to increase the compression efficiency for transmission and storage while maintaining video quality. However, as the number of platforms using machine vision rapidly increases, a codec that increases the compression efficiency and maintains the accuracy of machine vision tasks must be devised. Hence, the Moving Picture Experts Group created a standardization process for video coding for machines (VCM) to reduce bitrates while maintaining the accuracy of machine vision tasks. In particular, in-loop filters have been developed for improving the subjective quality and machine vision task accuracy. However, the high computational complexity of in-loop filters limits the development of a high-performance VCM architecture. We analyze the effect of an in-loop filter on the VCM performance and propose a suboptimal VCM method based on the selective activation of in-loop filters. The proposed method reduces the computation time for video coding by approximately 5% when using the enhanced compression model and 2% when employing a Versatile Video Coding test model while maintaining the machine vision accuracy and compression efficiency of the VCM architecture.

Non-Intrusive Speech Quality Estimation of G.729 Codec using a Packet Loss Effect Model (G.729 코덱의 패킷 손실 영향 모델을 이용한 비 침입적 음질 예측 기법)

  • Lee, Min-Ki;Kang, Hong-Goo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.32 no.2
    • /
    • pp.157-166
    • /
    • 2013
  • This paper proposes a non-intrusive speech quality estimation method considering the effects of packet loss to perceptual quality. Packet loss is a major reason of quality degradation in a packet based speech communications network, whose effects are different according to the input speech characteristics or the performance of the embedded packet loss concealment (PLC) algorithm. For the quality estimation system that involves packet loss effects, we first observe the packet loss of G.729 codec which is one of narrowband codec in VoIP system. In order to quantify the lost packet affects, we design a classification algorithm only using speech parameters of G.729 decoder. Then, the degradation values of each class are iteratively selected that maximizes the correlation with the degradation PESQ-LQ scores, and total quality degradation is modeled by the weighted sum. From analyzing the correlation measures, we obtained correlation values of 0.8950 for the intrusive model and 0.8911 for the non-intrusive method.