• Title/Summary/Keyword: Scalable audio coding

Search Result 11, Processing Time 0.026 seconds

A Scalable Audio Coder for High-quality Speech and Audio Services

  • Lee, Gil-Ho;Lee, Young-Han;Kim, Hong-Kook;Kim, Do-Young;Lee, Mi-Suk
    • MALSORI
    • /
    • no.61
    • /
    • pp.75-86
    • /
    • 2007
  • In this paper, we propose a scalable audio coder, which has a variable bandwidth from the narrowband speech bandwidth to the audio bandwidth and also has a bit-rate from 8 to 320 kbits/s, in order to cope with the quality of service(QoS) according to the network load. First of all, the proposed scalable coder splits bandwidth of the input audio into narrowband up to around 4 kHz and above. Next, the narrowband signals are compressed by a speech coding method compatible to an existing standard speech coder such as G.729, and the other signals whose bandwidth is above the narrowband are compressed on the basis of a psychoacoustic model. It is shown from the objective quality tests using the signal-to-noise ratio(SNR) and the perceptual evaluation of audio quality(PEAQ) that the proposed scalable audio coder provides a comparable quality to the MPEG-1 Layer III (MP3) audio coder.

  • PDF

A survey of MPEG-4 scalable audio coding (MPEG-4 스케일러블 오디오 부호화 방식에 관한 관찰)

  • 김연배;박성희;서양석
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1999.06b
    • /
    • pp.137-142
    • /
    • 1999
  • 지금까지 알려진 오디오 부호화 방식중 실제로 스케일러블 부호화 방식은 없었다. MPEG-4에서는 스케일러블 부호화라는 새로운 기능을 중요한 요구사항으로 받아들여 개발하였다. 스케일러블 부호화(scalable coding) 기능이란 주어진 자원의 일부분을 가지고 의미 있는 정보를 재생할 수 있게 하는 부호화방식이다. 본 논문에서는 ISO MPEG-4에서 추진하고 있는 스케일러블 부호화방식인 large step scalable coding방식과 fine grain scalable coding방식에 대해 알아보고 그 응용분야에 대해 살펴본다.

  • PDF

Efficient Media Synchronization Mechanism for SVC Video Transport over IP Networks

  • Seo, Kwang-Deok;Jung, Soon-Heung;Kim, Jin-Soo
    • ETRI Journal
    • /
    • v.30 no.3
    • /
    • pp.441-450
    • /
    • 2008
  • The scalable extension of H.264, known as scalable video coding (SVC) has been the main focus of the Joint Video Team's work and was finalized at the end of 2007. Synchronization between media is an important aspect in the design of a scalable video streaming system. This paper proposes an efficient media synchronization mechanism for SVC video transport over IP networks. To support synchronization between video and audio bitstreams transported over IP networks, a real-time transport protocol/RTP control protocol (RTP/RTCP) suite is usually employed. To provide an efficient mechanism for media synchronization between SVC video and audio, we suggest an efficient RTP packetization mode for inter-layer synchronization within SVC video and propose a computationally efficient RTCP packet processing method for inter-media synchronization. By adopting the computationally simple RTCP packet processing, we do not need to process every RTCP sender report packet for inter-media synchronization. We demonstrate the effectiveness of the proposed mechanism by comparing its performance with that of the conventional method.

  • PDF

A Complexity Reduction Method of MPEG-4 Audio Lossless Coding Encoder by Using the Joint Coding Based on Cross Correlation of Residual (여기신호의 상관관계 기반 joint coding을 이용한 MPEG-4 audio lossless coding 인코더 복잡도 감소 방법)

  • Cho, Choong-Sang;Kim, Je-Woo;Choi, Byeong-Ho
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.47 no.3
    • /
    • pp.87-95
    • /
    • 2010
  • Portable multi-media products which can service the highest audio-quality by using lossless audio codec has been released and the international lossless codecs, MPEG-4 audio lossless coding(ALS) and MPEG-4 scalable lossless coding(SLS), were standardized by MPEG in 2006. The simple profile of MPEG-4 ALS, it supports up to stereo, was defined by MPEG in 2009. The lossless audio codec should have low-complexity in stereo to be widely used in portable multi-media products. But the previous researches of MPEG-4 ALS have focused on an improvement of compression ratio, a complexity reduction in multi-channels coding, and a selection of linear prediction coefficients(LPCs) order. In this paper, the complexity and compression ratio of MPEG-4 ALS encoder is analyzed in simple profile of MPEG-4 ALS, the method to reduce a complexity of MPEG-4 ALS encoder is proposed. Based on an analysis of complexity of MPEG-4 ALS encoder, the complexity of short-term prediction filter of MPEG-4 ALS encoder is reduced by using the low-complexity filter that is proposed in previous research to reduce the complexity of MPEG-4 ALS decoder. Also, we propose a joint coding decision method, it reduces the complexity and keeps the compression ratio of MPEG-4 ALS encoder. In proposed method, the operation of joint coding is decided based on the relation between cross-correlation of residual and compression ratio of joint coding. The performance of MPEG-4 ALS encoder that has the method and low-complexity filter is evaluated by using the MPEG-4 ALS conformance test file and normal music files. The complexity of MPEG-4 ALS encoder is reduced by about 24% by comparing with MPEG-4 ALS reference encoder, while the compression ratio by the proposed method is comparable to MPEG-4 ALS reference encoder.

Method of scalable video application in the advanced T-DMB (지상파 DMB 고도화 망에서의 스케일러블 비디오 부호화 기술)

  • Jun, Dong-San;Kwak, Sang-Min;Lim, Hyung-Soo;Choi, Hae-Chul;Kim, Jae-Gon;Lim, Jong-Soo;Hong, Jin-Woo
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.44 no.1
    • /
    • pp.1-9
    • /
    • 2007
  • Digital Multimedia Broadcasting is the next generation broadcasting service which enables various digital multimedia contents, i.e., audio and video, and data access for mobile users. However, due to the bandwidth limitation, the spatial resolution is limited to CIF(Common Interleaved Frame). The Advanced Terrestrial DMB (AT-DMB) secures additional bandwidth by adopting hierarchical modulation transmission technology and provides high data rate and quality for mobile multimedia broadcasting services with scalable video coding(SVC). This paper proposes scalable video coding technology for AT-DMB which enables high quality mobile multimedia broadcasting services that exceeds current DMB service's quality and contents capability.

Efficient Generation of Scalable Transport Stream for High Quality Service in T-DMB

  • Kim, Kwang-Yong;Lee, Gwang-Soon;Lim, Jong-Soo;Lee, Soo-In;Kim, Duk-Gyoo
    • ETRI Journal
    • /
    • v.31 no.1
    • /
    • pp.65-67
    • /
    • 2009
  • We introduce an advanced terrestrial digital multimedia broadcasting (AT-DMB) system that overcomes the limitation of data transmission rates of T-DMB by doubling it with the same frequency bandwidth. In this letter, we propose an efficient algorithm which generates a scalable transport stream in AT-DMB by multiplexing certain types of elementary streams encoded using scalable video coding and an MPEG-surround audio coder for high-quality multimedia services.

  • PDF

Studies on Joint Source/Channel Coding for MPEG-4 Scalable Video Transmission in Mobile Broadcast Receiving Environments (이동방송수신환경에서 MPEG-4 계층적 비디오 전송을 위한 결합 소스/채널 부호화에 관한 연구)

  • Lee Woon-Moon;Sohn Won
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.42 no.3 s.303
    • /
    • pp.31-40
    • /
    • 2005
  • In this paper, we develop an approach toward JSC(Joint Source-Channel Coding) method for MPEG-4 based FGS(Fine Granular Scalability) video coding and transmission in fixed and mobile receiving environment(Digital Audio Broadcasting, DAB). The source coder used MPEG-4 FGS video codec, the channel coder used RCPC(Rate Compatible Punctured Convolution) code and the modulation method used QPSK modulation. We have considered channel environment of AWGN and mobile receiving environment. This study determined optimum Trade-off point between source bit rate and channel coding rate in variable channel states. We compared FGS-JSC method and general single layer CBR(Constant Bit Rate) transmission. In this results, FGS-JSC was appeared better performance than CBR transmission.

H.263-Based Scalable Video Codec (H.263을 기반으로 한 확장 가능한 비디오 코덱)

  • 노경택
    • Journal of the Korea Society of Computer and Information
    • /
    • v.5 no.3
    • /
    • pp.29-32
    • /
    • 2000
  • Layered video coding schemes allow the video information to be transmitted in multiple video bitstreams to achieve scalability. they are attractive in theory for two reasons. First, they naturally allow for heterogeneity in networks and receivers in terms of client processing capability and network bandwidth. Second, they correspond to optimal utilization of available bandwidth when several video qualify levels are desired. In this paper we propose a scalable video codec architectures with motion estimation, which is suitable for real-time audio and video communication over packet networks. The coding algorithm is compatible with ITU-T recommendation H.263+ and includes various techniques to reduce complexity. Fast motion estimation is Performed at the H.263-compatible base layer and used at higher layers, and perceptual macroblock skipping is performed at all layers before motion estimation. Error propagation from packet loss is avoided by Periodically rebuilding a valid Predictor in Intra mode at each layer.

  • PDF

A New Wideband Speech/Audio Coder Interoperable with ITU-T G.729/G.729E (ITU-T G.729/G.729E와 호환성을 갖는 광대역 음성/오디오 부호화기)

  • Kim, Kyung-Tae;Lee, Min-Ki;Youn, Dae-Hee
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.45 no.2
    • /
    • pp.81-89
    • /
    • 2008
  • Wideband speech, characterized by a bandwidth of about 7 kHz (50-7000 Hz), provides a substantial quality improvement in terms of naturalness and intelligibility. Although higher data rates are required, it has extended its application to audio and video conferencing, high-quality multimedia communications in mobile links or packet-switched transmissions, and digital AM broadcasting. In this paper, we present a new bandwidth-scalable coder for wideband speech and audio signals. The proposed coder spits 8kHz signal bandwidth into two narrow bands, and different coding schemes are applied to each band. The lower-band signal is coded using the ITU-T G.729/G.729E coder, and the higher-band signal is compressed using a new algorithm based on the gammatone filter bank with an invertible auditory model. Due to the split-band architecture and completely independent coding schemes for each band, the output speech of the decoder can be selected to be a narrowband or wideband according to the channel condition. Subjective tests showed that, for wideband speech and audio signals, the proposed coder at 14.2/18 kbit/s produces superior quality to ITU-T 24 kbit/s G.722.1 with the shorter algorithmic delay.

Design of 8K Broadcasting System based on MMT over Heterogeneous Networks

  • Sohn, Yejin;Cho, Minju;Paik, Jongho
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.8
    • /
    • pp.4077-4091
    • /
    • 2017
  • This paper presents the design of a broadcasting scenario and system for an 8K-resolution content. Due to an 8K content is four times larger than the 4K content in terms of size, many technologies such as content acquisition, video coding, and transmission are required to deal with it. Therefore, high-quality video and audio for 8K (ultra-high definition television) service is not possible to be transmitted only using the current terrestrial broadcasting system. The proposed broadcasting system divides the 8K content into four 4K contents by area, and each area is hierarchically encoded by Scalable High-efficiency Video Coding (SHVC) into three layers: L0, L1, and L2. Every part of the 8K video content divided into areas and hierarchy is independently treated. These parts are transmitted over heterogeneous networks such as digital broadcasting and broadband networks after going through several processes of generating signal messages, encapsulation, and packetization based on MPEG media transport. We propose three methods of generating streams at the sending entity to merge the divided streams into the original content at the receiving entity. First, we design the composition information, which defines the presentation structure for displays. Second, a descriptor for content synchronization is included in the signal message. Finally, we define the rules for generating "packet_id" among the packet header fields and design the transmission scheduler to acquire the divided streams quickly. We implement the 8K broadcasting system by adapting the proposed methods and show that the 8K-resolution contents are stably received and serviced with a low delay.