• Title/Summary/Keyword: multichannel audio

Search Result 46, Processing Time 0.072 seconds

Design and Development of T-DMB Multichannel Audio Service System Based on Spatial Audio Coding

  • Lee, Yong-Ju;Seo, Jeong-Il;Beack, Seung-Kwon;Jang, Dae-Young;Kang, Kyeong-Ok;Kim, Jin-Woong;Hong, Jin-Woo
    • ETRI Journal
    • /
    • v.31 no.4
    • /
    • pp.365-375
    • /
    • 2009
  • In this paper, a terrestrial digital multimedia broadcasting (T-DMB) multichannel audio broadcasting system based on spatial audio coding is presented. The proposed system provides realistic multichannel audio service via T-DMB with a small increase of data rate as well as backward compatibility with the conventional stereo-based T-DMB player. To reduce the data rate for additional multichannel audio signals, we compress the multichannel audio signals using the sound source location cue coding algorithm, which is an efficient parametric multichannel audio compression technique. For compatibility, we use the dependent property of an elementary stream descriptor, and this property should be ignored in a conventional T-DMB player. To verify the feasibility of the proposed system, we implement the T-DMB multichannel audio encoder and a prototype player. We perform a compatibility test using the T-DMB multichannel audio encoder and conventional T-DMB players. The test demonstrates that the proposed system is compatible with a conventional T-DMB player and that it can provide a promisingly rich audio service.

Study on novel hierarchical parametric stereo coding method for Multichannel audio signal (멀티채널 오디오 신호의 계층적 코딩이 가능한 파라메트릭 스테레오 코딩 방법에 대한 연구)

  • Moon, Han-Gil
    • Proceedings of the IEEK Conference
    • /
    • 2008.06a
    • /
    • pp.875-876
    • /
    • 2008
  • Parametric stereo coding is a technique to efficiently code a stereo audio signal as a monaural signal plus small amount of parametric overhead to describe the stereo image. The stereo properties are analyzed, encoded, and reinstated in a decoder according to spatial psycho-acoustical principles. However, coding of multichannel audio signal using parametric stereo still requires considerable bit-rate. In this paper, enhanced parametric stereo coding for multichannel audio signal is proposed.

  • PDF

Acoustic Event Detection in Multichannel Audio Using Gated Recurrent Neural Networks with High-Resolution Spectral Features

  • Kim, Hyoung-Gook;Kim, Jin Young
    • ETRI Journal
    • /
    • v.39 no.6
    • /
    • pp.832-840
    • /
    • 2017
  • Recently, deep recurrent neural networks have achieved great success in various machine learning tasks, and have also been applied for sound event detection. The detection of temporally overlapping sound events in realistic environments is much more challenging than in monophonic detection problems. In this paper, we present an approach to improve the accuracy of polyphonic sound event detection in multichannel audio based on gated recurrent neural networks in combination with auditory spectral features. In the proposed method, human hearing perception-based spatial and spectral-domain noise-reduced harmonic features are extracted from multichannel audio and used as high-resolution spectral inputs to train gated recurrent neural networks. This provides a fast and stable convergence rate compared to long short-term memory recurrent neural networks. Our evaluation reveals that the proposed method outperforms the conventional approaches.

Low-bitrate Multichannel Audio Coding (저비트율 멀티채널 오디오 부호화)

  • Jang, Inseon;Seo, Jeongil;Beak, Seungkwon;Kang, Kyeongok
    • Journal of Broadcast Engineering
    • /
    • v.10 no.3
    • /
    • pp.328-338
    • /
    • 2005
  • Technology for compressing low-bitrate multichannel audio coding is being standardized owing to the increasing need of consumer for multichannel audio contents. In this paper we propose the sound source location cue coding (SSLCC) for extremely compressing multichannel audio to be suitable at the narrow bandwidth transmission environment. To improve the compression capability of the conventional binaural cue coding(BCC), the SSLCC adopts the virtual source location information (VSLI) as a spatial cue parameter, a symmetric uniform quantizer, and Huffman coder. The objective and subjective assessment results show that the SSLCC provides lower bitrate and better audio quality than conventional BCC method.

Audio Source Separation Method based on Beamspace-domain Multichannel Non-negative Matrix Factorization, Part II: A Study on the Beamspace Transform Algorithms (빔공간-영역 다채널 비음수 행렬 분해 알고리즘을 이용한 음원 분리 기법 Part II: 빔공간-변환 기법에 대한 고찰)

  • Lee, Seok-Jin;Park, Sang-Ha;Sung, Koeng-Mo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.31 no.5
    • /
    • pp.332-339
    • /
    • 2012
  • Beamspace transform algorithm transforms spatial-domain data - such as x, y, z dimension - into incidence-angle-domain data, which is called beamspace-domain data. The beamspace transform method is generally used in source localization and tracking, and adaptive beamforming problem. When the beamspace transform method is used in multichannel audio source separation, the inverse beamspace transform is also important because the source image have to be reconstructed. This paper studies the beamspace transform and inverse transform algorithms for multichannel audio source separation system, especially for the beamspace-domain multichannel NMF algorithm.

An efficient method of spatial cues and compensation method of spectrums on multichannel spatial audio coding (멀티채널 Spatial Audio Coding에서의 효율적인 Spatial Cues 사용과 그에 따른 Spectrum 보상방법)

  • Lee, Byong-Hwa;Beack, Seung-Kwon;Seo, Jeong-Gil;Han, Min-Soo
    • MALSORI
    • /
    • no.53
    • /
    • pp.157-169
    • /
    • 2005
  • This paper proposes an efficiently representing method of spatial cues on multichannel spatial audio coding. The Binaural Cue Coding (BCC) method introduced recently represents multichannel audio signals by means of Inter Channel Level Difference (ICLD) or Source Index (SI). We tried to express more efficiently ICLD and SI information based on Inter Channel Correlation in this paper. We adopt different spatial cues according to ICC and propose a compensation method of empty spectrums created by using SI. We performed a MOS test and measuring spectral distortion. The results show that the proposed method can reduce the bitrate of side information without large degradation of the audio quality.

  • PDF

Design and Implementation of a CX23880 based PCI Multichannel Video/Audio Capture Device (CX23880 기반 PCI 다채널 비디오/오디오 캡쳐 장치 설계 및 구현)

  • 백승걸;홍진기;정선태
    • Proceedings of the IEEK Conference
    • /
    • 2003.07e
    • /
    • pp.2148-2151
    • /
    • 2003
  • In this paper, we present our experiences in designing and implementing a CX23880 based multichannel video/audio capture device. We try to clarify differences between CX2388x family and 878A, the previous version of Cx2388x, and what one needs to be careful about in developing device drivers for CX2388x based video/audio devices. Our work is expected to help one who will need to develop Cx2388x based video/audio device later.

  • PDF

Audio Source Separation Method Based on Beamspace-domain Multichannel Non-negative Matrix Factorization, Part I: Beamspace-domain Multichannel Non-negative Matrix Factorization system (빔공간-영역 다채널 비음수 행렬 분해 알고리즘을 이용한 음원 분리 기법 Part I: 빔공간-영역 다채널 비음수 행렬 분해 시스템)

  • Lee, Seok-Jin;Park, Sang-Ha;Sung, Koeng-Mo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.31 no.5
    • /
    • pp.317-331
    • /
    • 2012
  • In this paper, we develop a multichannel blind source separation algorithm based on a beamspace transform and the multichannel non-negative matrix factorization (NMF) method. The NMF algorithm is a famous algorithm which is used to solve the source separation problems. In this paper, we consider a beamspace-time-frequency domain data model for multichannel NMF method, and enhance the conventional method using a beamspace transform. Our decomposition algorithm is applied to audio source separation, using a dataset from the international Signal Separation Evaluation Campaign 2010 (SiSEC 2010) for evaluation.

Overview of MPEG 3D Audio Standard Activities for High-Order Multichannel Realistic Audio Service (고차 다채널 실감 오디오 서비스를 위한 MPEG 3D Audio 표준화 동향)

  • Seo, Jeongil;Kang, Kyeongok;Jeong, Dae-Gwon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2012.07a
    • /
    • pp.171-173
    • /
    • 2012
  • 본 논문에서는 최근 MPEG 오디오 서브그룹에서 활발히 논의 중인 3D Audio 표준화 동향에 대해서 소개하고, 관련한 국내외 기관들의 기술개발 현황에 대해서 알아본다. MPEG 3D Audio 는 NHK 22.2 채널방송과 같은 실감 오디오 서비스를 고다채널(High-Order Multichannel)로 특징짓고, 이러한 서비스를 위한 다채널 오디오 부호화 및 복호화 기술과 다양한 출력채널 환경에 적응할 수 있는 렌더링(rendering) 기술을 표준화 대상으로 규정하고 있다.

  • PDF

Multichannel Audio Distribution through the IEEE 1394 Protocol. -A Practical Approach-

  • Lucas Jose Soler;Hong Jin Woo
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.59-62
    • /
    • 2000
  • The aim of this paper is to describe the current state of convergence of different kinds of networks in the home environment. In such a realm the 1394IEEE Protocol displays itself as the best player between other different technologies. A description of this high-speed protocol is provided. Finally, in this paper we suggest a prototype for multichannel audio distribution using IEEE 1394 and describe the development of the prototype elements.

  • PDF