• Title/Summary/Keyword: multichannel audio

Search Result 46, Processing Time 0.025 seconds

MPEG-4 ALS - The Standard for Lossless Audio Coding

  • Liebchen, Tilman
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.7
    • /
    • pp.618-629
    • /
    • 2009
  • The MPEG-4 Audio Lossless Coding (ALS) standard belongs to the family MPEG-4 audio coding standards. In contrast to lossy codecs such as AAC, which merely strive to preserve the subjective audio quality, lossless coding preserves every single bit of the original audio data. The ALS core codec is based on forward-adaptive linear prediction, which combines remarkable compression with low complexity. Additional features include long-term prediction, multichannel coding, and compression of floating-point audio material. This paper describes the basic elements of the ALS codec with a focus on prediction, entropy coding, and related tools and points out the most important applications of this standardized lossless audio format.

A Frequency-Domain Normalized MBD Algorithm with Unidirectional Filters for Blind Speech Separation

  • Kim Hye-Jin;Nam Seung-Hyon
    • The Journal of the Acoustical Society of Korea
    • /
    • v.24 no.2E
    • /
    • pp.54-60
    • /
    • 2005
  • A new multichannel blind deconvolution algorithm is proposed for speech mixtures. It employs unidirectional filters and normalization of gradient terms in the frequency domain. The proposed algorithm is shown to be approximately nonholonomic. Thus it provides improved convergence and separation performances without whitening effect for nonstationary sources such as speech and audio signals. Simulations using real world recordings confirm superior performances over existing algorithms and its usefulness for real applications.

Salience of Envelope Interaural Time Difference of High Frequency as Spatial Feature (공간감 인자로서의 고주파 대역 포락선 양이 시간차의 유효성)

  • Seo, Jeong-Hun;Chon, Sang-Bae;Sung, Koeng-Mo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.6
    • /
    • pp.381-387
    • /
    • 2010
  • Both timbral features and spatial features are important in the assessment of multichannel audio coding systems. The prediction model, extending the ITU-R Rec. BS. 1387-1 to multichannel audio coding systems, with the use of spatial features such as ITDDist (Interaural Time Difference Distortion), ILDDist (Interaural Level Difference Distortion), and IACCDist (InterAural Cross-correlation Coefficient Distortion) was proposed by Choi et al. In that model, ITDDistswere only computed for low frequency bands (below 1500Hz), and ILDDists were computed only for high frequency bands (over 2500Hz) according to classical duplex theory. However, in the high frequency range, information in temporal envelope is also important in spatial perception, especially in sound localization. A new model to compute the ITD distortions of temporal envelopes in high frequency components is introduced in this paper to investigate the role of such ITD on spatial perception quantitatively. The computed ITD distortions of temporal envelopes in high frequency components were highly correlated with perceived sound quality of multichannel audio sounds.

Online Monaural Ambient Sound Extraction based on Nonnegative Matrix Factorization Method for Audio Contents (오디오 컨텐츠를 위한 비음수 행렬 분해 기법 기반의 실시간 단일채널 배경 잡음 추출 기법)

  • Lee, Seokjin
    • Journal of Broadcast Engineering
    • /
    • v.19 no.6
    • /
    • pp.819-825
    • /
    • 2014
  • In this paper, monaural ambient component extraction algorithm based on nonnegative matrix factorization (NMF) is described. The ambience component extraction algorithm in this paper is developed for audio upmixing system; Recent researches have shown that they can enhance listener envelopment if the extracted ambient signal is applied into the multichannel audio upmixing system. However, the conventional method stores all of the audio signal and processes all at once, so it cannot be applied to streaming system and digital signal processor (DSP) system. In this paper, the ambient component extraction algorithm based on on-line nonnegative matrix factorization is developed and evaluated to solve the problem. As a result of analysis of the processed signal with spectral flatness measures in the experiment, it was shown that the developed system can extract the ambient signal similarly with the conventional batch process system.

Polyphonic sound event detection using multi-channel audio features and gated recurrent neural networks (다채널 오디오 특징값 및 게이트형 순환 신경망을 사용한 다성 사운드 이벤트 검출)

  • Ko, Sang-Sun;Cho, Hye-Seung;Kim, Hyoung-Gook
    • The Journal of the Acoustical Society of Korea
    • /
    • v.36 no.4
    • /
    • pp.267-272
    • /
    • 2017
  • In this paper, we propose an effective method of applying multichannel-audio feature values to GRNNs (Gated Recurrent Neural Networks) in polyphonic sound event detection. Real life sounds are often overlapped with each other, so that it is difficult to distinguish them by using a mono-channel audio features. In the proposed method, we tried to improve the performance of polyphonic sound event detection by using multi-channel audio features. In addition, we also tried to improve the performance of polyphonic sound event detection by applying a gated recurrent neural network which is simpler than LSTM (Long Short Term Memory), which shows the highest performance among the current recurrent neural networks. The experimental results show that the proposed method achieves better sound event detection performance than other existing methods.

An Enhancement of the MPEG-2 Audio Encoder Using General DSPs (범용 DSP를 이용한 MPEG-2 오디오 부호화기의 성능 개선)

  • 오현오;김성윤;윤대희;차일환;이준용
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1997.11a
    • /
    • pp.63-67
    • /
    • 1997
  • The ISO(International Standard Organization) has standardized MPEG-2 audio. The MPEG-2 audio compression algorithm is based upon subband analysis and exploits the human auditory characteristics to achieve a low bit rate with minimum perceptual loss of audio signal quality. This thesis presents an enhanced MPEG-2 audio encoder using multiple TMS320C30 general purpose DSP's. The developed system is made up of five slave boards and one master board. Each slave board performs susband analysis psychoacoustic parameter calculation for one channel, and the master board manages bit allocation, quantization, and bit-stream formatting for all channels. Parallel processing and pipelining techniques are used in hardware structure and fast algorithms are applied in each subroutine to implement a real-time process. The implemented system supports multichannel up to 5.1 and various bitrates.

  • PDF

Implementation of Local Distribution Audio System Based on AoIP (AoIP 기반 지역분산형 오디오시스템의 구현)

  • Kang, Min-Soo;Lee, Sang-Wook;Park, Yeoun-Sik
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.12 no.12
    • /
    • pp.2165-2170
    • /
    • 2008
  • In this parer, it is implemented a Local distribution Audio System, based on AoIP(Audio over Internet Protocol) of a part of TCP/IP Network which belongs to Internet transmission technology. The system is controlled based on SNMP(Simple Network Management Protocol) and it is transferred to UDP as packet after changing from Analog audio sources to Digital audio sources. The implemented Local distribution Audio System have presented practical possibilities in PA system transmitting various audio sources to several areas, dispersedly and using multichannel audio like Home theaters in the limelight, recently.

An efficient multichannel spatial audio coding method based on inter channel correlation (채널상관성에 기반한 효율적인 멀티채널 spatial audio coding 방법)

  • Lee Byonghwa;Beack Seungkwon;Seo Jeongil;Hahn Minsoo
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.157-160
    • /
    • 2004
  • Spatial Audio Coding 방법 중 하나인 Binaural Cue Coding 방법은 다채널 다객체 오디오 신호를 모노나 스테레오로 다운 믹스한 신호와 spatial 큐를 전송해 디코더에서 복원하는 기술로 작은 비트 율로 다채널 오디오 신호를 전송 복원해 내는 기술이다. 본 논문은 BCC 코딩 방법에서 채널 상관도를 나타내는 ICC 파라메터에 따라 spatial cue 종류를 달리함으로써 전송되는 부가정보의 비트 율을 줄이는 방법을 제안한다.

  • PDF

The Present State and Future Prospect on Standardization about Multichannel Audio - on the viewpoint of ITU-R (멀티채널 오디오 표준화 현황과 전망 - ITU-R을 중심으로)

  • Yoo, Jae-hyoun;Lee, Taejin;Kang, Kyeongok
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2013.11a
    • /
    • pp.147-150
    • /
    • 2013
  • UHDTV 시스템의 개발로 오디오에서도 보다 몰입감 있는 멀티채널 오디오 시스템에 대한 관심이 높아지고 있다. 이에, 과거 ITU-R에서 표준이 제정되어 극장 및 HDTV 등에 폭넓게 활용되어 온 5.1채널 대비 elevation 채널을 포함한 더 많은 채널 수를 사용하여 청취자에게 궁극의 몰입감을 줄 수 있는 멀티채널 오디오 시스템이 여러 표준화 단체를 통해 논의되고 있다. 이에 본 논문에서는 ITU-R에서 이루어지는 Advanced Multichannel Stereophonic Sound System 표준화 논의를 중점적으로 살펴보기 위하여, ITU-R의 구성과 현재까지 논의된 이슈 및 앞으로의 전망 등에 대해서 논하고자 한다.

  • PDF

A Performance Assessment of Real-time Multichannel Audio Codec

  • Kim, Sunghan;Jang, Daeyoung;Hong, Jinwoo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.3E
    • /
    • pp.56-61
    • /
    • 1997
  • In this paper, we describe a real-time implementation of a multi-channel auido codec system that is based on the MPEG-1 audio algorithm. The major feature of this system is that it has a flexible multi-DSP system that can be adapted for various applications with using up to four TMS320C40 DSPs. The purpose of this paper is to present the problems of the system and is to describe the optimized methods to solve the problems in the view of hardware and software. Our audio codec is composed of an encoder an a decoder system and the bit rate of bitstream is up to 384 kbps. Fast input/output interfaces, DSP overloads, and inter-DSP communications methods with high speed are considered in multi-DSP H/W. Also, to run real-time in S/W, optimizing methods of algorithm are considered. After implementation of system, the subjective assessment method, and 'triple stimulus/hidden reference/double blind' that recommended by ITU-R TG10/3 is adopted for the quality of our system. All test items except one are awarded difference grades(diffgrade) better than 1-. Form the results, multi-channel audio system can be used for HDTV service.

  • PDF