• Title/Summary/Keyword: 공간 오디오 부호화

Search Result 11, Processing Time 0.083 seconds

Complex Spatial Cue based Channel Audio Coding (복소 공간큐를 활용한 다채널 오디오 코딩 기술)

  • Beack, Seungkwon;Lim, Wootaek;Lee, Taejin
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.06a
    • /
    • pp.58-60
    • /
    • 2022
  • 본 논문에서는 복소(complex) 공간큐를 활용한 다채널 오디오 부호화 기술을 제안한다. 복소 공간큐 방식의 다채널 오디오 부호화 기술은 시간영역에서 수행된다. 시간영역의 오디오 채널 신호를 복소 데이터로 변환하여 각 오디오 채널 간의 상관관계를 복소 공간큐로 표현하고, 이를 활용하여 채널 부호화를 수행하기 위한 오디오 채널 신호를 생성한다. 참조 기술로는 최고 성능의 오디오 코덱인 USAC의 예측 부호화 방식의 다채널 오디오 부호화 기술과 비교하여 정보량 감축 측면에 있어서 평균 2.24 dB 이상의 높은 SNR을 나타냄을 관측할 수 있었다.

  • PDF

MPEG Surround for Multi-Channel Audio Coding-Part 1: Basic Structure (다채널 오디오 코딩을 위한 MPEG Surround-1부: 기본 구조)

  • Pang, Hee-Suk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.7
    • /
    • pp.599-609
    • /
    • 2009
  • An overview of the recently finalized multi-channel audio coding standard MPEG Surround is provided. This audio coding standard downmixes multi-channel signals to mono or stereo signals and, simultaneously, extracts spatial parameters for its encoding process. In its decoding process, it reconstructs multi-channel signals based on the downmix signals and spatial parameters. Since the downmix signals are coded in conventional audio coding format such as AAC and MP3 and the spatial parameters require a small amount of information MPEG Surround guarantees high sound quality multi-channel audio at low bit rates. Besides, it is backward-compatible to conventional audio coding techniques because the downmix signals can be played on portable audio devices ignoring the spatial parameter information. In this paper, Part 1 presents an overview of the basic structure of MPEG Surround and Part 2 describes various modes and tools including the binaural mode which supports the virtual 5.1-channel playback via headphones or earphones. The listening test results by various companies and organizations are also presented.

Design of An MPEG-2 Audio Encoder Chip (MPEG-2 오디오 부호화기 설계)

  • 정남훈
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.06c
    • /
    • pp.205-208
    • /
    • 1998
  • 본 논문에서는 VLSI 기술에 바탕을 둔 top-down 접근 방식에 의하여 MPEG-2 오디오 부호화 알고리듬을 구현하였다. MPEG-2 오디오 부호화기의 알고리듬은 많은 연산량을 갖고 이질적인 특성을 갖고 이질적인 특성을 갖는 알고리듬들이 복합적으로 존재한다. 그러므로, 부호화기를 효과적으로 구현하기 위해서는 알고리듬 수준에서 구조적 수준에 이르기까지 많은 고찰이 이루어져야 한다. 본 논문에서는 우선 전체 부호화 알고리듬을 분석하여 이들을 다시 작업이라고 정의된 작은 부-알고리듬으로 나누었다. 다음으로, 분할된 작업들은 시간과 공간을 초대한 활용할 수 있도록 적절한 작업 순서를 부여하고, 좀 더 큰 모듈들로 모으는 클러스터링을 수행하였다. 마지막으로 이러한 분석 결과를 바탕으로, 실시간으로 동작하는 5.1 채널 MPEG-2 오디오 부호화기를 설계하였다. 설계된 시스템은 두 개의 하드웨어 블록과 한 개의 ASIP형 DSP 프로세서를 갖는 이질적인 다중 프로세서의 형태를 갖는다. 설계된 오디오 부호화기는 0.6$\mu\textrm{m}$ 표준 셀 기술을 이용하여 단일 칩으로 제작되었으며, PC에 탑재 가능한 시험 기판을 제작하여 동작을 검증하였다.

  • PDF

Improved Synthesis Method of Negative Inter-channel Correlation Parameter Based on Anti-phase Primary Component (반위상 주요성분에 기반을 둔 개선된 음수 채널간 상관도 파라미터 합성 기법)

  • Hyun, Dong-Il;Lee, Seok-Pil;Park, Young-Cheol;Youn, Dae-Hee
    • The Journal of the Acoustical Society of Korea
    • /
    • v.31 no.6
    • /
    • pp.410-418
    • /
    • 2012
  • Parametric stereo(PS) and MPEG surround(MPS) are major spatial audio coding(SAC) tools. In this paper, the problem of the inter-channel correlation(ICC) synthesis in the conventional SAC is analyzed. Conventional methods assume that ambient components mixed to two output channels are anti-phased, while the primary components are assumed to be in-phased. This assumption can cause excessive ambient mixing for a negative-valued ICC. As a remedy to this problem, we propose a new ICC synthesis method based on an assumption that the primary components are anti-phased each other for a negative ICC. The proposed method is also applied to the approximation which works in practice. The performance of the proposed method was evaluated by computer simulations and the subjective listening tests verified that the proposed method is effective in not only headphones but also loudspeakers playback.

An Audio Coding Technique Employing the Inter-channel Phase Difference Skip (채널 간 위상차 파라미터 생략 기법을 이용한 오디오 부호화)

  • Kim, Hyun-Hwi;Kim, Rin-Chul
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2015.07a
    • /
    • pp.3-4
    • /
    • 2015
  • 본 논문에서는 공간 오디오 부호화 기법인 MPEG 서라운드에서 공간 파라미터 전송 시 위상 파라미터를 생략하는 기법에 대해 다룬다. 기존 방법에서는 한 프레임이 모두 적은 위상차를 가지는 경우에도 정상적으로 처리하여 전송한다. 이러한 경우 위상차 파라미터를 생략하여 비트 효율을 향상시킬 수 있다. 스테레오 복원 과정에서 발생하는 채널 간 시간차에 기반해 설계된 양자화기를 생략 기법에 적용하면 기존에 비해 평균적으로 40 ~ 50% 정도의 위상 파라미터 절감 효과를 얻을 수 있다.

  • PDF

Standardization of MPEG-I Immersive Audio and Related Technologies (MPEG-I Immersive Audio 표준화 및 기술 동향)

  • Jang, D.Y.;Kang, K.O.;Lee, Y.J.;Yoo, J.H.;Lee, T.J.
    • Electronics and Telecommunications Trends
    • /
    • v.37 no.3
    • /
    • pp.52-63
    • /
    • 2022
  • Immersive media, also known as spatial media, has become essential with the decrease in face-to-face activities in the COVID-19 pandemic era. Teleconference, metaverse, and digital twin have been developed with high expectations as immersive media services, and the demand for hyper-realistic media is increasing. Under these circumstances, MPEG-I Immersive Media is being standardized as a technologies of navigable virtual reality, which is expected to be launched in the first half of 2024, and the Audio Group is working to standardize the immersive audio technology. Following this trend, this article introduces the trend in MPEG-I immersive audio standardization. Further, it describes the features of the immersive audio rendering technology, focusing on the structure and function of the RM0 base technology, which was chosen after evaluating all the technologies proposed in the January 2022 "MPEG Audio Meeting."

Improved Phase Synthesis for Parametric Stereo Audio Coding (파라메트릭 스테레오 오디오 부호화를 위한 향상된 위상 합성 기법)

  • Hyun, Dong-Il;Park, Young-Cheol;Youn, Dae Hee
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.12
    • /
    • pp.184-190
    • /
    • 2013
  • Parametric stereo(PS) audio coding is a specific version of spatial audio coding. In this paper, the problem due to the conventional synthesis of phase differences. In the conventional upmix matrix, phase differences are synthesized not only on downmix signal but also ambient signal, which violates the assumption that the ambient signals are anti-phased. Deterioration due to the phase synthesis is analyzed, especially, for low interchannel correlation. To solve this problem, new upmix matrix is proposed, which synthesizes phase differences only on downmix signal. The performance of the proposed upmix matrix is verified by the subjective listening tests.

An Audio Coding Technique Employing the Inter-channel Phase Difference Skip (채널 간 위상차 파라미터 생략 기법을 이용한 오디오 부호화)

  • Kim, Hyun-Hwi;Kim, Rin-Chul
    • Journal of Broadcast Engineering
    • /
    • v.21 no.3
    • /
    • pp.369-379
    • /
    • 2016
  • This paper deals with an efficient method for skipping inter-channel phase differences (IPD) in the MPEG surround of the unified speech and audio coding (USAC). Based on the psycho-acoustic sensitivity on the IPD, we estimate a threshold on IPD, below which we can not notice degradation in spatial cue. We propose an IPD skip method, in which any IPDs within the threshold are set to zero and are not transmitted. The proposed IPD skip method gives about 38% savings in terms of bit amount for IPD. Nevertheless, in the MUSHRA test, the proposed method does not show any noticeable degradation in the decoded audio quality.

Similar Movie Contents Retrieval Using Peak Features from Audio (오디오의 Peak 특징을 이용한 동일 영화 콘텐츠 검색)

  • Chung, Myoung-Bum;Sung, Bo-Kyung;Ko, Il-Ju
    • Journal of Korea Multimedia Society
    • /
    • v.12 no.11
    • /
    • pp.1572-1580
    • /
    • 2009
  • Combing through entire video files for the purpose of recognizing and retrieving matching movies requires much time and memory space. Instead, most current similar movie-matching methods choose to analyze only a part of each movie's video-image information. Yet, these methods still share a critical problem of erroneously recognizing as being different matching videos that have been altered only in resolution or converted merely with a different codecs. This paper proposes an audio-information-based search algorithm by which similar movies can be identified. The proposed method prepares and searches through a database of movie's spectral peak information that remains relatively steady even with changes in the bit-rate, codecs, or sample-rate. The method showed a 92.1% search success rate, given a set of 1,000 video files whose audio-bit-rate had been altered or were purposefully written in a different codec.

  • PDF

A Real Time 6 DoF Spatial Audio Rendering System based on MPEG-I AEP (MPEG-I AEP 기반 실시간 6 자유도 공간음향 렌더링 시스템)

  • Kyeongok Kang;Jae-hyoun Yoo;Daeyoung Jang;Yong Ju Lee;Taejin Lee
    • Journal of Broadcast Engineering
    • /
    • v.28 no.2
    • /
    • pp.213-229
    • /
    • 2023
  • In this paper, we introduce a spatial sound rendering system that provides 6DoF spatial sound in real time in response to the movement of a listener located in a virtual environment. This system was implemented using MPEG-I AEP as a development environment for the CfP response of MPEG-I Immersive Audio and consists of an encoder and a renderer including a decoder. The encoder serves to offline encode metadata such as the spatial audio parameters of the virtual space scene included in EIF and the directivity information of the sound source provided in the SOFA file and deliver them to the bitstream. The renderer receives the transmitted bitstream and performs 6DoF spatial sound rendering in real time according to the position of the listener. The main spatial sound processing technologies applied to the rendering system include sound source effect and obstacle effect, and other ones for the system processing include Doppler effect, sound field effect and etc. The results of self-subjective evaluation of the developed system are introduced.