• Title/Summary/Keyword: MPEG Audio

Search Result 322, Processing Time 0.03 seconds

An Implementation of Interactive 3D Audio Broadcasting Terminal (대화형 3차원 오디오 방송단말 구현)

  • Park Gi Yoon;Lee Taejin;Kang Kyeongok
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2004.11a
    • /
    • pp.211-214
    • /
    • 2004
  • 본 논문에서는 사용자의 입력에 따라 3차원 오디오 장면을 재구성하여 전달할 수 있는 대화형 오디오 방송단말의 구현 예를 제시한다. MPEG-4 AudioBIFS 규격에 따라 계층적으로 표현한 오디오 장면의 속성을 사용자의 입력에 따라 갱신하고, 주어진 속성을 참조하여 오디오 데이터를 3차원 공간상에 재합성하는 방식을 취한다 속성을 갱신하는 모듈은 MPEG-4 Audio 프로파일을 지원하게 하되 AudioBIFS 노드 유형에 따른 사용자 인터페이스를 미리 정의하여 단말 측에 저장해 두고 이용함으로써 대화형 방송 서비스를 구현했다. 3차원 오디오 데이터를 재생하는 기능은 사용자의 입력에 대한 피드백을 풍부하게 하여 대화형 방송의 효과를 극대화하고, 사실감을 제고하는 데 중요한 역할을 담당한다. 요소기술로 음상의 위치, 지향성, 모양, 잔향특성 등을 구현하기 위한 3차원 오디오 기술에 대해 소개한다. 또한 대화형 3차원 오디오 방송단말을 이용한 서비스의 예로 대화형 합주 및 합창 프로그램을 소개한다.

  • PDF

Implementation of a Person Tracking Based Multi-channel Audio Panning System for Multi-view Broadcasting Services (다시점 방송 서비스를 위한 사용자 위치추적 기반 다채널 오디오 패닝 시스템 구현)

  • Kim, Yong-Guk;Yang, Jong-Yeol;Lee, Young-Han;Kim, Hong-Kook
    • 한국HCI학회:학술대회논문집
    • /
    • 2009.02a
    • /
    • pp.150-157
    • /
    • 2009
  • In this paper, we propose a person tracking based multi-channel audio panning system for multi-view broadcasting services. Multi-view broadcasting is to render the video sequences that are captured from a set of cameras based on different viewpoints, and multi-channel audio panning techniques are necessary for audio rendering in these services. In order to apply such a realistic audio technique to this multi-view broadcasting service, person tracking techniques which are to estimate the position of users are also necessary. For these reasons, proposed methods are composed of two parts. The first part is a person tracking method by using ultrasonic satellites and receiver. We could obtain user's coordinates of high resolution and short duration about 10 mm and 150 ms. The second part is MPEG Surround parameter-based multi-channel audio panning method. It is a method to obtain panned multi-channel audio by controlling the MPEG Surround spatial parameters. A MUSHRA test is conducted to objectively evaluate the perceptual quality and measure localization performance using a dummy head. From the experiments, it is shown that the proposed method provides better perceptual quality and localization performance than the conventional parameter-based audio panning method. In addition, we implement the prototype of person tracking based multi-view broadcasting system by integrating proposed methods with multi-view display system.

  • PDF

Efficient Multiplex Audio Monitoring System in Digital Broadcasting (디지털 방송에서 효율적인 다중 오디오 모니터링 시스템)

  • Kim, Yoo-Won;Sohn, Surg-Won;Jo, Geun-Sik
    • Journal of the Korea Society of Computer and Information
    • /
    • v.13 no.7
    • /
    • pp.91-98
    • /
    • 2008
  • In digital broadcasting, it is possible to multiplex maximum one hundred audio or music programs into MPEG-2 transport stream, which is suitable for transmitting through one channel. In order to check if multiplex music programs are transmitted well, we need a multiplex audio monitoring system that monitors the programs in real-time. In analog broadcasting, we have used hardware-based audio monitoring system for a small number music programs. However, the effectiveness of hardware-based audio monitoring system from the cost and function viewpoint is so low that a new system is needed for digital broadcasting. In this paper, we have designed and implemented a software-based audio monitoring system to satisfy these requirements. In this implementation, only one PC is used without other hardware facilities, and the system monitors digital broadcasting music programs effectively. Transmitted digital broadcasting streams are demultiplexed into many music programs and the realtime value of audio level and packet error information for these programs are displayed in the screen. Thus, the system detects and shows the abnormal transmitting programs automatically. Simulation results show that effective realtime multiplex audio monitoring is possible for digital broadcasting music programs.

  • PDF

Audio Streaming Technology for Internet Audio Broadcasting (인터넷 오디오 방송을 위한 오디오 스트리밍 기술)

  • Kang Kyeongok;Hong Jin-Woo
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.347-350
    • /
    • 2000
  • 인터넷을 이용한 디지털 오디오 방송 서비스에 대한 관심이 집중되면서 디지털 오디오 데이터를 서버로부터 사용자에게까지 실시간으로 전송하기 위한 연구가 진행되고 있다. 디지털 오디오 방송의 실시간 전송을 위하여 효율적인 오디오 압축 기술의 개발도 중요하지만 이들 오디오 압축 기술과 연계되는 오디오 스트리밍 기술이 매우 중요하다. 본 논문에서는 현재 사용되고 인터넷 오디오 방송관련 기술을 분석하고, 특히, IETF에서 논의되고 있는 MPEG-2AAC 및 MPEG-4 오디오를 인터넷을 통하여 전송하기 위한 RTP payload 포맷을 분석하고, 기술개발을 위한 고려사항을 제안한다.

  • PDF

An Implementation of MP3 Audio Player using IBM PC CD-ROM (IBM PC에서 독립적으로 작동하는 MP3 오디오 Player의 구현)

  • 안광삼;황희융
    • Proceedings of the KAIS Fall Conference
    • /
    • 2000.10a
    • /
    • pp.194-198
    • /
    • 2000
  • 최근 저장 장치로 많이 사용되고 있는 IBM PC의 CD-ROM(Compact Disc Read Only Memory)과 MP3(MPEG-1 Layer Ⅲ Audio) 디코더 칩을 이용하여 PC 작동과 관계없이 독립적으로 작동하는 CD-ROM 기반의 MP3 Player를 구현하였다. 여기서는 CD-ROM의 규격과 CD-ROM에 사용되는 ATAPI(AT Attachment Packet Interface) Format, MPEG-1 Layer Ⅲ의 오디오 부분에 대하여 알아보고 MP3 디코더 칩을 사용하여 CD-ROM에서 읽은 MP3 데이터즐 재생하는 방법을 취하였다. 이리하여 PC 자체로 MP3를 작동시키는 부하를 경감시키는 효과를 얻었다.

The Design of Object-based 3D Audio Broadcasting System (객체기반 3차원 오디오 방송 시스템 설계)

  • 강경옥;장대영;서정일;정대권
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.7
    • /
    • pp.592-602
    • /
    • 2003
  • This paper aims to describe the basic structure of novel object-based 3D audio broadcasting system To overcome current uni-directional audio broadcasting services, the object-based 3D audio broadcasting system is designed for providing the ability to interact with important audio objects as well as realistic 3D effects based on the MPEG-4 standard. The system is composed of 6 sub-modules. The audio input module collects the background sound object, which is recored by 3D microphone, and audio objects, which are recorded by monaural microphone or extracted through source separation method. The sound scene authoring module edits the 3D information of audio objects such as acoustical characteristics, location, directivity and etc. It also defines the final sound scene with a 3D background sound, which is intended to be delievered to a receiving terminal by producer. The encoder module encodes scene descriptors and audio objects for effective transmission. The decoder module extracts scene descriptors and audio objects from decoding received bistreams. The sound scene composition module reconstructs the 3D sound scene with scene descriptors and audio objects. The 3D sound renderer module maximizes the 3D sound effects through adapting the final sound to the listner's acoustical environments. It also receives the user's controls on audio objects and sends them to the scene composition module for changing the sound scene.

Multi-View Point switch System Structure & Implementation of Video player in MPEG-4 based (MPEG-4 시스템 기반의 다시점 전환 시스템 구조 및 재생기 구현)

  • Lee, Jun-Cheol;Lee, Jung-Won;Chang, Yong-Seok;Kim, Sung-Ho
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.44 no.1
    • /
    • pp.80-93
    • /
    • 2007
  • This paper suggests structures of the Object Descriptor and the Elementary Stream Descriptor that provide multi-view video services in 3-Dimensional Audio Video technical standards of current MPEG-4. First, it defines that the structures of the Object Descriptor and the Elementary Stream Descriptor on established MPEG-4 system, then distributes individually, and analyzes that. But extension of established system is inappropriate for providing multi-view audio video services connected transmissions and receptions. And, this paper suggests a structure of new Object Descriptor able to switch viewpoints that considers the correlation between each viewpoints, when multi-view video is transmitted. By means of that, it is able to switch viewpoints according to a requirement of a user in a multi-view video services, and reduce overheads for transmitting information about necessary viewpoint.

A Study on Common Synthesis Filter Architecture for MPEG-2 BC and AAC Audio (MPEG-2 BC/AAC 오디오 공용 합성 필터 구조에 관한 연구)

  • 강명수;박세기;오신범;이채욱
    • Proceedings of the IEEK Conference
    • /
    • 2003.11a
    • /
    • pp.73-76
    • /
    • 2003
  • 본 논문에서는 MPEG-2 BC와 AAC의 복호화 과정 중 함성 필터링 과정의 알고리듬을 분석하여 공동된 구조로 연산을 수행한 수 있는 광용 합성 필터 구조에 대하여 논하였다. 제안된 공용 합성 필터 구조는 Regressive 구조를 이용하여 MPEG-2 BC와 AAC의 복호화를 효과적으로 공용 수행하도록 하였다. 제안한 구조는 FFT를 사용할 경우에 필요한 전처리 및 후처리 과정을 고려해주지 않아도 되고 복소수 연산이 아닌 실수연산이 되어 하드웨어 구조가 단순하게 된다. 또한 MPEG-2 AAC의 다양한 윈도우 변환에도 안정적으로 연산되는 구조임을 확인하였다.

  • PDF

Retrieval of Broadcast News Using Audio Content Analysis

  • Kim, Hyoung-Gook
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.3E
    • /
    • pp.74-79
    • /
    • 2007
  • In this paper, we report our recent work on a indexing and retrieval system of broadcast news using audio content analysis. Key issues addressed in this work are two major parts of the audio indexing system: anchorperson detection based on audio segmentation, and phone-based spoken document retrieval, developed in the framework of the emerging MPEG-7 standard. Experiments are conducted on a database of Britisch broadcast news videos. We discuss the development of the retrieval system, and the evaluation of each part and the retrieval system.

An Audio Watermarking Technique Using BPSK with Variable Carrier Frequency (가변 반송파 BPSK를 이용한 오디오 워터마킹 기법)

  • 이형욱;박세형;문용민;한상우;신재호
    • Proceedings of the IEEK Conference
    • /
    • 2000.06d
    • /
    • pp.110-113
    • /
    • 2000
  • In this paper, we consider the problem of digital audio watermarking to robust about compression without original audio data. We specifically address the audio watermarking using BPSK with variable carrier frequency. This technique make audio data embeded watermarking robust with compression attack, for example MPEG, AC-3, etc.

  • PDF