• Title/Summary/Keyword: object-based audio service

Search Result 28

Interactive Synchronization Mechanism based on the Petri Net for the Stream Transmission (스트림 전송을 위한 패트리 넷 기반의 상호대화형 동기화 기법)

  • Lee, Yang-Min; Lee, Jae-Kee
    • Proceedings of the Korea Information Processing Society Conference / 2001.10b / pp.1517-1520 / 2001
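  • Earlier computer-based media services simply delivered media such as video, audio, and text to the user in one direction, but current services must interact with the user and deliver only the media the user needs. Such applications require each media file to be delivered separately, and they must solve two problems: synchronization and interactivity. Related work so far achieves synchronization through methods such as time axes, Petri nets, and buffer manipulation, but offers no satisfactory solution for interactivity. This paper uses a Petri net model, inserts an interactive object into each media file, and designs functions that let these objects use one another's information, thereby addressing both synchronization and interactivity.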

URL Synchronization Mechanism of Lab Note in Collaborative Research Environment (원격 공동연구에서 Lab Note의 URL 동기화에 관한 연구)

  • 김경하; 황대준
    • Proceedings of the Korean Society for Emotion and Sensibility Conference / 1998.04a / pp.281-286 / 1998
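  • This paper discusses URL synchronization for Lab Note, which is to be used in a videoconferencing system, one of the synchronous-mode collaboration services to be provided on a PC-based remote collaborative research platform. Lab Note was designed so that the videoconferencing system, which includes audio, video, and whiteboard objects, can share HTML documents over the Internet. While a session is in progress, all users share HTML documents through Lab Note, which requires URL synchronization among all users participating in the session.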

MPEG-H 3D Audio Decoder Structure and Complexity Analysis (MPEG-H 3D 오디오 표준 복호화기 구조 및 연산량 분석)

  • Moon, Hyeongi; Park, Young-cheol; Lee, Yong Ju; Whang, Young-soo
    • The Journal of Korean Institute of Communications and Information Sciences / v.42 no.2 / pp.432-443 / 2017
  • The primary goal of the MPEG-H 3D Audio standard is to provide immersive audio environments for high-resolution broadcasting services such as UHDTV. The standard incorporates a wide range of technologies, including encoding/decoding of multi-channel, object-based, and scene-based signals, rendering for 3D audio in various playback environments, and post-processing. The reference software decoder of the standard combines several modules and can operate in various modes. Because each module is an independent executable and the modules run sequentially, real-time decoding is impossible. In this paper, we build DLL libraries for the core decoder, format converter, object renderer, and binaural renderer of the standard and integrate them to enable frame-based decoding. In addition, by measuring the computational complexity of each mode of the MPEG-H 3D Audio decoder, the paper provides a reference for selecting an appropriate decoding mode for various hardware platforms. According to the measurements, the low-complexity profile included in the Korean broadcasting standard has a computational complexity of 2.8 to 12.4 times that of the QMF synthesis operation when rendering to channel signals, and 4.1 to 15.3 times that of the QMF synthesis operation when rendering to binaural signals.

Multi-View Point switch System Structure & Implementation of Video player in MPEG-4 based (MPEG-4 시스템 기반의 다시점 전환 시스템 구조 및 재생기 구현)

  • Lee, Jun-Cheol; Lee, Jung-Won; Chang, Yong-Seok; Kim, Sung-Ho
    • Journal of the Institute of Electronics Engineers of Korea CI / v.44 no.1 / pp.80-93 / 2007
  • This paper proposes structures for the Object Descriptor and the Elementary Stream Descriptor that provide multi-view video services within the current MPEG-4 3-Dimensional Audio Video technical standards. First, it defines the Object Descriptor and Elementary Stream Descriptor structures on the existing MPEG-4 system, distributes them individually, and analyzes the result. However, extending the existing system is inappropriate for multi-view audio-video services that involve connected transmission and reception. The paper therefore proposes a new Object Descriptor structure that can switch viewpoints and that takes into account the correlation between viewpoints when multi-view video is transmitted. With this structure, viewpoints can be switched on user request in a multi-view video service, and the overhead of transmitting information about the required viewpoint is reduced.

Data Carousel Manager based Message Caching for Broadcasting Data Process in Digital Broadcasting Systems (디지털 방송 시스템에서의 방송 데이터 처리를 위한 메시지 캐슁 기반의 데이터 캐루셀 매니저)

  • Won, Jae-Hoon; Kim, Seh-Chang; Ko, Sang-Won; Jeon, Jae-Min; Kim, Jung-Sun
    • Proceedings of the Korean Information Science Society Conference / 2007.10d / pp.431-434 / 2007
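  • As the broadcasting environment at home and abroad rapidly shifts to digital, data broadcasting, which uses existing broadcast networks such as terrestrial, cable, and satellite to transmit data services offered by a service provider when the user requests them, has come to provide broadcast-related data, or pure data not directly related to broadcasts, in addition to conventional video and audio programs. DVB (Digital Video Broadcasting), a data broadcasting standards body, proposes Data Streaming, Data Piping, Data Carousel, Multiprotocol Encapsulation, and Object Carousel as data transmission techniques for data broadcasting. This paper covers the design and implementation of a data carousel manager based on message caching and module caching, intended to manage the data used in data broadcasting efficiently.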

Implementation of SMIL Editor for Multimedia Broadcasting (멀티미디어 방송을 위한 SMIL 편집 시스템 구현)

  • 장대영; 김창수; 정회경
    • Journal of the Korea Institute of Information and Communication Engineering / v.8 no.3 / pp.622-629 / 2004
  • Recently, as digital broadcasting and the Internet have spread around the world, information has become easy to use with fewer restrictions of time and space. In line with this trend, interest in ways of representing multimedia data has grown rapidly, and users demand services built on integrated documents that carry not only simple text and images but also time-varying audio-visual data. Therefore, in 1998 the W3C presented an international standard, SMIL, to solve the problems of multimedia object representation and synchronization. With SMIL, various multimedia elements can be integrated into a single multimedia document with an appropriate layout in space and time. Such a SMIL document makes it possible to create a new Internet radio broadcasting service that delivers not only audio data but also text, images, and video. In this paper, we describe a SMIL document editor that lets ordinary users represent time-varying multimedia data with spatial layout and synchronization in time and space.

Development of ATSC3.0 based UHDTV Broadcasting System providing Ultra-high-quality Service that supports HDR/WCG Video and 3D Audio, and a Fixed UHD/Mobile HD Service (HDR/WCG 비디오와 3D 오디오를 지원하는 초고품질 방송서비스와 고정 UHD/이동 HD 방송 서비스를 제공하는 ATSC 3.0 기반 UHDTV 방송 시스템 개발)

  • Ki, Myungseok; Seok, Jinwuk; Beack, Seungkwon; Jang, Daeyoung; Lee, Taejin; Kim, Hui Yong; Oh, Hyeju; Lim, Bo-mi; Bae, Byungjun; Kim, Heung Mook; Choi, Jin Soo
    • Journal of Broadcast Engineering / v.22 no.6 / pp.829-849 / 2017
  • Due to large-screen TV displays, the convergence of broadcasting and broadband, and advances in signal compression and transmission technology, terrestrial digital broadcasting has evolved into UHD broadcasting capable of simultaneously broadcasting fixed UHD and mobile HD. The Korean standard for terrestrial UHDTV broadcasting is based on ATSC 3.0, the broadcasting standard of North America. As its new AV codecs, the terrestrial UHDTV broadcasting standard chose the HEVC video codec, which compresses more efficiently than AVC, and the MPEG-H 3D Audio codec for realistic audio. In addition, DASH and MMT are adopted as transmission formats instead of MPEG-2 TS to support broadband as well as broadcast networks, and ROUTE multiplexing technology is applied to provide 4K UHD and mobile HD services simultaneously. In this paper, we propose an audio/video encoder required to provide an HDR/WCG high-quality video service, a stereo sound service supporting 10.2 channels and 4 objects, and a simultaneous fixed UHD and mobile HD broadcasting service based on ATSC 3.0. We also implemented the ATSC 3.0 LDM system for the ROUTE/DASH packager, the multiplexing system, and physical layer transmission/reception, and verified its serviceability by applying it to a real-time broadcast environment.

A Unified Method for Vocal Source Separation From Stereophonic Music Signals (스테레오 음악 신호에서의 보컬 음원 분리를 위한 통합 알고리즘)

  • Kim, Min-Je; Jang, In-Seon; Kang, Kyeong-Ok
    • Journal of the Institute of Electronics Engineers of Korea SP / v.47 no.5 / pp.89-99 / 2010
  • A unified method for separating musical sources, such as the singing voice, from stereophonic mixtures is presented. Stereophonic music content usually provides two observed signals, while more than two instruments are played together. If each instrument is regarded as a source, this becomes an underdetermined source separation problem that cannot be solved by conventional methods, which infer the spatial environment in which the downmixing process happens. Instead, source-specific information has been exploited to recover a particular instrumental source. This paper provides a unifying structure consisting of heterogeneous ad-hoc separation algorithms, which are designed to separate vocal sources using stereophonic channel information and dominant pitch information of the sources, respectively. Experiments on real-world music content show that the proposed unification can neutralize the drawbacks of the two ad-hoc separation algorithms and ultimately improve the separation results.