• Title/Abstract/Keywords: Realistic Audio

Search results: 64 items (processing time: 0.025 s)

Realistic Audio Teleconferencing using Binaural and Auralization Techniques

  • Kang, Seong-Hoon;Kim, Sung-Han
    • ETRI Journal / Vol. 18, No. 1 / pp. 41-51 / 1996
  • The goal of telecommunication may be to enable the participants in distant places to communicate with each other in an environment as if they were in the same room. This paper introduces the reason why realistic audio display is useful in telecommunication, reviews some approaches to its implementation, and proposes an audio teleconference model which realizes a two-way telecommunication with realistic sensations using binaural and auralization techniques.


On the Realistic Audio Teleconferencing using Auralization Technique

  • Kang, Seong-Hoon
    • The Journal of the Acoustical Society of Korea / Vol. 15, No. 1E / pp. 77-83 / 1996
  • The goal of telecommunication may be to enable the participants in distant places to communicate with each other in an environment as if they were in the same room. This paper introduces the reason why realistic audio display is useful in telecommunication, reviews some approaches to its implementation, and proposes an audio teleconference model which realizes a two-way telecommunication with realistic sensations using binaural and auralization techniques.


Interaction between Object and Audio in Augmented Reality

  • 조현욱;이종근;이종혁
    • Journal of the Korea Institute of Information and Communication Engineering / Vol. 15, No. 12 / pp. 2705-2711 / 2011
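  • With recent advances in multimedia technology, and in audio technology in particular, demand for high-quality audio has grown, and with it the need for realistic audio technology that reproduces sound with a greater sense of presence. To meet this demand, 3D audio that can deliver realistic audio effects to users in virtual and augmented reality is being actively studied. This paper investigates a method of applying improved audio techniques in augmented reality to provide realistic audio effects. To convey a sense of realism between the virtual and real worlds, the sound follows the movement of a 3D model displayed over a marker; that is, the volume and pitch of the sound change with changes in the model's distance, angle, and so on.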

A Study on Realistic Sound Reproduction for UHDTV

  • 장대영;서정일;이용주;유재현;박태진;이태진
    • Journal of Broadcast Engineering / Vol. 20, No. 1 / pp. 68-81 / 2015
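  • With recent advances in component and media-processing technology, it is widely taken for granted that UHDTV service will soon succeed HDTV. Audio technology, which provided 5.1-channel surround sound for HDTV, must accordingly consider what service it should offer in the UHDTV era. In practice, however, even the 5.1-channel sound format of HDTV has struggled in the market because it is difficult to install and maintain in the home. Meanwhile, the cinema sound market, where 5.1- and 7.1-channel formats have long been in use, is undergoing a major upheaval as hybrid audio technologies that include overhead (ceiling) sound and object-based audio, such as Dolby ATMOS, IOSONO, and AURO3D, are introduced one after another. Adoption of object-based audio in the home-theater and broadcast audio markets also appears certain, and this shift is expected to be an opportunity to open a way forward for the technical development and market growth of channel-based audio, which has lacked flexibility. This paper therefore reviews realistic audio technology suited to UHDTV broadcasting, describes the content formats of the related hybrid audio technologies and how they can be reproduced in the home, and considers future prospects.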

A 3D Audio-Visual Animated Agent for Expressive Conversational Question Answering

  • Martin, J.C.;Jacquemin, C.;Pointal, L.;Katz, B.
    • Korea Information Convergence Society: Conference Proceedings / Korea Information Convergence Society 2008 International Conference on Information Convergence / pp. 53-56 / 2008
  • This paper reports on the ACQA (Animated agent for Conversational Question Answering) project conducted at LIMSI. The aim is to design an expressive animated conversational agent (ACA) for conducting research along two main lines: 1/ perceptual experiments (e.g., perception of expressivity and 3D movements in both audio and visual channels); 2/ design of human-computer interfaces requiring head models at different resolutions and the integration of the talking head in virtual scenes. The target application of this expressive ACA is a real-time, speech-based question answering system developed at LIMSI (RITEL). The architecture of the system is based on distributed modules exchanging messages through a network protocol. The main components of the system are as follows. RITEL, a question answering system that searches raw text, produces a text (the answer) and attitudinal information; this attitudinal information is then processed to deliver expressive tags; the text is converted into phoneme, viseme, and prosodic descriptions. Audio speech is generated by the LIMSI selection-concatenation text-to-speech engine. Visual speech uses MPEG4 keypoint-based animation and is rendered in real time by Virtual Choreographer (VirChor), a GPU-based 3D engine. Finally, visual and audio speech is played in a 3D audio and visual scene. The project also puts considerable effort into realistic visual and audio 3D rendering. A new model of phoneme-dependent human radiation patterns is included in the speech synthesis system, so that the ACA can move in the virtual scene with realistic 3D visual and audio rendering.


A Performance Assessment of Real-time Multichannel Audio Codec

  • Kim, Sunghan;Jang, Daeyoung;Hong, Jinwoo
    • The Journal of the Acoustical Society of Korea / Vol. 16, No. 3E / pp. 56-61 / 1997
  • In this paper, we describe a real-time implementation of a multi-channel audio codec system based on the MPEG-1 audio algorithm. The major feature of this system is a flexible multi-DSP architecture that can be adapted to various applications by using up to four TMS320C40 DSPs. The purpose of this paper is to present the problems of the system and to describe optimized methods for solving them in terms of hardware and software. Our audio codec is composed of an encoder and a decoder system, and the bit rate of the bitstream is up to 384 kbps. In the multi-DSP hardware, fast input/output interfaces, DSP overloads, and high-speed inter-DSP communication methods are considered. In the software, methods of optimizing the algorithm for real-time operation are considered. After implementation, the quality of the system is evaluated with the subjective assessment method recommended by ITU-R TG10/3, 'triple stimulus/hidden reference/double blind'. All test items except one are awarded difference grades (diffgrades) better than -1. From the results, the multi-channel audio system can be used for HDTV service.


A Design of Real Sound Recommendation Service Based on User's Preference, Emotion and Circumstance

  • 정종진;임태범;이석필
    • Korea Information Processing Society: Conference Proceedings / Korea Information Processing Society 2011 Spring Conference / pp. 689-691 / 2011
  • Owing to the rapid development of information and communication technology, multimedia presentation technology is evolving from passive services into services that users can enjoy actively and realistically according to their own preferences and tastes. In particular, the industry around realistic multimedia services that target human emotion by exploiting the properties of human hearing is expected to form a high-value-added premium market. Audio technology is more affected by human emotion and the surrounding viewing environment than video technology is. Compared with video technology, audio technology is also a research area that appeals to human emotion and emphasizes psychological aspects. From this viewpoint, developing intelligent and realistic audio technology requires a high degree of specialization. This study introduces an "intelligent real-sound presentation technology" that supports high-quality, realistic audio, along with the "core technologies" that compose it.

Development of Portable Game Device with Uncompressed HD Video and High Quality Sound Output

  • 이충희;이종훈;정우영
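    • Korean Institute of Electrical Engineers: Conference Proceedings / Korean Institute of Electrical Engineers 2006 Conference Proceedings, Information and Control Section / pp. 391-393 / 2006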
  • In this paper, we develop a portable game device with uncompressed HD video and high-quality sound output. Portable game devices have recently come to support various complex functions in addition to gaming. In particular, they provide a TV-out port so that users can play realistic games on a large-screen display. However, the video and audio signals of a conventional TV-out port are analog and of low quality, so it is difficult for users to enjoy realistic games with the benefit of a high-resolution digital TV. We propose a game device output that uses an uncompressed digital signal, which has no video/audio delay and strong immunity to external noise. Since it supports high-resolution video and high-quality sound, users can play a realistic game. We first implement HDMI on the game device and then test its reliability with video inputs of various resolutions and with audio inputs. The proposed method can be applied to multimedia devices requiring high-performance output as well as to portable devices.


Standardization of MPEG-I Immersive Audio and Related Technologies

  • 장대영;강경옥;이용주;유재현;이태진
    • Electronics and Telecommunications Trends / Vol. 37, No. 3 / pp. 52-63 / 2022
  • Immersive media, also known as spatial media, has become essential with the decrease in face-to-face activities in the COVID-19 pandemic era. Teleconferencing, the metaverse, and digital twins have been developed with high expectations as immersive media services, and the demand for hyper-realistic media is increasing. Under these circumstances, MPEG-I Immersive Media is being standardized as a technology for navigable virtual reality, which is expected to be launched in the first half of 2024, and the Audio Group is working to standardize the immersive audio technology. Following this trend, this article introduces the trend in MPEG-I immersive audio standardization. Further, it describes the features of the immersive audio rendering technology, focusing on the structure and function of the RM0 base technology, which was chosen after evaluating all the technologies proposed in the January 2022 "MPEG Audio Meeting."

Design and Development of T-DMB Multichannel Audio Service System Based on Spatial Audio Coding

  • Lee, Yong-Ju;Seo, Jeong-Il;Beack, Seung-Kwon;Jang, Dae-Young;Kang, Kyeong-Ok;Kim, Jin-Woong;Hong, Jin-Woo
    • ETRI Journal / Vol. 31, No. 4 / pp. 365-375 / 2009
  • In this paper, a terrestrial digital multimedia broadcasting (T-DMB) multichannel audio broadcasting system based on spatial audio coding is presented. The proposed system provides realistic multichannel audio service via T-DMB with a small increase of data rate as well as backward compatibility with the conventional stereo-based T-DMB player. To reduce the data rate for additional multichannel audio signals, we compress the multichannel audio signals using the sound source location cue coding algorithm, which is an efficient parametric multichannel audio compression technique. For compatibility, we use the dependent property of an elementary stream descriptor, and this property should be ignored in a conventional T-DMB player. To verify the feasibility of the proposed system, we implement the T-DMB multichannel audio encoder and a prototype player. We perform a compatibility test using the T-DMB multichannel audio encoder and conventional T-DMB players. The test demonstrates that the proposed system is compatible with a conventional T-DMB player and that it can provide a promisingly rich audio service.