• Title/Summary/Keyword: Spatial Audio

Efficient Representation method of Spatial cues for audio coding (오디오 채널 신호의 압축을 위한 공간 큐의 효율적 표현 방법)

  • Beack, Seung-Kwon;Kim, Min-Je;Lee, Tae-Jin;Jang, Dae-Young;Kang, Kyeong-Ok
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2008.02a / pp.183-186 / 2008
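  • This paper proposes an efficient representation of spatial parameters for compressing audio channel signals in the spatial domain. The target parameter is the ICLD (Inter-Channel Level Difference), the spatial parameter related to the human auditory perception of ILD (Interaural Level Difference). The aim is to analyze the statistical characteristics of the ICLD and propose a representation that is faithful to them, thereby reducing quantization distortion relative to the existing representation and improving the fidelity of the reconstructed audio signal. Accordingly, the paper introduces a new ICLD representation, presents its theoretical and statistical basis, and demonstrates the superiority of the proposed method with experimental distortion-measure results compared against the existing method.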
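
As a rough illustration of the quantities this abstract refers to, the sketch below computes an ICLD in dB from two channel frames and measures the squared error introduced by a plain uniform quantizer. It is only a baseline under stated assumptions: the abstract does not describe the proposed representation, so the uniform quantizer, the 1.5 dB step, the ±45 dB range, and the function names `icld_db` and `quantize_uniform` are all hypothetical.

```python
import math

def icld_db(ch1, ch2, eps=1e-12):
    """Inter-channel level difference in dB: 10*log10(P1/P2), with P the frame energy."""
    p1 = sum(x * x for x in ch1)
    p2 = sum(x * x for x in ch2)
    # eps keeps the ratio finite when a channel is silent
    return 10.0 * math.log10((p1 + eps) / (p2 + eps))

def quantize_uniform(value, step, lo, hi):
    """Clamp to [lo, hi], round to the nearest multiple of step.

    Returns (index, reconstructed_value). Step and range are illustrative.
    """
    clamped = max(lo, min(hi, value))
    index = round(clamped / step)
    return index, index * step

# Two toy frames: the first channel has four times the energy of the second.
left = [1.0, -1.0, 1.0, -1.0]
right = [0.5, -0.5, 0.5, -0.5]

cld = icld_db(left, right)
index, cld_hat = quantize_uniform(cld, step=1.5, lo=-45.0, hi=45.0)
distortion = (cld - cld_hat) ** 2  # squared-error distortion of this one parameter
print(f"ICLD = {cld:.3f} dB, index = {index}, reconstructed = {cld_hat} dB")
```

A distortion comparison of the kind the abstract mentions would average such an error over many frames and bands for each candidate representation.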

Extended Pilot-Based Coding for Lossless Bit Rate Reduction of MPEG Surround

  • Pang, Hee-Suk;Lim, Jae-Hyun;Oh, Hyen-O
    • ETRI Journal / v.29 no.1 / pp.103-106 / 2007
  • Pilot-based coding (PBC), which is used for lossless bit rate reduction of audio coding, has been recently proposed for MPEG Surround. We propose extended PBC for further lossless bit rate reduction of MPEG Surround. Extended PBC selects the number of pilots depending on the parameter band number and the type of spatial parameter. It then encodes the pilots and the relevant difference data. Experiments show that extended PBC is more effective than the original PBC, especially for high bit rate modes, with a negligible complexity increase on the decoder side.
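
The idea of coding a pilot value plus differences can be sketched as follows. This is a minimal illustration, not the MPEG Surround procedure: the abstract does not say how pilots are chosen or how the differences are entropy-coded, so the median pilot, the contiguous grouping of parameter bands, and the sum of absolute differences as a crude stand-in for bit cost are all assumptions, as are the function names.

```python
def pbc_encode(values):
    """Pick one pilot (assumed here: the median) and return (pilot, differences)."""
    pilot = sorted(values)[len(values) // 2]
    return pilot, [v - pilot for v in values]

def pbc_decode(pilot, diffs):
    """Invert pbc_encode exactly; the scheme is lossless."""
    return [pilot + d for d in diffs]

def extended_pbc_encode(values, num_pilots):
    """Split the parameter bands into contiguous groups, one pilot per group."""
    size = -(-len(values) // num_pilots)  # ceiling division
    return [pbc_encode(values[i:i + size]) for i in range(0, len(values), size)]

def extended_pbc_decode(groups):
    out = []
    for pilot, diffs in groups:
        out.extend(pbc_decode(pilot, diffs))
    return out

def diff_cost(groups):
    """Crude proxy for bit cost: smaller differences are cheaper to entropy-code."""
    return sum(abs(d) for _, diffs in groups for d in diffs)

# Quantized spatial-parameter indices for eight parameter bands (toy data).
indices = [3, 4, 4, 5, 12, 13, 12, 11]
one_pilot = extended_pbc_encode(indices, 1)
two_pilots = extended_pbc_encode(indices, 2)
print(diff_cost(one_pilot), diff_cost(two_pilots))
```

On this toy data a second pilot shrinks the differences considerably, which is the kind of trade-off that would make the number of pilots worth adapting to the band count and parameter type.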

Virtual displays and virtual environments

  • Gilkey, R.H.;Isabelle, S.K.;Simpson, B.B.
    • Journal of the Ergonomics Society of Korea / v.16 no.2 / pp.101-122 / 1997
  • Our recent work on virtual environments and virtual displays is reviewed, including our efforts to establish the Virtual Environment Research, Interactive Technology, And Simulation (VERITAS) facility and our research on spatial hearing. VERITAS is a state-of-the-art multisensory facility built around CAVE™ technology. High-quality 3D audio is included and haptic interfaces are planned. The facility will support technical and non-technical users working in a wide variety of application areas. Our own research emphasizes the importance of auditory stimulation in virtual environments and complex display systems. Experiments on auditory-aided visual target acquisition, sensory conflict, sound localization in noise, and localization of speech stimuli are discussed.

Spatial Audio Rendering for AR and VR (AR 및 VR 을 위한 공간 오디오 렌더링)

  • Sang-Wook Kim;Kyeongok Kang;Taejin Lee
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2022.11a / pp.14-17 / 2022
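  • This paper presents techniques for extending audio processing methods used in the real world to virtual reality (VR) and augmented reality (AR). When designing a virtual reality space, such as those used to build metaverse services, audio processing must account for the reverberation caused by diffraction and reflection of sound in the scene where the user is located, so that the user can be immersed in the scene. In augmented reality applications, positional registration between virtual and real information must be provided on the basis of images or location in order to deliver real information together with augmented effects. The paper reviews the technologies that must be added to real-world audio playback to support VR and AR, along with the current status of international standardization for immersive audio services, and identifies areas where further development and refinement are needed.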

Complex Spatial Cue based Channel Audio Coding (복소 공간큐를 활용한 다채널 오디오 코딩 기술)

  • Beack, Seungkwon;Lim, Wootaek;Lee, Taejin
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2022.06a / pp.58-60 / 2022
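  • This paper proposes a multichannel audio coding technique that uses complex spatial cues. The complex-spatial-cue coding is performed in the time domain. Time-domain audio channel signals are converted to complex data, the correlation between audio channels is expressed as a complex spatial cue, and this cue is used to generate the audio channel signals that are then channel-coded. Compared with the reference technique, the prediction-based multichannel coding of USAC, the best-performing audio codec, the proposed method was observed to achieve an SNR higher by 2.24 dB or more on average in terms of information reduction.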
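
One plausible reading of a complex inter-channel cue is a single complex gain that predicts one channel from another, carrying a level ratio in its magnitude and a phase offset in its angle. The sketch below computes such a gain by least squares and forms the prediction residual. This is an assumption, not the paper's definition: the abstract does not specify the cue, and the step that converts real time-domain signals to complex data is omitted here (the inputs are taken to be complex already). The names `complex_cue` and `residual` are hypothetical.

```python
import cmath
import math

def complex_cue(ref, target):
    """Least-squares complex gain c minimizing sum |target[n] - c * ref[n]|^2."""
    num = sum(t * r.conjugate() for r, t in zip(ref, target))
    den = sum(abs(r) ** 2 for r in ref)
    return num / den if den > 0 else 0j

def residual(ref, target, c):
    """What remains of the target channel after prediction from the reference."""
    return [t - c * r for r, t in zip(ref, target)]

# Toy complex reference channel, and a target that is a scaled, phase-shifted copy.
ref = [cmath.exp(1j * 0.3 * n) for n in range(64)]
true_cue = 0.5 * cmath.exp(1j * cmath.pi / 4)
target = [true_cue * r for r in ref]

c = complex_cue(ref, target)
res = residual(ref, target, c)
level_db = 20 * math.log10(abs(c))  # level part of the cue
phase_rad = cmath.phase(c)          # phase part of the cue
print(f"level = {level_db:.2f} dB, phase = {phase_rad:.3f} rad")
```

When the cue captures most of the inter-channel relationship, the residual is small and cheaper to code, which is the general motivation for prediction-style multichannel coding.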

Image Enhancement Techniques for MPEG-4 (MPEG-4 영상의 화질 개선에 관한 연구)

  • 김태근;신정호;백준기
    • Journal of Broadcast Engineering / v.2 no.2 / pp.169-181 / 1997
  • In this paper, we propose and discuss image enhancement techniques for MPEG-4, a very low bit-rate, content-based, and object-based hierarchical audio-visual coding standard. The proposed enhancement technique removes undesired artifacts arising in the compression procedure and increases resolution in both the spatial and temporal domains. To remove undesired artifacts, we divide the MPEG-4 video algorithm into two parts: an MPEG-2-like part and the new part. To remove artifacts caused by the first part, we adopt the conventional blocking-artifact algorithm developed for MPEG-2. To remove artifacts caused by the second part, we provide a new degradation model and propose the corresponding image restoration method. To increase the resolution of MPEG-4 images, we propose a general framework for multichannel image interpolation, which includes both spatial and temporal interpolation. As the MPEG-4 standard is under development, various sophisticated techniques are being considered, but research on image enhancement techniques remains relatively underappreciated. For this reason, additional image enhancement techniques will become a very important issue in the realization phase of MPEG-4.

Speech Enhancement for Voice commander in Car environment (차량환경에서 음성명령어기 사용을 위한 음성개선방법)

  • 백승권;한민수;남승현;이봉호;함영권
    • Journal of Broadcast Engineering / v.9 no.1 / pp.9-16 / 2004
  • In this paper, we present a speech enhancement method as a pre-processor for a voice commander in a car environment. For friendly and safe use of a voice commander in a moving car, non-stationary audio signals such as music and non-candidate speech should be reduced. Our technique is based on two microphones and consists of two parts: blind source separation (BSS) and Kalman filtering. First, BSS operates as a spatial filter to deal with non-stationary signals, and then car noise is reduced by Kalman filtering as a temporal filter. The algorithm's performance is tested on speech recognition, and the results show that our two-microphone technique can be a good candidate for a voice commander.

Nursing Home Environment with Positive Distraction for Reduction of Chronic Pain and Healing (만성통증의 경감과 치유를 위한 노인요양시설의 긍정적 관심 전환 환경)

  • Chung, Miryum
    • Korean Institute of Interior Design Journal / v.24 no.2 / pp.206-216 / 2015
  • The majority of seniors living in nursing homes suffer from persistent chronic pain, which may cause depression and a compromised quality of life if untreated. The environment should help them shift their focus from current pain and worries to positive feelings and the delight of life. The purpose of this research is to classify the healing environment elements for positive distraction and to analyze six international cases to assess the current situation. Based on a literature review of both the healing spaces and elderly care fields, the elements were categorized as follows: spatial elements (view, natural elements, artificial elements, exercise space, garden), psychological elements (grooming area, space for privacy, meal/drink area, elements for recollection, religious space), and social elements (common living area, activity/hobby room, family/visitor area, information area, local community program space). Analysis of the six facilities showed that each element was reflected in the design relatively well. New inventions from workers who consider distraction important were also introduced. A healing environment for positive distraction requires a delicate touch, derived from an understanding of the characteristics and situation of the elderly residents. Technology updates are also significant, from audio books to virtual reality devices, since cultural life in nursing homes is far behind what others enjoy now.

Study on Air Absorption Processing for Spatial Audio Rendering (공간음향 렌더링을 위한 공기흡음 처리에 관한 연구)

  • Daeyoung Jang;Yong Ju Lee;Jae-hyoun Yoo;Kyeongok Kang;Tae Jin Lee
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2022.11a / pp.18-21 / 2022
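  • This paper concerns the air-absorption attenuation effect, which is important for perceiving the distance of audio objects in six-degrees-of-freedom (6DoF) spatial audio rendering. It presents a method for solving the problem that the sources used for rendering already contain the air-absorption attenuation corresponding to the recording distance, that is, the distance between the on-site sound source and the acoustic sensor, so that their high-frequency components above 3 kHz are attenuated. In this method, a recording-distance parameter is included as metadata in the 6DoF spatial audio content, and at rendering time the source-to-listener distance used for applying air absorption is compensated by the recording distance. The air-absorption attenuation of the source is thereby corrected accurately, so that the timbre of the source is close to reality at every distance. In particular, for sources that can only be recorded from far away, such as airplanes, thunder, and explosions, air absorption over the recording distance has a considerable effect on timbre; applying the recording distance as proposed was confirmed to compensate for the unwanted high-frequency attenuation caused by the recording distance in the timbre rendered for a given source-to-listener distance.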
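
The compensation described in this abstract can be sketched as a change to the distance fed into the air-absorption model. The example below is a simplified illustration under stated assumptions: the abstract gives no formula, so subtracting the recording distance from the source-to-listener distance is one straightforward reading, the per-band absorption coefficients are made-up illustrative numbers rather than values from any standard, and the function and parameter names are hypothetical.

```python
# Hypothetical absorption coefficients in dB per metre, for illustration only.
ALPHA_DB_PER_M = {1000: 0.005, 4000: 0.03, 8000: 0.1}

def effective_distance(listener_distance_m, recording_distance_m):
    """Distance over which air absorption still has to be applied at render time.

    The recording already contains absorption over recording_distance_m,
    so that part is subtracted (assumed compensation rule).
    """
    return listener_distance_m - recording_distance_m

def air_absorption_gain_db(freq_hz, listener_distance_m, recording_distance_m=0.0):
    """Render-time gain in dB for one band; negative values mean attenuation."""
    d = effective_distance(listener_distance_m, recording_distance_m)
    return -ALPHA_DB_PER_M[freq_hz] * d

# A distant source recorded from 60 m, rendered for a listener 100 m away.
naive_db = air_absorption_gain_db(8000, 100.0)              # ignores recording distance
compensated_db = air_absorption_gain_db(8000, 100.0, 60.0)  # uses recording-distance metadata
print(f"8 kHz band: naive {naive_db:.1f} dB, compensated {compensated_db:.1f} dB")
```

In this sketch a listener closer than the recording distance yields a positive gain, that is, a high-frequency boost that undoes part of the absorption baked into the recording; whether a renderer should allow or limit such a boost is a design choice the abstract does not address.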

The Use of Graphic Novels for Developing Multiliteracies (그래픽노블을 통한 다중문식성의 발달)

  • Yun, Eunja
    • Journal of English Language & Literature / v.56 no.4 / pp.575-596 / 2010
  • The modes of narrative and communication have expanded due to social and cultural changes and technological development. Texts have thus become multimodal, and media hybridity and media crossover have been increasing as well. Multimodality requires new literacies, beyond existing traditional literacy approaches, to understand and interpret multimodal texts. The New London Group (2000) argues that multiliteracies are needed to serve today's changing multimodal texts. Kress (2003) also argues that visual texts have become prevalent, mingled with other modes of text such as the linguistic, audio, gestural, and spatial modes. Literary texts are no exception to this trend of multimodality. The recent renaissance of comics, and in particular the new light on graphic novels, can be interpreted in this historical vein. In contrast to comics, no consensus has been reached on the definition of graphic novels; however, many studies have recently been conducted to look into the potential of graphic novels for building multiliteracies. In this paper, the graphic novel as a literary genre is explored from a historical perspective, and a definition of graphic novels is attempted. In the light of multiliteracies, the paper presents cases that show how graphic novels can be utilized to build multiliteracies. Lastly, the use of graphic novels for English as a foreign language is introduced. The author hopes that, in the age of multimodality, the potential of graphic novels in language and literacy education will be taken into account by language teachers and students in expanding their territory of literacy.