Search | Korea Science

Performance analysis of acoustic event detection algorithm using weakly labeled data (Weakly labeled 데이터 기반 음향 이벤트 인식 알고리즘 성능 분석)

Lim, Wootaek;Suh, Sangwon;Park, Sooyoung;Jeong, Youngho;Lee, Taejin
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2019.06a / pp.160-162 / 2019
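Acoustic event detection is a technology for predicting acoustic events from audio signals. It has become an active research area in recent years, driven by the release of large-scale databases, advances in detection algorithms and hardware, and related competitions. This paper describes the DCASE Challenge, a competition on acoustic scene and event detection, and explains DCASE Challenge Task 4, in which systems are trained on weakly labeled data to predict strong labels. It also analyzes acoustic event detection performance by comparing the various algorithms submitted to DCASE Challenge Task 4 and the results obtained with different types of databases.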

Implementation of a Person Tracking Based Multi-channel Audio Panning System for Multi-view Broadcasting Services (다시점 방송 서비스를 위한 사용자 위치추적 기반 다채널 오디오 패닝 시스템 구현)

Kim, Yong-Guk;Yang, Jong-Yeol;Lee, Young-Han;Kim, Hong-Kook
- Proceedings of the HCI Society of Korea Conference (한국HCI학회:학술대회논문집) / 2009.02a / pp.150-157 / 2009
In this paper, we propose a person-tracking-based multi-channel audio panning system for multi-view broadcasting services. Multi-view broadcasting renders video sequences captured by a set of cameras from different viewpoints, and audio rendering for these services requires multi-channel audio panning. Applying such realistic audio to a multi-view broadcasting service also requires person tracking to estimate the positions of users. The proposed system therefore consists of two parts. The first part is a person tracking method that uses ultrasonic satellites and a receiver; it provides user coordinates with a resolution of about 10 mm within a short time of about 150 ms. The second part is an MPEG Surround parameter-based multi-channel audio panning method, which obtains panned multi-channel audio by controlling the MPEG Surround spatial parameters. A MUSHRA test is conducted to evaluate perceptual quality, and localization performance is measured using a dummy head. The experiments show that the proposed method provides better perceptual quality and localization performance than the conventional parameter-based audio panning method. In addition, we implement a prototype of a person-tracking-based multi-view broadcasting system by integrating the proposed methods with a multi-view display system.

Status of Audio Standardization for Realistic Broadcasting (실감방송을 위한 오디오 표준화 현황)

Seo, Jeong-Il;Yu, Jae-Hyeon;Gang, Gyeong-Ok;Jang, Se-Jin
- Broadcasting and Media Magazine / v.19 no.1 / pp.37-47 / 2014

A Study on Realistic Sound Reproduction for UHDTV (UHDTV를 위한 실감 오디오 재현 기술)

Jang, Daeyoung;Seo, Jeongil;Lee, Yong Ju;Yoo, Jae-Hyoun;Park, Taejin;Lee, Taejin
- Journal of Broadcast Engineering / v.20 no.1 / pp.68-81 / 2015
Owing to recent developments in component and media processing technologies, UHDTV, the successor to HDTV, is expected to be realized soon. Accordingly, audio technology that currently provides 5.1-channel surround sound in the home must consider what services it should offer in the UHDTV era. In practice, however, the 5.1-channel audio market is struggling because installing and maintaining multiple loudspeakers in a home is difficult. Meanwhile, the cinema sound market, which long used the 5.1- and 7.1-channel formats, has changed as Dolby ATMOS, IOSONO, AURO3D, and others have launched one after another, introducing hybrid audio technologies that include ceiling channels and object-based sounds. Object-based audio technology is certain to enter the home theater and broadcast audio markets, and this change is expected to bring technological advances and market growth to a channel-based audio market that lacks flexibility. In this paper, we investigate a suitable realistic audio solution for UHDTV, introduce the hybrid audio technologies expected to serve as audio technology for UHDTV, describe the hybrid audio content format and methods for reproducing it in the home, and consider the future prospects of realistic audio.
https://doi.org/10.5909/JBE.2015.20.1.68

A Sound Externalization Method for Realistic Audio Rendering in a Headphone Listening Environment (헤드폰 청취환경에서의 실감 오디오 재현을 위한 음상 외재화 기법)

Kim, Yong-Guk;Chun, Chan-Jun;Kim, Hong-Kook;Lee, Yong-Ju;Jang, Dae-Young;Kang, Kyeong-Ok
- Journal of the Institute of Electronics Engineers of Korea SP / v.47 no.5 / pp.1-8 / 2010
In this paper, a sound externalization method is proposed for out-of-the-head localization in a headphone listening environment. To reduce the timbre distortion caused by conventional methods that use a measured head-related transfer function (HRTF) or early reflections, the proposed method integrates a model-based HRTF with reverberation. In addition, techniques such as decorrelation and spectral notch filtering are included to improve frontal externalization performance. To evaluate the proposed method, subjective listening tests are conducted using different types of sound sources such as white noise, sound effects, speech, and music. The test results show that the proposed method localizes sound sources farther outside the head than the conventional method.

A Design of real sound service based-on object in Smart mobile devices (스마트 모바일 기기에서의 객체 기반 실감 음원 서비스 구현)

Jung, Jong-Jin;Lim, Tae-Beom;Lee, Seok-Pil
- Proceedings of the Korea Information Processing Society Conference / 2011.04a / pp.685-688 / 2011
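The future multimedia device market will be led not by today's simple converged digital devices but by human-friendly, intelligent multimedia devices that can be controlled according to the user's emotions and preferences. The role of IT is already evolving from "communication of information" to "communication of emotion," and in the future, technology capable of conveying even feelings as digital signals will be developed. If realistic audio that conveys human emotion, the surrounding atmosphere, and fine spatial information is developed, its infrastructure is built, and it is applied to all multimedia products, users will be able to enjoy multimedia with a greater sense of presence. With the recent spread of smartphones, web and app-based applications offering a variety of music services are increasing. In this paper, we implement a service on Android-based smart mobile devices that provides the listener with a variety of audio information and lets the listener use it to play back and control audio in various ways, so that the listener does not merely listen passively but can enjoy audio according to personal taste.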
https://doi.org/10.3745/PKIPS.y2011m04a.685

An Architecture for 3D Audio Core Algorithm Evaluation DB (3차원 입체 음향 핵심 알고리즘 평가를 위한 DB 설계)

Hwang, Jaemin;Kim, Jeonghyuk;Kang, Sanggil
- Journal of Information Technology and Architecture / v.11 no.2 / pp.225-233 / 2014
This paper presents an architecture for a 3D audio core algorithm evaluation database system. As 3D audio systems spread through multimedia devices, an evaluation system is required for assessing the core algorithms used to develop them. Conventional evaluation systems have several problems: researchers have to learn how to use the evaluation system, and using and searching audio sources is inefficient because the sources are generally not indexed. To solve these problems, we design the architecture of a 3D audio core algorithm evaluation database system that evaluates core algorithms automatically using a database management system. We also define an XML metadata schema describing the audio sources stored in the database. This approach improves the efficiency of searching for audio sources and of using the audio database.

Yoga of Consilience through Immersive Sound Experience (실감음향 체험을 통한 통섭의 요가)

Hyon, Jinoh
- Journal of Broadcast Engineering / v.26 no.5 / pp.643-651 / 2021
Most people acquire information visually. The screens of computers, smartphones, and other devices constantly stimulate people's eyes, increasing fatigue. Against this social backdrop, the realistic and rich sound of 21st-century state-of-the-art sound systems can affect people's bodies and minds in various ways. Through sound, human beings are given space to calm and observe themselves. The purpose of this paper is to introduce immersive yoga training based on 3D sound, conducted jointly by ALgruppe & Rory's PranaLab, and to promote understanding of immersive audio systems. As a result, people who experienced immersive yoga not only enjoyed the effect of the sound but also received a powerful energy that gave them a sense of inner self-awareness. This responds to the multidisciplinary exchange demanded by modern knowledge society and, at the same time, points to the possibility of new cultural content.
https://doi.org/10.5909/JBE.2021.26.5.643

Next-generation loudspeaker layout for Ultra High Definition (UHD) Digital TV (초고선명 디지털 TV 를 위한 차세대 라우드스피커 레이아웃)

Lee, Young Woo;Kim, Sunmin
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2011.07a / pp.57-60 / 2011
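In this paper, to derive the optimal loudspeaker layout of a next-generation multichannel sound system for ultra-high-definition digital TV, we conducted subjective evaluations of perceived audio quality under various loudspeaker arrangements. We compared the NHK 22.2-channel system, the 7.1-channel system of the ITU-R BS.775-2 standard, and various new layout configurations that focus on the Top Layer loudspeakers, which play the most important role in realistic sound. The subjective evaluations used studio-mixed content and content generated by converting B-format recordings to multichannel. The results showed that a 10.2-channel loudspeaker layout with three loudspeakers in the Top Layer is difficult to distinguish from the NHK 22.2-channel system in the overall audio quality grade used in the evaluation.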

Sound Localization Change Research Using a Headphone (헤드폰을 이용한 음상 정위 변화 연구)

Park, Yoon Jung;Jang, Dalwon;Shin, Saim;Lee, JongSeol;Jang, Sei-Jin
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2015.07a / pp.153-154 / 2015
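With recent advances in video and audio technology, realistic audio technology is increasingly in demand, and the supply of and demand for connected audio grow every year. In this paper, a head-related transfer function (HRTF) is applied to change the sound image localization of the ordinary stereo and mono signals delivered to a user through headphones, and artificial reverberation is used to create a sense of space. For the experiment, a simulation was implemented in MATLAB, and the user enters sound image positions through a MATLAB GUI. This demonstrated that the sound image localization changes along the path defined by the order of the positions the user entered.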
