Search | Korea Science

Listenable Explanation for Heatmap in Acoustic Scene Classification (음향 장면 분류에서 히트맵 청취 분석)

Suh, Sangwon;Park, Sooyoung;Jeong, Youngho;Lee, Taejin
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2020.07a / pp.727-731 / 2020
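Analyzing the causes behind an artificial neural network's predictions is necessary in order to trust the model. To this end, the computer vision field has proposed model interpretation methods that visualize, as saliency maps or heatmaps, what content the model based its prediction on. In the audio domain, however, visual interpretation on a spectrogram is not intuitive, and it is hard to understand which sounds the decision was actually based on. This study therefore proposes a listening-analysis system for heatmaps and uses it to run heatmap listening experiments on an acoustic scene classification model, checking whether it can provide human-understandable explanations of the network's predictions.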

The Next Song Recommendation Using Item Sequences in Music Usage Data (사용자 청취 로그의 음악 청취 순서를 이용한 다음 음악 추천)

Park, Sung-Eun;Lee, Dong-Joo;Lee, Sang-Keun;Lee, Sang-Goo
- Proceedings of the Korean Information Science Society Conference / 2011.06c / pp.41-44 / 2011
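This study addresses the problem of recommending the next item to use, based on the music the user has just listened to and the order in which it was played. The proposed model is built on item usage logs: it uses the N-gram model, widely used in information retrieval, to extract item sequences and then learns which items are likely to come next. It then recommends the item most likely to be listened to next, given the sequence of items the user has currently selected. Experiments on real-world data compare its performance with collaborative filtering.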
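The N-gram idea in this abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `train_ngram` and `recommend_next`, the session data, and the plain frequency counting without smoothing are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def train_ngram(sessions, n=2):
    """Count which item follows each (n-1)-item context in the listening logs."""
    if n < 2:
        raise ValueError("n must be at least 2")
    counts = defaultdict(Counter)
    for session in sessions:
        for i in range(len(session) - n + 1):
            context = tuple(session[i:i + n - 1])
            next_item = session[i + n - 1]
            counts[context][next_item] += 1
    return counts


def recommend_next(counts, recent, n=2, k=1):
    """Return up to k items seen most often after the last n-1 items of `recent`."""
    context = tuple(recent[-(n - 1):])
    followers = counts.get(context, Counter())
    return [item for item, _ in followers.most_common(k)]


# Hypothetical listening sessions; each list is one user's play order.
sessions = [
    ["A", "B", "C"],
    ["A", "B", "D"],
    ["B", "C", "A"],
    ["A", "B", "C"],
]
model = train_ngram(sessions, n=2)
print(recommend_next(model, ["A", "B"]))  # ['C']
```

With `n=2` only the last played item is used as context; a larger `n` conditions on a longer recent history at the cost of sparser counts.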

A Study on the Sweet-Spot Widening using 2-Channel Sound Transaural Filter (2채널 트랜스오럴 필터를 이용한 최적 청취영역 확대에 관한 연구)

Ahn Chan-Shik;Hwang Shin;Kim Soon-Hyob
- Proceedings of the Acoustical Society of Korea Conference / spring / pp.53-56 / 2002
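This paper concerns experiments and a system implementation that remove crosstalk in order to give the listener a more three-dimensional sound effect from two-channel loudspeakers, and that widen the sweet spot (optimal listening area) so that the listener can listen more freely. To remove the crosstalk, that is, the crossed paths from the two frontal loudspeakers, we used a transaural filter implemented with a free-field model that minimizes sound-quality distortion. To widen the sweet spot, the loudspeakers were configured with a BPF (Band Pass Filter) that separates low and high frequencies so that each can be reproduced separately; the low-frequency range was excluded and the mid-to-high-frequency range was used. Using the existing crosstalk cancellation system, measurements were taken while moving left and right from a fixed listening point in 5 cm steps up to 100 cm; at 30 cm, 55 cm, 75 cm, 90 cm, and 100 cm the channel separation, which indicates that crosstalk is cancelled, was 5 dB or more. Transaural filtering with the free-field model was applied at each of the points obtained in the experiment, and to prevent interference between them, sound quality was compensated in the frequency domain with psychoacoustically based 1/3-octave band-pass filters. Sound sources were produced, and sound quality was evaluated by comparing the source presented by the existing two-channel system with the source at each position; a comparative evaluation against the existing transaural filter was also carried out.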

A Study of Optimum Time-Spread Echo Audio Watermarking via Listening Test (청취실험에 의한 에코확산 오디오 워터마킹방법의 최적화에 관한 검토)

Ko Byeong-Seob
- Proceedings of the Acoustical Society of Korea Conference / autumn / pp.545-546 / 2004
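The subband-separation time-spread echo audio watermarking method splits the host signal into specific frequency bands and, using the MPEG psychoacoustic model, sets the power of the watermark embedded in each band through a parameter-setting function. The robustness and imperceptibility of the method are therefore governed by this parameter-setting function. Accordingly, this study conducted listening tests to examine how to configure the optimal parameter-setting function so as to achieve maximum robustness with minimum sound-quality degradation.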

Application of Rise/Fall/Connection(RFC) Model to Korean Intonation (RFC 모델의 한국어 억양 곡선에의 적용)

표경란
- Proceedings of the Acoustical Society of Korea Conference / 1998.08a / pp.214-217 / 1998
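As a basic study toward building a Korean intonation model for synthesized speech, we applied the RFC model to Korean intonation contours. An intonation contour is structured as a sequence of pitch accents and intonational-phrase boundary tones; the RFC model describes the shape of such a contour with rise elements and fall elements, each having its own amplitude and duration, together with connection elements. In this paper we modified the components of the RFC model to better reflect the characteristics of Korean intonation contours and compared the original and modified RFC models through a listening test. The results showed that, although the modified RFC model used about 13% fewer tone labels than the original, listeners perceived no auditory difference.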
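As a rough illustration of how a contour can be assembled from rise, fall, and connection elements that each carry an amplitude and a duration, the sketch below may help. It is not taken from the paper: the `Element` class, the S-shaped piecewise-quadratic curve for rises and falls, the straight-line connections, and all numeric values are assumptions, and the Korean-specific modifications the abstract mentions are not modeled.

```python
from dataclasses import dataclass


@dataclass
class Element:
    kind: str         # "rise", "fall", or "connection"
    amplitude: float  # F0 change in Hz across the element (negative for a fall)
    duration: float   # length in seconds


def element_offset(el, t):
    """F0 offset at time t (0 <= t <= duration), relative to the element's start."""
    x = t / el.duration
    if el.kind == "connection":
        return el.amplitude * x  # assumed: straight line
    # Assumed shape for rise/fall: smooth S-curve built from two quadratics.
    if x < 0.5:
        return el.amplitude * 2 * x * x
    return el.amplitude * (1 - 2 * (1 - x) ** 2)


def synthesize(elements, start_f0, step=0.01):
    """Sample the F0 contour (Hz) of consecutive elements, roughly every `step` s."""
    contour = []
    f0 = start_f0
    for el in elements:
        n = max(1, round(el.duration / step))
        for i in range(n):
            contour.append(f0 + element_offset(el, i * el.duration / n))
        f0 += el.amplitude  # the next element starts where this one ends
    contour.append(f0)
    return contour


# Hypothetical pitch accent: a 40 Hz rise, a 60 Hz fall, then a flat connection.
accent = [
    Element("rise", 40.0, 0.2),
    Element("fall", -60.0, 0.3),
    Element("connection", 0.0, 0.1),
]
contour = synthesize(accent, start_f0=120.0)
print(len(contour), contour[0], max(contour), contour[-1])
```

Because every element is described only by its type, amplitude, and duration, a contour reduces to a short list of labels, which is the quantity the abstract compares between the original and modified models.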

Modeling of Distance Localization by Using an Extended Auditory Parallax Model (확장된 음향적 시차 모델을 이용한 음상 거리정위의 모델화)

김해영
- The Journal of the Acoustical Society of Korea / v.23 no.1 / pp.30-39 / 2004
This study aims to establish a digital signal processing technique for controlling 3-D sound localization, focusing in particular on the role of the information provided by the Head-Related Transfer Function (HRTF). To clarify the cues that control auditory distance perception, two conventional models, the Hirsch-Tahara model and the auditory parallax model, were examined. Both models were shown to have limitations in explaining auditory distance perception universally. The auditory parallax model was therefore extended so that it applies to a broader range of cases of auditory distance perception. An experiment that simulated HRTFs based on the extended parallax model showed that the cues provided by the new model were almost sufficient to control the perception of auditory distance for an actual sound source located within about 2 m.

Listening environment design of houses for the seniors aging at home (고령자의 청력손실을 배려한 재가보호 주거시설의 음향설계)

Yu-Kyeong Jang;Yang-ki Oh
- The Journal of the Acoustical Society of Korea / v.43 no.2 / pp.152-161 / 2024
Although hearing loss in the elderly is one of the common symptoms of aging, as the aging population continues to grow, policies such as home care and welfare housing for the elderly are implemented with a focus on mobility, often overlooking the issue of hearing loss in the elderly. In this study, our aim is to enhance the quality of life for the elderly by improving the auditory environment within residential spaces, which plays a pivotal role in determining their overall well-being. We have proposed a technique that focuses on reducing reverberation, minimizing noise levels, and enhancing sound quality to improve the listening environment for the elderly, and we have verified its effectiveness. Building upon this, we have developed an acoustic design model for residential facilities catering to elderly home care.
https://doi.org/10.7776/ASK.2024.43.2.152

Analysis and Evaluation Simulation System for Whistle Sound Related Marine Casualty (기적음관련 해양사고 분석.평가 시뮬레이션 시스템 개발)

임정빈;김창경
- Proceedings of the Korean Institute of Navigation and Port Research Conference / 2004.04a / pp.61-67 / 2004
This paper describes the Three-Dimensional Listening Simulation System (3D-LSS), which analyzes whistle-sound-related marine casualties and evaluates accident situations using 3D sound based on the Head-Related Transfer Function. First, a three-dimensional listening model derived from the analysis of accident situations is proposed, and then methods for reproducing and evaluating 3D sound are discussed. The system is designed to explain accident situations and to simulate possible situations with GUI-based graphics and 3D sound reproduction. Evaluation experiments using 3D-LSS were carried out on six cases in which it was not known whether the whistle had actually been sounded and heard between the two vessels. In psychological assessments by five subjects, the six cases could be analyzed clearly through visual images and sound, which verifies the usability of 3D-LSS as a judgment-support system for marine casualties.

Development of Analysis and Evaluation Simulation System for Whistle Sound Related Marine Casualty (기적음관련 해양사고 분석·평가 시뮬레이션 시스템 개발)

Yim, Jeong-Bin;Kim, Chang-Kyoung
- Journal of Navigation and Port Research / v.28 no.8 / pp.659-666 / 2004
This paper describes the Three-Dimensional Listening Simulation System (3D-LSS), which analyzes whistle-sound-related marine casualties and evaluates accident situations using 3D sound based on the Head-Related Transfer Function. First, a three-dimensional listening model derived from the analysis of accident situations is proposed, and then methods for reproducing and evaluating 3D sound are discussed. The system is designed to explain accident situations and to simulate possible situations with GUI-based graphics and 3D sound reproduction. Evaluation experiments using 3D-LSS were carried out on six cases in which it was not known whether the whistle had actually been sounded and heard between the two vessels. In psychological assessments by five subjects, the six cases could be analyzed clearly through visual images and sound, which verifies the usability of 3D-LSS as a judgment-support system for marine casualties.
https://doi.org/10.5394/KINPR.2004.28.8.659

Audio Listening Enhancement in Adverse Environment based on Loudness Restoration (라우드니스 복원에 기반한 잡음 환경에서의 오디오 청취 향상)

Pak, Junhyeong;Shin, Jong Won
- Journal of the Institute of Electronics and Information Engineers / v.50 no.12 / pp.210-216 / 2013
It is hard to listen to music clearly in the presence of background noise. This paper proposes a method that automatically modifies the audio signal to enhance the listening experience in adverse environments. Specifically, the method amplifies the audio signal so that its perceived loudness in each band becomes similar to that of the noiseless signal. The loudness perception model proposed by Moore et al. is used. Extending previous work on speech reinforcement, the full-band signal sampled at 48 kHz is manipulated according to the loudness restoration principle. Moreover, based on the observation that audio clarity is compromised even with the loudness-restored signal, a modification that intentionally boosts high-frequency loudness more than that of the lower bands is also proposed. Experimental results showed that the proposed algorithm can enhance the audio listening experience in adverse environments.
https://doi.org/10.5573/ieek.2013.50.12.210
