• Title/Summary/Keyword: Audio Effect

The Performance Analysis of On-line Audio Genre Classification (온라인 오디오 장르 분류의 성능 분석)

  • Yun, Ho-Won;Jang, Woo-Jin;Shin, Seong-Hyeon;Park, Ho-Chong
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2016.11a / pp.23-24 / 2016
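  • This paper compares and analyzes the performance of online audio genre classification. For online operation, the audio signal is input in 1-second units and each unit is classified as one of three genres: music, speech, or effect. GMM and deep neural networks are used as the learning methods, and four kinds of feature vectors, including MFCC and the spectrogram, are used as features. By comparing the performance of each combination, the learning method and feature vector best suited to genre classification are identified.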

Design of Emergency Evacuation Guiding System with Serially Connected Multi-channel Speakers (직렬 스피커 연결을 이용한 비상 대피 유도 시스템의 설계)

  • Chung, Han-Vit;Kim, Tea-Wan;Chung, Yun-Mo
    • Journal of the Institute of Electronics Engineers of Korea SP / v.48 no.4 / pp.142-152 / 2011
  • Existing emergency evacuation guidance systems generally rely on visual cues such as emergency lights or LEDs. In a fire, however, smoke may block people's view. To address this problem, this paper introduces a design for an emergency guidance system that uses directional sound. All speakers are connected in series so that the audio signal is transmitted serially, which simplifies speaker installation. The Floyd algorithm is used to find the shortest evacuation paths. Because serially connected multi-channel speakers are vulnerable to disconnection, the paper also applies a technique for diagnosing such faults. In the proposed system, a PC communicating over USB is used for control and monitoring. The reported benefits include a higher evacuation rate under emergency conditions, as well as easier maintenance and lower installation cost owing to serial transmission of the audio signal.
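
The shortest-path step mentioned in the abstract above can be illustrated with a minimal sketch of the Floyd (Floyd-Warshall) all-pairs algorithm. The floor plan, node numbering, and edge lengths below are hypothetical and are not taken from the paper; the sketch only shows how a shortest route from a speaker location to an exit might be computed.

```python
import math


def floyd_warshall(n, edges):
    """Return (dist, nxt) for an undirected weighted graph with n nodes.

    dist[i][j] is the shortest distance from i to j;
    nxt[i][j] is the next node to visit on that shortest route.
    """
    dist = [[math.inf] * n for _ in range(n)]
    nxt = [[None] * n for _ in range(n)]
    for i in range(n):
        dist[i][i] = 0.0
        nxt[i][i] = i
    for u, v, w in edges:
        if w < dist[u][v]:
            dist[u][v] = dist[v][u] = w
            nxt[u][v] = v
            nxt[v][u] = u
    # Allow each node k in turn to act as an intermediate stop.
    for k in range(n):
        for i in range(n):
            for j in range(n):
                if dist[i][k] + dist[k][j] < dist[i][j]:
                    dist[i][j] = dist[i][k] + dist[k][j]
                    nxt[i][j] = nxt[i][k]
    return dist, nxt


def route(nxt, src, dst):
    """Rebuild the node sequence from src to dst using the next-hop table."""
    if nxt[src][dst] is None:
        return []  # dst is unreachable from src
    nodes = [src]
    while src != dst:
        src = nxt[src][dst]
        nodes.append(src)
    return nodes


# Hypothetical layout: nodes 0-3 are speaker locations, node 4 is an exit.
# Each tuple is (node_a, node_b, corridor_length_in_metres).
EDGES = [(0, 1, 5.0), (1, 2, 4.0), (2, 4, 6.0), (1, 3, 2.0), (3, 4, 3.0)]
DIST, NXT = floyd_warshall(5, EDGES)
ROUTE = route(NXT, 0, 4)
```

In this made-up layout the route from node 0 to the exit goes through nodes 1 and 3. A real system would presumably also remove or re-weight edges blocked by fire, which this sketch does not model.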

Visual Image Effects on Sound Localization in Peripheral Region under Dynamic Multimedia Conditions

  • Kono, Yoshinori;Hasegawa, Hiroshi;Ayama, Miyoshi;Kasuga, Masao;Matsumoto, Shuichi;Koike, Atsushi;Takagi, Koichi
    • Proceedings of the IEEK Conference / 2002.07a / pp.702-705 / 2002
  • This paper describes how visual information influences sound localization in the peripheral visual field under dynamic conditions. Presentation experiments with an audio-visual stimulus were carried out using a movie of a moving patrol car and its siren sound. The following results were obtained: first, the sound image at the beginning of the presentation was captured more strongly by the visual image than that at the end, i.e., a "beginning effect" occurred; second, in the peripheral regions, the "beginning effect" appeared strongly near the fixation point of the eyes.

An Expansion of Sound Image Using Phase Shifting of Low Frequency and Reflected Sound of High Frequency in Television (저역 위상 처리와 고역 반사음을 이용한 텔레비전에서의 음상 확장)

  • 김동수;박해광
    • Proceedings of the IEEK Conference / 1998.10a / pp.1235-1238 / 1998
  • In a television stereo system, it is difficult to produce a sound image with a spatial impression because of the narrow distance between the two speakers. A method of widening the sound image using the precedence effect has been introduced, but it does not work effectively in the low-frequency band. In this paper, we propose a new method that produces an expanded sound image over the full audio frequency band: the lower-frequency band is expanded by phase shifting, and the higher-frequency band is expanded by reflection. In simulation and experiment, the proposed system provides a useful sound image expansion effect in a television stereo system.

A Study on Center Speaker in Television Receiver with Sound Image Expansion (음상 확장 기능을 갖는 텔레비전 수상기에서 센터 스피커에 관한 연구)

  • 이상훈;김동수
    • Proceedings of the IEEK Conference / 1998.10a / pp.1231-1234 / 1998
  • Many signal processing methods for widening the sound image to create a spatial impression have been studied. The most typical methods rely on phase shifting and the precedence effect. However, these methods are not effective for the center sound image. As the listener moves from the center toward the outside, the center sound image shifts toward the speaker. That is, the directional localization of the center sound image is unstable. In this paper, we propose a television audio system that includes a center speaker, and analyze the role of the center speaker using the theory of Makida and the precedence effect. In experiments, we confirm that the center speaker helps stabilize the center sound image.

Realtime Stereo Sound Image Expansion System Using Haas Effect & Phase Shifting (선착효과 및 위상처리를 이용한 실시간 스테레오 음상 확장 시스템 구현)

  • 이종철;이상훈
    • Proceedings of the IEEK Conference / 1998.10a / pp.1227-1230 / 1998
  • Phase control methods are generally used to expand the sound image in AV systems. However, these methods are effective only for signals below 1 kHz, and the listener must be located at the front center of the speaker system. In this paper, we implement a real-time processing system in which phase shifting is dominant at low frequencies and the precedence effect is dominant at high frequencies. Two sound cards are used to process 16-bit stereo audio at a 44.1 kHz sampling frequency in real time, and an analog circuit is designed to perform the phase shifting. Experiments confirm the usefulness of the proposed stereo system.
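
As a rough illustration of the precedence (Haas) effect referred to in the abstract above, the sketch below delays one channel of a stereo signal by a few milliseconds at the 44.1 kHz sampling rate the abstract mentions. The function name `haas_delay`, the 10 ms default, and the choice to delay the right channel are assumptions made for this example. This is not the authors' system, which also splits the band and handles the low frequencies with an analog phase shifter.

```python
def haas_delay(left, right, sample_rate=44100, delay_ms=10.0):
    """Delay the right channel by delay_ms relative to the left channel.

    Both inputs are sequences of float samples. The outputs are padded
    with silence so that the two channels keep the same length.
    """
    delay = int(round(sample_rate * delay_ms / 1000.0))  # delay in samples
    delayed_right = [0.0] * delay + list(right)
    padded_left = list(left) + [0.0] * delay
    return padded_left, delayed_right


# Tiny hypothetical input: two samples per channel.
LEFT_OUT, RIGHT_OUT = haas_delay([1.0, 0.5], [1.0, 0.5])
```

At 44.1 kHz a 10 ms delay corresponds to 441 samples, so the first right-channel sample appears at index 441 of the output.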

Audio genre classification using deep learning (딥 러닝을 이용한 오디오 장르 분류)

  • Shin, Seong-Hyeon;Jang, Woo-Jin;Yun, Ho-won;Park, Ho-Chong
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2016.06a / pp.80-81 / 2016
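  • This paper proposes an audio genre classification technique using deep learning. Three genres are defined for classification: music, speech, and effect. Existing GMM-based genre classification has lower recognition rates for music and effect than for speech, so recognition rates differ across genres. To solve this problem, this paper uses deep learning to perform more fine-grained training through a high-level abstraction process. The proposed method learns even subtle differences in features, which reduces the gap in recognition rates between genres and yields a higher recognition rate for each genre than GMM-based audio genre classification.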

Auditory and Visual Information Effect on the Loudness of Noise (시각 및 청각 정보가 소음의 인지도에 미치는 영향)

  • Shin, Hoon;Park, Sa-Gun;Song, Min-Jeong;Jang, Gil-Soo
    • KIEAE Journal / v.6 no.4 / pp.69-76 / 2006
  • The effects of additional visual and auditory stimuli on the loudness evaluation of road traffic noise were investigated by the method of magnitude estimation. The results show that the additional visual stimulus of a noise barrier can influence the loudness perception of road traffic noise. Additional auditory stimuli such as green music or the sound of flowing water can also influence it, lowering perceived loudness by approximately 5~10% compared with the absence of stimuli. However, this effect disappeared at levels above 65 dB(A).

Analysis on Protection Ratio of IBAC DAB System for Co-Channel FM Interferer (동일채널 FM 간섭원에 대한 IBAC DAB 시스템의 혼신 보호비 분석)

  • Jeong, Young-Ho;Park, So-Ra;Kim, Geon;Lee, Hyun;Lee, Soo-In
    • Journal of Broadcast Engineering / v.5 no.2 / pp.199-210 / 2000
  • The IBAC (In-Band Adjacent-Channel) DAB (Digital Audio Broadcasting) system is intended to provide multichannel CD-quality audio services and multimedia data services, including text and pictures, in the FM band (88~105 MHz). Because the FM band is already used by existing analog radio broadcasting, the interference between IBAC DAB and analog FM signals must be analyzed. The protection ratio should therefore be evaluated to verify system compatibility and to allocate new IBAC DAB channels in the FM band. In this paper, among the three types of interference that can occur (FM-to-DAB, DAB-to-FM, and DAB-to-DAB), the protection ratio of the IBAC DAB system for a co-channel FM interferer is analyzed by modeling the FM interferer and considering a multipath fading channel. The simulation results show that the IBAC DAB system has far better sensitivity than Eureka 147 and needs a relatively high protection ratio for a co-channel FM interferer because of its narrow bandwidth, about one third of that of Eureka 147.

A Study of the spatial perception by audio-visual information (시각과 청각에 의한 공간적 지각에 관한 연구)

  • Lee, Chai-Bong;Kang, Dae-Gee
    • Journal of the Institute of Convergence Signal Processing / v.11 no.2 / pp.132-136 / 2010
  • A psychophysical experiment was performed to investigate how audio-visual spatial disparity affects perceptual space in peripheral vision. In the experiment, participants were exposed to a visual stimulus and a sound stimulus presented simultaneously from different directions. The visual stimulus was implemented with 7 white LEDs located at an equal distance at 7 different angles of $-70^{\circ}$, $-40^{\circ}$, $-20^{\circ}$, $0^{\circ}$, $20^{\circ}$, $40^{\circ}$, and $70^{\circ}$ from directly in front. The auditory stimuli were implemented with loudspeakers placed at 9 different directions, equally spaced by $5^{\circ}$, ranging from $-20^{\circ}$ to $20^{\circ}$. Each participant then evaluated the spatial disparity between the visual and auditory stimuli on 5 response levels, where a higher level indicates a larger gap. When the visual stimulus is applied from the right, the results show that the response level gets higher for a larger angle between the visual and auditory stimuli. A similar tendency was also observed for the visual stimulus at the $0^{\circ}$ orientation. On the other hand, when the visual stimulus is applied from the left, the response level gets lower for a larger angle.