• Title/Summary/Keyword: Image and sound

Perceptual Localization of a phantom sound image for Ultrahigh-Definition TV (UHD TV를 위한 가상 음상의 인지 위치)

  • Lee, Young-Woo;Kim, Sun-Min
    • Journal of the Institute of Electronics Engineers of Korea SP / v.47 no.5 / pp.9-17 / 2010
  • This paper investigates the perceived localization of a phantom sound image for ultrahigh-definition TV under various loudspeaker configurations: two-horizontal, two-vertical, and triplet loudspeakers. A vector base amplitude panning (VBAP) algorithm, modified for non-equidistant loudspeaker setups, is applied to create the phantom sound image. To study localization performance under realistic conditions, the listening tests were conducted at on-axis and off-axis positions relative to the TV in a normal listening room. A method of adjustment, which can reduce the ambiguity of a perceived angle, is used to evaluate the angles of octave-band signals. The subjects changed the panning angle until the real sound source and the virtually panned source coincided. Spatial blurring can be measured by examining the differences between the panning angles perceived for each band. The listening tests show that the triplet panning method performs better than vertical panning in terms of perceptual localization and spatial blurring at both on-axis and off-axis positions.
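
As an illustration of the panning step mentioned in this abstract, the following is a minimal sketch of textbook two-loudspeaker (2D) vector base amplitude panning. It does not reproduce the paper's modification for non-equidistant loudspeakers, which the abstract does not describe; the function name `vbap_2d_gains` and the angle convention (degrees, measured from the listener's front) are assumptions made for this example.

```python
import math

def vbap_2d_gains(source_deg, spk1_deg, spk2_deg):
    """Return power-normalized gains (g1, g2) for one loudspeaker pair.

    Assumption: all angles are in degrees in the same plane, and both
    loudspeakers are at the same distance (plain VBAP, no distance correction).
    """
    # Unit vectors pointing at the virtual source and at each loudspeaker.
    p = (math.cos(math.radians(source_deg)), math.sin(math.radians(source_deg)))
    l1 = (math.cos(math.radians(spk1_deg)), math.sin(math.radians(spk1_deg)))
    l2 = (math.cos(math.radians(spk2_deg)), math.sin(math.radians(spk2_deg)))

    # Solve p = g1*l1 + g2*l2 with Cramer's rule.
    det = l1[0] * l2[1] - l1[1] * l2[0]
    if abs(det) < 1e-12:
        raise ValueError("loudspeaker directions must not be collinear")
    g1 = (p[0] * l2[1] - p[1] * l2[0]) / det
    g2 = (l1[0] * p[1] - l1[1] * p[0]) / det

    # Normalize so that g1^2 + g2^2 = 1 (constant total power).
    norm = math.hypot(g1, g2)
    return g1 / norm, g2 / norm

# A source midway between loudspeakers at +30 and -30 degrees gets equal gains.
g1, g2 = vbap_2d_gains(0.0, 30.0, -30.0)
```

A triplet configuration would use three loudspeaker vectors and a 3x3 system in the same way.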

Annoyance and sportiness perception of the acceleration sound by the driver and passengers (가속 사운드에 대한 운전자와 탑승객의 성가심과 스포티함 지각)

  • Kim, Seonghyeon;Altinsoy, M. Ercan
    • The Journal of the Acoustical Society of Korea / v.40 no.6 / pp.566-570 / 2021
  • This study examines the perceptual difference between the driver and passengers for the acceleration sounds of a sporty sedan. Subjective evaluations revealed a significant difference in annoyance and sportiness perception depending on the acceleration sound level. A multimodal reproduction system, which can reproduce the driving image, motion, vibration, and sound, was used for the test. A subjective experiment was conducted to evaluate the perceived intensity of annoyance and sportiness while varying the acceleration sound level in five steps of 3 dB. The experimental results showed that the driver perceives the acceleration sound as less annoying than the passenger does at a relatively low sound level, and as sportier than the passenger does at a relatively high sound level. Moreover, passengers were found to be 35 % less sensitive to annoyance than drivers, whereas drivers were 74 % more susceptible to sportiness than passengers as the sound level changed. This finding is expected to inform sound design strategies that differentiate the acceleration sound level in active sound design.

A Study on Visual and Auditory Inducement of VR Image Contents and the Inducement Components for Immersion Improvement (몰입감 향상을 위한 VR 영상 콘텐츠의 시청각 유도와 구성요소에 관한 연구)

  • Lee, Lang-Goo;Chung, Jean-Hun
    • Journal of Digital Convergence / v.14 no.11 / pp.495-500 / 2016
  • Since 2016, the VR market has been growing rapidly. The most critical emerging issue in the VR market is VR content, because production techniques and a variety of VR content must be developed to satisfy users' needs for immersion and interaction as fully as possible. This study therefore focused on VR image content, examined domestic and foreign cases for the components of visual and auditory inducement that maintain and improve immersion, and thereby sought an appropriate direction for visual and auditory inducement. As a result, the visual and auditory components of inducement were found to be photographing, editing, lighting, stitching, graphics, effects, voice actors' narration, dubbing, character voice, background sound, and sound effects; the technical and content components were found to be photographing technique, editing technique, lighting, stitching, graphics and effects, sound and sound effects, theatrical direction based on mise-en-scène, characters' lines and narration, and movements of characters and objects. For VR image content, not only visual and auditory components but also technical and content components are necessary to improve immersion. Further research on these components will be needed.

Algorithm for Discrimination of Brown Rice Kernels Using Machine Vision

  • C.S. Hwang;Noh, S.H.;Lee, J.W.
    • Proceedings of the Korean Society for Agricultural Machinery Conference / 1996.06c / pp.823-833 / 1996
  • The ultimate purpose of this study is to develop an automatic brown rice quality inspection system using image processing techniques. The emphasis was on developing an algorithm for discriminating brown rice kernels by their external quality, using a color image processing system equipped with an adaptor for magnifying the input image and optical fiber for oblique illumination. First, geometrical and optical features of sample images were analyzed for unhulled paddy and various brown rice kernel samples: sound, cracked, green-transparent, green-opaque, colored, white-opaque, and broken kernels. Second, an algorithm for discriminating rice kernels in a static state was developed on the basis of the geometrical and optical parameters screened by statistical analysis (STEPWISE and DISCRIM procedures, SAS ver. 6). The algorithm discriminated brown rice samples with an accuracy of 90% to 96% for sound, cracked, colored, broken, and unhulled kernels, about 81% for green-transparent and white-opaque kernels, and about 75% for green-opaque kernels. The total computing time required for classification was about 100 seconds per 1000 kernels on a PC with an 80486-DX2 66 MHz processor.

A Study on Enhancement of 3D Sound Using Improved HRTFS (개선된 머리전달함수를 이용한 3차원 입체음향 성능 개선 연구)

  • Koo, Kyo-Sik;Cha, Hyung-Tai
    • The Journal of the Acoustical Society of Korea / v.28 no.6 / pp.557-565 / 2009
  • To perceive the direction and distance of a sound, we rely on several kinds of information. The head-related transfer function (HRTF) describes how sound travels from a source to the listener's ears, including differences in level, phase, and frequency spectrum. For reproduction systems using 2 channels, HRTFs are applied in many algorithms that create 3D sound. However, this makes it difficult to localize a sound source in certain regions, known as the cone of confusion. In this paper, we propose a new algorithm to reduce the confusion in sound image localization. Differences in the frequency spectrum and psychoacoustic theory are used to boost the spectral cues that distinguish one direction from another. To confirm the performance of the algorithm, informal listening tests were carried out. As a result, improved 3D sound can be produced in a 2-channel headphone-based system. The sound quality of the improved 3D sound is also much better than that of conventional methods.
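
The cone of confusion mentioned in this abstract can be illustrated with the classic Woodworth rigid-sphere approximation of the interaural time difference. This is a minimal sketch, not the paper's algorithm; the head radius of 0.0875 m, the speed of sound of 343 m/s, and the function name `woodworth_itd` are assumptions chosen for the example.

```python
import math

HEAD_RADIUS_M = 0.0875      # assumed average head radius
SPEED_OF_SOUND_M_S = 343.0  # assumed speed of sound in air

def woodworth_itd(azimuth_deg):
    """Approximate ITD in seconds for a horizontal-plane source.

    Assumption: azimuth is 0 at the front, 90 at the right, 180 at the back.
    """
    # Only the lateral angle matters in this model, so front and back
    # directions that mirror each other collapse onto the same value.
    s = max(-1.0, min(1.0, math.sin(math.radians(azimuth_deg))))
    lateral = math.asin(s)
    return (HEAD_RADIUS_M / SPEED_OF_SOUND_M_S) * (lateral + math.sin(lateral))

front = woodworth_itd(30.0)   # source 30 degrees to the front-right
back = woodworth_itd(150.0)   # mirrored source to the back-right
print(front, back)            # the two ITDs agree, so ITD alone cannot separate them
```

Because the time cue is identical for the mirrored directions, spectral cues such as those the abstract proposes to boost are needed to resolve the ambiguity.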

Improving a Sound Localization Using 1/3-octave Band Pass Filter (1/3-옥타브 대역통과필터를 이용한 음상정위기법 성능 향상)

  • Hwang, Shin;Yang, Jin-Woo;Cheung, Wan-Sup;Kim, Soon-Hyob
    • The Journal of the Acoustical Society of Korea / v.20 no.3 / pp.98-103 / 2001
  • The human binaural auditory system can distinguish the direction and distance of sound sources. This capability is well characterised in terms of the inter-aural intensity difference (IID), the inter-aural time difference (ITD), and/or the spectral shape difference (SSD) arising from the acoustic transfer of a sound source to the outer ears. This paper proposes an effective way of extracting these three sound perception factors (IID, ITD, SSD) from the head-related transfer functions (HRTFs), which depend on the direction and distance of the acoustic source from the listener. It includes a method for estimating the equivalent ITD and 1/3-octave band-based IID factors and describes their use in locating a sound source in space. Subjective and objective tests were carried out to examine the effectiveness of the proposed methodology and its applicability to real sound systems. The experimental results are presented in this paper.
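
To make the ITD and IID factors in this abstract concrete, here is a minimal sketch that estimates a broadband ITD (from the cross-correlation peak) and a broadband IID (from the energy ratio) of a left/right impulse-response pair. It is a simplification: the paper's equivalent-ITD estimate and its 1/3-octave band-based IID are not reproduced, and the function name `estimate_itd_iid` and the toy impulse responses are assumptions.

```python
import math

def estimate_itd_iid(h_left, h_right, sample_rate_hz):
    """Return (itd_seconds, iid_db) for two equal-length impulse responses.

    A positive ITD means the right-ear response lags the left-ear response.
    A positive IID means the left-ear response carries more energy.
    """
    if len(h_left) != len(h_right):
        raise ValueError("impulse responses must have equal length")
    n = len(h_left)

    # Brute-force cross-correlation; keep the lag with the largest value.
    best_lag, best_val = 0, -math.inf
    for lag in range(-(n - 1), n):
        acc = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                acc += h_left[i] * h_right[j]
        if acc > best_val:
            best_lag, best_val = lag, acc

    energy_left = sum(x * x for x in h_left)
    energy_right = sum(x * x for x in h_right)
    iid_db = 10.0 * math.log10(energy_left / energy_right)
    return best_lag / sample_rate_hz, iid_db

# Toy example: the right ear receives a quieter copy three samples later.
left = [0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
right = [0.0, 0.0, 0.0, 0.0, 0.5, 0.0, 0.0, 0.0]
itd, iid = estimate_itd_iid(left, right, 48000)
```

A band-based IID would apply the same energy ratio after filtering both responses into each 1/3-octave band.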

Synthesis of 3D Sound Movement by Embedded DSP

  • Komata, Shinya;Sakamoto, Noriaki;Kobayashi, Wataru;Onoye, Takao;Shirakawa, Isao
    • Proceedings of the IEEK Conference / 2002.07a / pp.117-120 / 2002
  • A single-DSP implementation of 3D sound movement is described. Using a real-time 3D acoustic image localization algorithm, an efficient approach is devised for synthesizing 3D sound movement by interpolating only two parameters, "delay" and "gain". Based on this algorithm, real-time 3D sound synthesis is performed by a commercially available 16-bit fixed-point DSP with a computational load of 65 MIPS and a memory space of 9.6k words, which demonstrates that the algorithm can be used even for mobile applications.
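
The idea of moving a source by interpolating only a delay and a gain can be sketched as follows. This is a generic floating-point illustration for one output channel, not the paper's fixed-point DSP algorithm; the linear interpolation of both parameters, the linear fractional-delay read, and the function name `render_moving_source` are assumptions.

```python
import math

def render_moving_source(signal, delay_start, delay_end, gain_start, gain_end):
    """Apply a delay (in samples) and a gain that glide linearly over the signal."""
    n = len(signal)
    out = []
    for i in range(n):
        # Interpolation position from 0.0 (first sample) to 1.0 (last sample).
        t = i / (n - 1) if n > 1 else 0.0
        delay = delay_start + t * (delay_end - delay_start)
        gain = gain_start + t * (gain_end - gain_start)

        # Read the input at a fractional position using linear interpolation.
        pos = i - delay
        k = math.floor(pos)
        frac = pos - k
        a = signal[k] if 0 <= k < n else 0.0
        b = signal[k + 1] if 0 <= k + 1 < n else 0.0
        out.append(gain * ((1.0 - frac) * a + frac * b))
    return out

# With constant parameters this reduces to a plain delay-and-scale.
shifted = render_moving_source([1.0, 2.0, 3.0, 4.0, 5.0], 2.0, 2.0, 0.5, 0.5)
```

Running the same routine once per ear with different delay and gain trajectories would give a moving binaural image under these assumptions.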

Proposal of a new method for learning of diesel generator sounds and detecting abnormal sounds using an unsupervised deep learning algorithm

  • Hweon-Ki Jo;Song-Hyun Kim;Chang-Lak Kim
    • Nuclear Engineering and Technology / v.55 no.2 / pp.506-515 / 2023
  • This study proposes a method for learning the engine sound after start-up of a diesel generator installed in a nuclear power plant using an unsupervised deep learning algorithm (CNN autoencoder), together with a new method for predicting diesel generator failure based on it. To learn the sound of a diesel generator with a deep learning algorithm, sound data recorded before and after the start-up of two diesel generators were used. The sound data of 20 min and 2 h were cut into 7 s segments, and each segment was converted into a spectrogram image. 1200 and 7200 spectrogram images were created from the sound data of 20 min and 2 h, respectively. Using two different deep learning algorithms (CNN autoencoder and binary classification), it was investigated whether the diesel generator post-start sounds were learned as normal. The post-start sounds were accurately classified as normal and the pre-start sounds as abnormal. It was also confirmed that the deep learning algorithm could detect virtual abnormal sounds created by mixing unusual sounds with the post-start sounds. This study showed that the unsupervised anomaly detection algorithm achieves an accuracy about 3% higher than that of the binary classification algorithm.
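
Autoencoder-based anomaly detection of the kind described in this abstract is commonly implemented by thresholding the reconstruction error. The sketch below shows only that decision step; it assumes the reconstructions come from some already trained autoencoder, and the mean-plus-k-standard-deviations threshold, the function names, and the sample numbers are illustrative assumptions rather than the paper's procedure.

```python
import statistics

def mean_squared_error(original, reconstructed):
    """Mean squared error between an input vector and its reconstruction."""
    return sum((a - b) ** 2 for a, b in zip(original, reconstructed)) / len(original)

def fit_threshold(normal_errors, k=3.0):
    """Threshold = mean + k * standard deviation of errors on normal data (assumed rule)."""
    return statistics.mean(normal_errors) + k * statistics.pstdev(normal_errors)

def is_abnormal(original, reconstructed, threshold):
    """Flag a sample whose reconstruction error exceeds the threshold."""
    return mean_squared_error(original, reconstructed) > threshold

# Hypothetical reconstruction errors measured on normal (post-start) spectrograms.
threshold = fit_threshold([0.010, 0.012, 0.011, 0.009])

# A well-reconstructed sample passes; a poorly reconstructed one is flagged.
print(is_abnormal([0.0, 0.0], [0.05, 0.05], threshold))
print(is_abnormal([0.0, 0.0], [0.5, 0.5], threshold))
```

Because the model is trained only on normal sounds, sounds it has not seen tend to reconstruct poorly, which is what the threshold exploits.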

Objective and Subjective Test of a Virtual Sound Reproduction Using a Headphone (헤드폰을 이용한 가상음향 재현의 주관적, 객관적 평가)

  • 최원재;김상명
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2003.05a / pp.611-616 / 2003
  • The headphone is regarded as the most effective means of reproducing 3-dimensional virtual sound because of its channel separation property. However, several serious problems remain in headphone reproduction, such as 'front-back confusion' and 'in-head localization'. These well-known problems are generally assessed by subjective tests based on human judgment. In this paper, an objective test is conducted in parallel with the subjective test in order to validate the objective reproduction performance. Such a combined approach may offer a more scientific and systematic way to assess reproduction performance.

Aesthetic Study of Film Sound Inherent in Hitchcock's Psycho (히치콕 <사이코>에 내재된 영화 사운드의 미학적 고찰)

  • Park, Byung-Kyu
    • The Journal of the Korea Contents Association / v.14 no.6 / pp.26-33 / 2014
  • From a film-aesthetic point of view, this paper examines all the sound elements (speech, noise, and music) that contribute to the signification of sound in Hitchcock's Psycho. Speech renders a mental image audible through voice-over, and at times it embodies the indiscernibility of life and death. The paper demonstrates that noise, in addition to the visual techniques pointed out by Metz, can also mark punctuation, that is, a narrative boundary; it cites the sound of falling water that completes the shower scene, offsetting the scream in the audience's mind. In the music, desire and oppression are symbolized and create a dissonance. On occasion, the coexistence of two chords represents the duplicity of Norman-mother. The music may also disappear into silence, mummified in paused time. Thus, the common filmic signification of the sounds in Psycho can be called a reconceptualization of the image.