• Title/Summary/Keyword: Sound Processing

Search Result 620, Processing Time 0.024 seconds

Calculation Model of Time Varying Loudness by Using the Critical-banded Filters (임계 대역 필터를 이용한 과도음의 라우드니스 계산 모델)

  • Jeong, Hyuk;Ih, Jeong-Guon
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.5
    • /
    • pp.65-70
    • /
    • 2000
  • It is blown that the loudness is one of the most important metrics in assessing the sound quality and a calculation method for loudness has been standardized for steady sounds. In this study, a new loudness model is suggested for dealing with the transient sound for a unified analysis of various practical sounds. A signal processing technique is introduced for this purpose, which is required for the band subdivision and the prediction of band-level change of transient sounds. In addition, models for the post-masking and the temporal integration are adopted in the analysis of the loudness of transient sounds. In order to solve the problem of the conventional loudness model in the pure-tone signal processing, a critical band filter is employed in the analysis, which consists of 47 critical filters having a filter spacing of a half of the critical bandwidth. For testing the effectiveness of the present model, the predicted responses are compared with the experimental data and it is observed that they are in good agreements.

  • PDF

Watermarking Algorithm for Copyright Protection of Haegeum Sound Contents (해금 사운드 콘텐츠의 저작권 보호를 위한 워터마킹 알고리듬)

  • Hong, Yeon-Woo;Kang, Myeong-Su;Cho, Sang-Jin;Chong, Ui-Pil
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.10 no.4
    • /
    • pp.214-219
    • /
    • 2009
  • This paper proposes a watermarking algorithm considering the frequency characteristics of Haegeum sounds for copyright protection of digital Haegeum sound contents. The harmonics of Haegeum sounds commonly have large magnitude values in 1500Hz~2000Hz and 2800Hz~3500Hz so that those bands are selected to embed a watermark. The proposed method computes the FFT (fast Fourier transform) of the original sound signal and embeds the watermark bits generated by PN (pseudo noise) sequence into the harmonics in the selected bands. Furthermore, the proposed method is robust to lowpass filter, bandpass filter, cropping, noise addition, MP3 compression attacks and the maximum BER (bit error rate) is 1.41% after lowpass filter attack. To measure the quality of the watermarked sound, subjective listening test, MUSHRA (multiple stimuli with hidden reference and anchor), was conducted. The mean value of MUSHRA listening test is bigger than 98 and 96.67 for every Haegeum sounds and Korean classical music with Haeguem, respectively.

  • PDF

Spatial Speaker Localization for a Humanoid Robot Using TDOA-based Feature Matrix (도착시간지연 특성행렬을 이용한 휴머노이드 로봇의 공간 화자 위치측정)

  • Kim, Jin-Sung;Kim, Ui-Hyun;Kim, Do-Ik;You, Bum-Jae
    • The Journal of Korea Robotics Society
    • /
    • v.3 no.3
    • /
    • pp.237-244
    • /
    • 2008
  • Nowadays, research on human-robot interaction has been getting increasing attention. In the research field of human-robot interaction, speech signal processing in particular is the source of much interest. In this paper, we report a speaker localization system with six microphones for a humanoid robot called MAHRU from KIST and propose a time delay of arrival (TDOA)-based feature matrix with its algorithm based on the minimum sum of absolute errors (MSAE) for sound source localization. The TDOA-based feature matrix is defined as a simple database matrix calculated from pairs of microphones installed on a humanoid robot. The proposed method, using the TDOA-based feature matrix and its algorithm based on MSAE, effortlessly localizes a sound source without any requirement for calculating approximate nonlinear equations. To verify the solid performance of our speaker localization system for a humanoid robot, we present various experimental results for the speech sources at all directions within 5 m distance and the height divided into three parts.

  • PDF

Realtime Stereo Sound Image Expansion System Using Hass Effect& Phase shifting (선착효과 및 위상처리를 이용한 실시간 스테레오 음상 확장 시스템 구현)

  • 이종철;이상훈
    • Proceedings of the IEEK Conference
    • /
    • 1998.10a
    • /
    • pp.1227-1230
    • /
    • 1998
  • Phase control methods are used to expand the sound image in general AV system. However, these methods are effective only to the signal under 1kHz, and the listener must be located in front center of the speaker system. In this paper, we realize the realtime processing system in which phase shifting method is dominant at low frequency and precedence effect is dominant at high frequency. Two sound cards are used to process the audio signal in realtime with 16 bits stereo channel of 44.1 kHz sampling frequency. And the analog circuit is designed to process the phase shifting. In experiments the usefulness of the proposed stereo system is confirmed.

  • PDF

Improving Low Frequency Signal Reproduction in TV Audio (TV 스피커의 저주파수 신호 재생 개선)

  • Arora Manish;Oh Yoonhark;Kim SeoungHun;Lee Hyuckjae;Jang Seongcheol
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.275-278
    • /
    • 2004
  • In TV sound system, loudspeakers are subject to severe size constraints. The small size of the transducer affects the low frequency signal performance of the system. Bass signal performance contributes significantly to the user perceived sound quality and a good bass signal reproduction is essential. Increasing the sound energy in the bass signal range is an unviable solution since the gain required are exceedingly high and signal distortion occurs because of the speaker overload. Recently methods are being proposed to invoke low frequency illusion using psychoacoustic phenomena of the missing fundamental. This paper proposes a simple and effective signal processing method to create bass signal illusion in TV speakers using the missing fundamental effect, at a complexity of 12 MIPS on Motorola 56371 audio DSP.

  • PDF

On-line Detection of Cracks in Eggshell (계란 크랙의 온라인 검출)

  • 최완규;조한근;백진하;장영창;연광석;조성찬
    • Journal of Biosystems Engineering
    • /
    • v.24 no.3
    • /
    • pp.253-258
    • /
    • 1999
  • This study was conducted to develop an automatic egg inspection system for detecting creaked eggs based on acoustic impulse response. This system includes a sound generator, a sound sensor with signal conditioner, and a computer. The sound generator that hit the sharp of the dull edges of an egg was constructed with a ceramic ball pendulum attached to a rotary type solenoid. The signal conditioner included a pre-amplifier and a digital signal processing (DSP) board. The parameters for distinguishing cracked and normal eggs were the area, the geometric centroid and the resonance frequency of power spectrum of the acoustic signal generated. An algorithm for on-line detection of the continuous transferring eggs was developed. The performance tests resulted with 91% success rate to separate cracked and normal eggs at the rate of 1 second per an egg.

  • PDF

Lip Region Extraction by Gaussian Classifier (가우스 분류기를 이용한 입술영역 추출)

  • Kim, Jeong Yeop
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.2
    • /
    • pp.108-114
    • /
    • 2017
  • Lip reading is a field of image processing to assist the process of sound recognition. In some environment, the capture of sound signal usually has significant noise and therefore, the recognition rate of sound signal decreases. Lip reading can be a good feature for the increase of recognition rates. Conventional lip extraction methods have been proposed widely. Maia et. al. proposed a method by the sum of Cr and Cb. However, there are two problems as follows: the point with maximum saturation is not always regarded as lips region and the inner part of lips such as oral cavity and teeth can be classified as lips. To solve these problems, this paper proposes a method which adopts the histogram-based classifier for the extraction of lips region. The proposed method consists of two stages, learning and test. The amount of computation is minimized because this method has no color conversion. The performance of proposed method gives 66.8% of detection rate compared to 28% of conventional ones.

Root-Cause Investigation of Abnormal Sound from a Heat Exchanger of Condensate Water System in a Nuclear Power Plant (원전 복수계통 열교환기의 이음발생 원인규명)

  • Lee, Jun-Shin;Kim, Tae-Ryong;Lee, Wook-Ryun;Sohn, Seok-Man;Yoon, Seok-Bon;Kim, Man-Hee
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2006.05a
    • /
    • pp.1306-1311
    • /
    • 2006
  • The root cause of abnormal sound from a heat exchanger of condensate water system in a nuclear power plant is investigated by using the impact signal-processing methodology based on the Hertz theory. The predicted results for the location of impact force and the loose part size meet good agreement with the identified materials during the overhaul period in the plant. Nuclear power plants have experienced several loose parts and the frequency of the loose part will be increased along the aging of the plants. So, this analysis methodology for the impact signal will be widely utilized for the primary and secondary side of the nuclear power plant.

  • PDF

Optimal sequencing of 1D acoustic system for sound transmission loss maximization using topology optimization method (전달손실 최대화를 위한 위상최적화기반 1차원 흡차음시스템의 최적 배열 설계)

  • Kim, Eun-Il;Lee, Joong-Seok;Kim, Yoon-Young;Kim, Jung-Soo;Kang, Yeon-June
    • Proceedings of the Computational Structural Engineering Institute Conference
    • /
    • 2007.04a
    • /
    • pp.309-314
    • /
    • 2007
  • Optimal layer sequencing of a multi-layered acoustical foam is solved to maximize its sound transmission loss. A foam consisting of air and poroelastic layers can be optimized when a limited amount of a poroelastic material is allowed. By formulating the sound transmission loss maximization problem as a one dimensional topology optimization problem, optimal layer sequencing and thickness were systematically found for several frequencies. For optimization, the transmission losses of air and poroelastic layers were calculated by the transfer matrix derived from Biot's theory. By interpolating five intrinsic parameters among several poroelastic material parameters, dear air-poroelastic layer distributions were obtained; no filtering or post-processing was necessary. The optimized foam layouts by the proposed method were shown to differ depending on the frequency of interest.

  • PDF

Design Space Exploration of Many-Core Architecture for Sound Synthesis of Guitar on Portable Device (휴대 장치용 기타 음 합성을 위한 매니코어 아키텍처의 디자인 공간 탐색)

  • Kang, Myeongsu;Kim, Jong-Myon
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2014.01a
    • /
    • pp.1-4
    • /
    • 2014
  • Although physical modeling synthesis is becoming more and more efficient in rich and natural high-quality sound synthesis, its high computational complexity limits its use in portable devices. This constraint motivated research of single-instruction multiple-data many-core architectures that support the tremendous amount of computations by exploiting massive parallelism inherent in physical modeling synthesis. Since no general consensus has been reached which grain sizes of many-core processors and memories provide the most efficient operation for sound synthesis, design space exploration is conducted for seven processing element (PE) configurations. To find an optimal PE configuration, each PE configuration is evaluated in terms of execution time, area and energy efficiencies. Experimental results show that all PE configurations are satisfied with the system requirements to be implemented in portable devices.

  • PDF