• Title/Summary/Keyword: Audio Analysis

Search Result 544, Processing Time 0.027 seconds

Dual-Domain Connection Scheme for HE-AAC and MPEG Surround

  • Pang, Hee-Suk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.1E
    • /
    • pp.29-34
    • /
    • 2009
  • MPEG4 High Efficiency Advanced Audio Coding (HE-AAC) and MPEG Surround are one of the most efficient combinations for low bit rate multi-channel audio coding. Based on the fact that these two codecs have identical quadrature mirror filter (QMF) analysis and synthesis structures, we propose a dual-domain connection scheme for the codecs. Specifically two time-domain connection methods are analyzed and compared to the QMF subband-domain connection method. Experimental results show that both the time-domain connection methods cause no subjective sound quality degradation compared to the QMF subband-domain connection method, which verifies that one can select either of them depending on application scenarios.

Performance Analysis of Watermarking using Audio and Image Watermark in Wireless Channel Environment (무선 전송 채널 환경에서 오디오와 로고 영상을 이용한 워터마킹 성능분석)

  • Kim, Yoon-Ho;Park, Ki-Hong
    • Journal of Advanced Navigation Technology
    • /
    • v.10 no.4
    • /
    • pp.406-412
    • /
    • 2006
  • In this paper, we analyzed the performance of digital watermarking by using audio signal as well as logo image watermark. By utilizing the OFDM/QPSK system under AWGN channel environment, watermarked image are transmitted and detected. Experimental results showed that audio signal-based watermark embedding scheme is superior to that of logo image-based, which is able to restore a signal at SNR=3[dB].

  • PDF

The Study for PAGA Coverage Requirements and Analysis Procedure (선박 및 해양플랜트 PAGA Coverage Rule Study 및 해석 사례)

  • Lee, Sung-Ju;Park, Hyung-Sik;Park, No-Jun;Kwun, Hyuk;Suh, Yong-Suk;Seo, Jong-Soo
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2014.10a
    • /
    • pp.20-22
    • /
    • 2014
  • 선박, 해양플랜트에 적용되는 PAGA System 는 선내 모든 선원 및 승객에 대하여 정보전달 기능뿐 아니라 위급 상황 발생시 신속한 상황 전달 및 대피를 위한 안전상의 이유로서 매우 중요시 되고 있다. PAGA 의 핵심이 되는 Audio Coverage 의 경우, 초기 설계단계에서 해석을 통해 스피커 배치가 이루어지는데, 본 논문에서는 이러한 PAGA Audio Coverage 관련 요구조건과 합리적인 적용방안, 해석 및 평가 방안에 대하여 사례를 통해 소개하고자 한다.

  • PDF

Room acoustic measurement and analysis (실내 음향 측정 및 분석 기법 연구)

  • Hong Seung-Wook;Jeon Jin-Ho;Lee Sin-Lyul
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.557-560
    • /
    • 2004
  • 본 논문에서는 실내음향을 측정하기 위해 사용되는 음향 측정 신호인 blank pistol, maximum length sequence, sine sweep 을 이용한 충격응답 특성을 분석한다. 각 충격응답의 실내음향 인자들을 먼저 비교 분석하고 sine sweep 속도에 따른 충격응답 특성을 분석하여 최적의 실내음향 인자를 찾기 위한 sine sweep 속도를 결정한다.

  • PDF

Retrieval of Player Event in Golf Videos Using Spoken Content Analysis (음성정보 내용분석을 통한 골프 동영상에서의 선수별 이벤트 구간 검색)

  • Kim, Hyoung-Gook
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.7
    • /
    • pp.674-679
    • /
    • 2009
  • This paper proposes a method of player event retrieval using combination of two functions: detection of player name in speech information and detection of sound event from audio information in golf videos. The system consists of indexing module and retrieval module. At the indexing time audio segmentation and noise reduction are applied to audio stream demultiplexed from the golf videos. The noise-reduced speech is then fed into speech recognizer, which outputs spoken descriptors. The player name and sound event are indexed by the spoken descriptors. At search time, text query is converted into phoneme sequences. The lists of each query term are retrieved through a description matcher to identify full and partial phrase hits. For the retrieval of the player name, this paper compares the results of word-based, phoneme-based, and hybrid approach.

Sinusoidal Modeling of Polyphonic Audio Signals Using Dynamic Segmentation Method (동적 세그멘테이션을 이용한 폴리포닉 오디오 신호의 정현파 모델링)

  • 장호근;박주성
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.4
    • /
    • pp.58-68
    • /
    • 2000
  • This paper proposes a sinusoidal modeling of polyphonic audio signals. Sinusoidal modeling which has been applied well to speech and monophonic signals cannot be applied directly to polyphonic signals because a window size for sinusoidal analysis cannot be determined over the entire signal. In addition, for high quality synthesized signal transient parts like attacks should be preserved which determines timbre of musical instrument. In this paper, a multiresolution filter bank is designed which splits the input signal into six octave-spaced subbands without aliasing and sinusoidal modeling is applied to each subband signal. To alleviate smearing of transients in sinusoidal modeling a dynamic segmentation method is applied to subbands which determines the analysis-synthesis frame size adaptively to fit time-frequency characteristics of the subband signal. The improved dynamic segmentation is proposed which shows better performance about transients and reduced computation. For various polyphonic audio signals the result of simulation shows the suggested sinusoidal modeling can model polyphonic audio signals without loss of perceptual quality.

  • PDF

Online Monaural Ambient Sound Extraction based on Nonnegative Matrix Factorization Method for Audio Contents (오디오 컨텐츠를 위한 비음수 행렬 분해 기법 기반의 실시간 단일채널 배경 잡음 추출 기법)

  • Lee, Seokjin
    • Journal of Broadcast Engineering
    • /
    • v.19 no.6
    • /
    • pp.819-825
    • /
    • 2014
  • In this paper, monaural ambient component extraction algorithm based on nonnegative matrix factorization (NMF) is described. The ambience component extraction algorithm in this paper is developed for audio upmixing system; Recent researches have shown that they can enhance listener envelopment if the extracted ambient signal is applied into the multichannel audio upmixing system. However, the conventional method stores all of the audio signal and processes all at once, so it cannot be applied to streaming system and digital signal processor (DSP) system. In this paper, the ambient component extraction algorithm based on on-line nonnegative matrix factorization is developed and evaluated to solve the problem. As a result of analysis of the processed signal with spectral flatness measures in the experiment, it was shown that the developed system can extract the ambient signal similarly with the conventional batch process system.

On-Line Audio Genre Classification using Spectrogram and Deep Neural Network (스펙트로그램과 심층 신경망을 이용한 온라인 오디오 장르 분류)

  • Yun, Ho-Won;Shin, Seong-Hyeon;Jang, Woo-Jin;Park, Hochong
    • Journal of Broadcast Engineering
    • /
    • v.21 no.6
    • /
    • pp.977-985
    • /
    • 2016
  • In this paper, we propose a new method for on-line genre classification using spectrogram and deep neural network. For on-line processing, the proposed method inputs an audio signal for a time period of 1sec and classifies its genre among 3 genres of speech, music, and effect. In order to provide the generality of processing, it uses the spectrogram as a feature vector, instead of MFCC which has been widely used for audio analysis. We measure the performance of genre classification using real TV audio signals, and confirm that the proposed method has better performance than the conventional method for all genres. In particular, it decreases the rate of classification error between music and effect, which often occurs in the conventional method.

Analysis of Podcast User Behaviors and Classification of Users (팟캐스트 콘텐츠 이용자 행태분석 및 유형 파악)

  • Kang, Minjeong
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.3
    • /
    • pp.94-104
    • /
    • 2022
  • As the audio content market grows due to the spread of the AI speaker market and the influence of connected cars, the demand for podcast service is increasing. Therefore, in this study, the behaviors of podcast users were identified and the user types were classified. In the background study, podcast usage motives and user types were studied, and they were referred to when making the questionnaire. In the survey, preferred audio content was identified according to the situation, and in the in-depth interview, the user type and insights were derived by identifying the audio service usage behavior. As a result of the survey, there was little difference between preferred content for single listening and multitasking, but the difference in preferred content according to time period was statistically significant. The three user types derived from the in-depth interview were divided into users who listen alone for the purpose of study, find and listen to useful information quickly while on the go, and multitask and listen to the light and comfortable contents. It is expected that the results of this study will be an important reference for designing an audio content platform to improve user experience.

Implementation of an Intelligent Audio Graphic Equalizer System (지능형 오디오 그래픽 이퀄라이저 시스템 구현)

  • Lee Kang-Kyu;Cho Youn-Ho;Park Kyu-Sik
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.43 no.3 s.309
    • /
    • pp.76-83
    • /
    • 2006
  • A main objective of audio equalizer is for user to tailor acoustic frequency response to increase sound comfort and example applications of audio equalizer includes large-scale audio system to portable audio such as mobile MP3 player. Up to now, all the audio equalizer requires manual setting to equalize frequency bands to create suitable sound quality for each genre of music. In this paper, we propose an intelligent audio graphic equalizer system that automatically classifies the music genre using music content analysis and then the music sound is boosted with the given frequency gains according to the classified musical genre when playback. In order to reproduce comfort sound, the musical genre is determined based on two-step hierarchical algorithm - coarse-level and fine-level classification. It can prevent annoying sound reproduction due to the sudden change of the equalizer gains at the beginning of the music playback. Each stage of the music classification experiments shows at least 80% of success with complete genre classification and equalizer operation within 2 sec. Simple S/W graphical user interface of 3-band automatic equalizer is implemented using visual C on personal computer.