• Title/Summary/Keyword: 음향데이터

Search Result 943, Processing Time 0.027 seconds

Emergency vehicle priority signal system based on deep learning using acoustic data (음향 데이터를 활용한 딥러닝 기반 긴급차량 우선 신호 시스템)

  • Lee, SoYeon;Jang, Jae Won;Kim, Dae-Young
    • Journal of Platform Technology
    • /
    • v.9 no.3
    • /
    • pp.44-51
    • /
    • 2021
  • In general, golden time refers to the most important time in the initial response to accidents such as saving lives or extinguishing fires. The golden time varies from disaster to disaster, but is aimed at five minutes in terms of fire and first aid. However, for the actual site, the average dispatch time for ambulances is 9 minutes and the average transfer time is 17.6 minutes, which is quite large compared to the golden time. There are various causes for this delay, but the main cause is traffic jams. In order to solve the problem, the government has established emergency car concession obligations and secured golden time to prioritize ambulances in places with the highest accident rate, but it is not a solution in rush hour when traffic is increasing rapidly. Therefore, this paper proposed a deep learning-based emergency vehicle priority signal system using collected sound data by installing sound sensors on traffic lights and conducted an experiment to classify frequency signals that differ depending on the distance of the emergency vehicle.

Machine learning based radar imaging algorithm for drone detection and classification (드론 탐지 및 분류를 위한 레이다 영상 기계학습 활용)

  • Moon, Min-Jung;Lee, Woo-Kyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.5
    • /
    • pp.619-627
    • /
    • 2021
  • Recent advance in low cost and light-weight drones has extended their application areas in both military and private sectors. Accordingly surveillance program against unfriendly drones has become an important issue. Drone detection and classification technique has long been emphasized in order to prevent attacks or accidents by commercial drones in urban areas. Most commercial drones have small sizes and low reflection and hence typical sensors that use acoustic, infrared, or radar signals exhibit limited performances. Recently, artificial intelligence algorithm has been actively exploited to enhance radar image identification performance. In this paper, we adopt machined learning algorithm for high resolution radar imaging in drone detection and classification applications. For this purpose, simulation is carried out against commercial drone models and compared with experimental data obtained through high resolution radar field test.

Dialect classification based on the speed and the pause of speech utterances (발화 속도와 휴지 구간 길이를 사용한 방언 분류)

  • Jonghwan Na;Bowon Lee
    • Phonetics and Speech Sciences
    • /
    • v.15 no.2
    • /
    • pp.43-51
    • /
    • 2023
  • In this paper, we propose an approach for dialect classification based on the speed and pause of speech utterances as well as the age and gender of the speakers. Dialect classification is one of the important techniques for speech analysis. For example, an accurate dialect classification model can potentially improve the performance of speaker or speech recognition. According to previous studies, research based on deep learning using Mel-Frequency Cepstral Coefficients (MFCC) features has been the dominant approach. We focus on the acoustic differences between regions and conduct dialect classification based on the extracted features derived from the differences. In this paper, we propose an approach of extracting underexplored additional features, namely the speed and the pauses of speech utterances along with the metadata including the age and the gender of the speakers. Experimental results show that our proposed approach results in higher accuracy, especially with the speech rate feature, compared to the method only using the MFCC features. The accuracy improved from 91.02% to 97.02% compared to the previous method that only used MFCC features, by incorporating all the proposed features in this paper.

Clustering and classification of residential noise sources in apartment buildings based on machine learning using spectral and temporal characteristics (주파수 및 시간 특성을 활용한 머신러닝 기반 공동주택 주거소음의 군집화 및 분류)

  • Jeong-hun Kim;Song-mi Lee;Su-hong Kim;Eun-sung Song;Jong-kwan Ryu
    • The Journal of the Acoustical Society of Korea
    • /
    • v.42 no.6
    • /
    • pp.603-616
    • /
    • 2023
  • In this study, machine learning-based clustering and classification of residential noise in apartment buildings was conducted using frequency and temporal characteristics. First, a residential noise source dataset was constructed . The residential noise source dataset was consisted of floor impact, airborne, plumbing and equipment noise, environmental, and construction noise. The clustering of residential noise was performed by K-Means clustering method. For frequency characteristics, Leq and Lmax values were derived for 1/1 and 1/3 octave band for each sound source. For temporal characteristics, Leq values were derived at every 6 ms through sound pressure level analysis for 5 s. The number of k in K-Means clustering method was determined through the silhouette coefficient and elbow method. The clustering of residential noise source by frequency characteristic resulted in three clusters for both Leq and Lmax analysis. Temporal characteristic clustered residential noise source into 9 clusters for Leq and 11 clusters for Lmax. Clustering by frequency characteristic clustered according to the proportion of low frequency band. Then, to utilize the clustering results, the residential noise source was classified using three kinds of machine learning. The results of the residential noise classification showed the highest accuracy and f1-score for data labeled with Leq values in 1/3 octave bands, and the highest accuracy and f1-score for classifying residential noise sources with an Artificial Neural Network (ANN) model using both frequency and temporal features, with 93 % accuracy and 92 % f1-score.

Analysis of an Optimal Iterative Turbo Equalizer for Underwater Acoustic Communication (수중 음향통신에 적합한 최적의 반복기반 터보 등화기 분석)

  • Park, Tae Doo;Lee, Seong Ro;Kim, Beom Mu;Jung, Ji Won
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.38C no.3
    • /
    • pp.303-310
    • /
    • 2013
  • Underwater acoustic communication has multipath error because of reflection by sea-level and sea-bottom. The multipath of underwater channel causes signal distortion and error floor. In order to improve the performance, it is necessary to employ an iterative coding scheme. Among the iterative coding scheme, turbo codes and LDPC codes are dominant channel coding schemes in recent. This paper concluded that turbo coding scheme is optimal for underwater communications system in aspect to performance, coded word length, and equalizer combining. Also, we confirmed the performance in the environment of oceanic experimentation using turbo equalizer based on distance 5Km, data rate 1Kbps.

Implementation of an Efficient Music Retrieval System based on the Analysis of User Query Pattern (사용자 질의 패턴 분석을 통한 효율적인 음악 검색 시스템의 구현)

  • Rho, Seung-min;Hwang, Een-jun
    • The KIPS Transactions:PartA
    • /
    • v.10A no.6
    • /
    • pp.737-748
    • /
    • 2003
  • With the popularity of digital music contents, querying and retrieving music contents efficiently from database has become essential. In this paper, we propose a Fast Melody Finder (FMF) that can retrieve melodies fast and efficiently from music database using frequently queried tunes. This scheme is based on the observation that users have a tendency to memorize and query a small number of melody segments, and indexing such segments enables fast retrieval. To handle those tunes, FMF transcribes all the acoustic and common music notational inputs into a specific string such as UDR and LSR. We have implemented a prototype system and showed on its performance through various experiments.

User Modeling Method for Dynamic-FSM (Dynamic-FSM을 위한 사용자 모델링 방법)

  • Yun Tae-Bok;Park Du-Gyeong;Park Gyo-Hyeon;Lee Ji-Hyeong
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2006.05a
    • /
    • pp.317-321
    • /
    • 2006
  • 게임의 재미요소를 증대 시키고, 게임 생명주기(Life-Cycle)를 늘어나게 하기 위해 다양한 방법이 연구 중이다. 현실감 있는 그래픽 효과와 뛰어난 음향 효과 등과 함께 게임 플레이어의 게임 스타일이 반영된 게임을 만들기 위한 방법이 대표적이 예라 할 수 있다. 그 중 게임 플레이어의 스타일을 게임에 다시 이용하기 위해서는 플레이어의 인지과정이 요구되며, 인지된 결과를 이용하여 플레이어를 모델링(User Modeling)한다. 하지만, 게임의 종류와 특성에 따라 다양한 게임이 존재하기 때문에 플레이어를 모델링하기 어렵다는 문제를 가지고 있다. 본 논문에서는 게임에서 정의된 FSM(Finite State machine)을 이용하여 플레이어가 선택한 행동 패턴을 분석하고 적용하는 방법과 다양한 게임에서 이용 할 수 있는 스크립트 형태의 NPC 행동 패턴 변경 방법을 제안한다. 플레이어의 데이터를 분석하여 얻은 결과는 FSM을 변경하여 새로운 행동을 보이는 NPC(Non-Player Characters)를 생성하는데 사용되며, 이 캐릭터는 게임의 특성과 플레이어의 최신 행동 패턴 경향을 학습한 적용형 NPC라 할 수 있다. 실험을 통하여 사용자의 행동과 유사한 패턴을 보이는 NPC의 생성을 확인할 수 있었으며, 게임에서 상대적인 또는 적대적인 캐릭터로 유용하게 사용 될 수 있다.

  • PDF

Simulator for Active Sonar Target Recognition (능동소나 표적인식을 위한 시뮬레이터)

  • Seok, Jongwon;Kim, Taehwan;Bae, Keunsung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.16 no.10
    • /
    • pp.2137-2142
    • /
    • 2012
  • Many studies in detection and classification of the targets in the underwater environments have been conducted for military purposes, as well as for non-military purpose. Due to the complicated characteristics of underwater acoustic signal reflecting multipath environments and spatio-temporal varying characteristics, active sonar target classification technique has been considered as a difficult technique. And it has a difficult in collecting actual underwater data. In this paper, we implemented the simulator to synthesize the active target signal, to extract feature and to classify the target in the underwater environment. In target signal synthesis, highlight and three-dimensional model are used and multi-aspect based hidden markov model is used for target classification.

Active Sonar Target Recognition Using Fractional Fourier Transform (Fractional Fourier 변환을 이용한 능동소나 표적 인식)

  • Seok, Jongwon;Kim, Taehwan;Bae, Geon-Seong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.17 no.11
    • /
    • pp.2505-2511
    • /
    • 2013
  • Many studies in detection and classification of the targets in the underwater environments have been conducted for military purposes, as well as for non-military purpose. Due to the complicated characteristics of underwater acoustic signal reflecting multipath environments and spatio-temporal varying characteristics, active sonar target classification technique has been considered as a difficult technique. And it has difficulties in collecting actual underwater data. In this paper, we synthesized active target echoes based on ray tracing algorithm using target model having 3-dimensional highlight distribution. Then, Fractional Fourier transform was applied to synthesized target echoes to extract feature vector. Recognition experiment was performed using neural network classifier.

A Survey on Dynamical Modeling for Active Control of Thermo-Acoustic Instabilities (열-음향학적 불안정 현상의 능동제어를 위한 동역학적 모델링에 관한 현황 분석)

  • Na, Seon-Hwa;Ko, Sang-Ho
    • Journal of the Korean Society of Propulsion Engineers
    • /
    • v.15 no.6
    • /
    • pp.78-90
    • /
    • 2011
  • This paper surveys the recent research activities regarding dynamical modeling of thermo-acoustic instabilities which are fundamental to actively control such phenomena in gas-turbine engines, rockets, and etc. For this, we introduce reduced-order modeling approaches, mainly conducted after 1990s. Particularly, we survey grey-box approaches, which determine the structure of the model based on physical rules and use system's input-output data for estimating parameters of the model. We also introduce black-box approaches using model structures without physics-based interpretation. Finally, we briefly discuss future directions and feasibilities of the research in this field.