• Title/Summary/Keyword: Audio signal analysis

Search Result 74, Processing Time 0.026 seconds

DECODE: A Novel Method of DEep CNN-based Object DEtection using Chirps Emission and Echo Signals in Indoor Environment (실내 환경에서 Chirp Emission과 Echo Signal을 이용한 심층신경망 기반 객체 감지 기법)

  • Nam, Hyunsoo;Jeong, Jongpil
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.21 no.3
    • /
    • pp.59-66
    • /
    • 2021
  • Humans mainly recognize surrounding objects using visual and auditory information among the five senses (sight, hearing, smell, touch, taste). Major research related to the latest object recognition mainly focuses on analysis using image sensor information. In this paper, after emitting various chirp audio signals into the observation space, collecting echoes through a 2-channel receiving sensor, converting them into spectral images, an object recognition experiment in 3D space was conducted using an image learning algorithm based on deep learning. Through this experiment, the experiment was conducted in a situation where there is noise and echo generated in a general indoor environment, not in the ideal condition of an anechoic room, and the object recognition through echo was able to estimate the position of the object with 83% accuracy. In addition, it was possible to obtain visual information through sound through learning of 3D sound by mapping the inference result to the observation space and the 3D sound spatial signal and outputting it as sound. This means that the use of various echo information along with image information is required for object recognition research, and it is thought that this technology can be used for augmented reality through 3D sound.

Automatic measurement of voluntary reaction time after audio-visual stimulation and generation of synchronization and generation of synchronization signals for the analysis of evoked EEG (시청각자극후의 피험자의 자의적 반응시간의 자동계측과 유발뇌파분석을 위한 동기신호의 생성)

  • 김철승;엄광문;손진훈
    • Proceedings of the Korean Society for Emotion and Sensibility Conference
    • /
    • 2003.05a
    • /
    • pp.36-40
    • /
    • 2003
  • 근래에 들어 질병으로 인하여 의사표현이 곤란한 환자에게 뇌파에 기초한 BCI(Brain Computer Interface)와 같은 새로운 인터페이스를 제공하고자 하는 연구가 활발히 진행되고 있다. BCI를 위한 기초 연구로서 특정 자극에 대해 유발되는 뇌파의 측정과 분석은 BCI를 위한 뇌파의 패턴과 인터페이스의 설계에 중요한 역할을 한다. 이 연구의 목적은 시청각 자극 인가후 피험자의 반응 시간을 측정하는 시스템을 EEG와 같은 생체 신호 계측 시스템과 연동이 가능한 형태로 개발하는 것이다. 제안된 시스템은 기능적으로 자극 신호 발생부, 반응시간 측정부, 유발뇌파 측정부, 동기신호발생부로 나뉘어진다. 자극신호 발생부는 실험에 이용되는 자극신호를 제작하는 부분으로서 Flash를 사용하여 구현하였다. 반응시간 측정부는 문제에 대한 답 선택 요청시각으로부터 피험자의 반응까지의 시간을 측정하는 부분으로서 마이크로 컴퓨터(80C31)를 이용하여 구현하였다. 우발뇌파 측정부는 시판용 하드웨어와 소프트웨어를 그대로 사용하였다. 동기신호 발생부는 전체 시스템의 동기를 맞추기 위한 신호를 발생하는 부분으로서 문제제시, 답요구와 동기한 화면상의 명암 신호와 이를 검출하는 광센서로 구성하였다. 본 논문에서 제시한 방법에서는 기존의 유발진위 측정 및 자극시스템에 특정 모듈(반응시간 측정 장치, 동기신호 발생장치)만을 추가하여 실험자의 의도에 맞는 시스템을 설계할 수 있어 유발 뇌파 및 반응시간 측정을 필요로 하는 연구를 가속화 할 것이 기대된다.

  • PDF

Classification of General Sound with Non-negativity Constraints (비음수 제약을 통한 일반 소리 분류)

  • 조용춘;최승진;방승양
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.10
    • /
    • pp.1412-1417
    • /
    • 2004
  • Sparse coding or independent component analysis (ICA) which is a holistic representation, was successfully applied to elucidate early auditor${\gamma}$ processing and to the task of sound classification. In contrast, parts-based representation is an alternative way o) understanding object recognition in brain. In this thesis we employ the non-negative matrix factorization (NMF) which learns parts-based representation in the task of sound classification. Methods of feature extraction from the spectro-temporal sounds using the NMF in the absence or presence of noise, are explained. Experimental results show that NMF-based features improve the performance of sound classification over ICA-based features.

A study on ultrasound analysis of the transformer strange signal (변압기 이상음의 초음파 분석에 관한 연구)

  • 백화종;지석근
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2002.11a
    • /
    • pp.835-838
    • /
    • 2002
  • A running high voltage equipments produce ultrasonic wave that has unique sound by the specific characteristics of the electricity. The generation of the ultrasonic wave is made by the electric transform like arcing, corona, and tracking so on. The mechanical losses and fatal human damages are happened by the electric failure of high voltage equipments. To prevent and diagnose the obstacle factors of the high voltage equipments, the measurement of the ultrasonic wave became to be prominent. However standardized data have been a deficient situation by now. This paper measures the ultrasonic wave coming from the real running transformer equipments and transforms it as an audio frequency. Measured data represents as frequency and time domain through the FFT(Fast Fourier Transform) transform. In conclusion, the purpose of this paper is to standardize the analyzed data.

  • PDF

Sound System Design and Characteristic Analysis based on Power Line Communication (전력선통신 기반 음향 시스템 설계 및 특성 분석)

  • Kim, Kwan-Kyu;Yeom, Keong-Tae;Kim, Kwan-Woong;Kim, Yong-Kab
    • The Journal of the Korea Contents Association
    • /
    • v.8 no.6
    • /
    • pp.1-7
    • /
    • 2008
  • The paper is to solve the problem of existing sound system, which has difficulties of system organization and the increase of additional install cost and unfriendly interior. To solve the existing system, we drew the new sound system based on PLC and studied it. A transmitter and a receiver were designed using the PLC chip INT5500CS. Sound system was configured with a CD player that sound signals are sent from the transmitter and a speaker connected to the receiver. For analysis of characteristics of this system, a USBPre external sound card and Smaart Live 5 which is a PC-based sound measuring program were added. As a result of our experiment, the measured signal level is $2{\sim}3$[dB] lower than reference signal, latency is 16.69[ms] and the specific character of coherency is bad in high frequency band. Otherwise, this system transmits and receives signals over 90[%] in good condition as a result of measuring pink noise, frequency(1kHz), and phase, magnitude. In view of the result so far achieved, the system designed our team has excellent performance, it resolves defect of existing audio signal transmition system.

Parametric Crack and Flexural Strength Analyses of Concrete Slab For Railway Structures Using GFRP Rebar (GFRP 보강근을 적용한 교량용 콘크리트 도상슬래브의 균열 및 휨강도 변수 해석)

  • Choe, Hyeong-Bae;Lee, Sang-Youl
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.34 no.6
    • /
    • pp.363-370
    • /
    • 2021
  • In this paper, we presented an optimized crack and flexural strength analysis of a glass-fiber reinforced polymer (GFRP) rebar, used as reinforcements for in-site railway concrete slabs. The insulation performance of a GFRP rebar has the advantage of avoiding the loss of signal current in an audio frequency (AF) track circuit. A full-scale experiment, and three-dimensional finite element simulation results were compared to validate our approaches. Parametric numerical results revealed that the diameters and arrangements of the GFRP rebar had a significant effect on the flexural strength and crack control performances of the concrete track slabs. The results of this study could serve as a benchmark for future guidelines in designing more efficient, and economical concrete slabs using the GFRP rebar.

Utility Estimation of the Application of Auditory-Visual-Tactile Sense Feedback in Respiratory Gated Radiation Therapy (호흡동조방사선치료 시 Real Time Monitor와 Ventilator의 유용성 평가)

  • Jo, Jung Hun;Kim, Byeong Jin;Roh, Shi Won;Lee, Hyeon Chan;Jang, Hyeong Jun;Kim, Hoi Nam;Song, Jae Hun;Kim, Young Jae
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.25 no.1
    • /
    • pp.33-40
    • /
    • 2013
  • Purpose: The purpose of this study was to evaluate the possibility to optimize the gated treatment delivery time and maintenance of stable respiratory by the introduction of breath with the assistance of auditory-visual-tactile sense. Materials and Methods: The experimenter's respiration were measured by ANZAI 4D system. We obtained natural breathing signal, monitor-induced breathing signal, monitor & ventilator-induced breathing signal, and breath-hold signal using real time monitor during 10 minutes beam-on-time. In order to check the stability of respiratory signals distributed in each group were compared with means, standard deviation, variation value, beam_time of the respiratory signal. Results: The stability of each respiratory was measured in consideration of deviation change studied in each respiratory time lapse. As a result of an analysis of respiratory signal, all experimenters has showed that breathing signal used both Real time monitor and Ventilator was the most stable and shortest time. Conclusion: In this study, it was evaluated that respiratory gated radiation therapy with auditory-visual-tactual sense and without auditory-visual-tactual sense feedback. The study showed that respiratory gated radiation therapy delivery time could significantly be improved by the application of video feedback when this is combined with audio-tactual sense assistance. This delivery technique did prove its feasibility to limit the tumor motion during treatment delivery for all patients to a defined value while maintaining the accuracy and proved the applicability of the technique in a conventional clinical schedule.

  • PDF

Performance Improvement analysis of Acoustic Communication System using Receive Diversity (수신 다이버시티를 이용한 음향 통신 시스템의 성능 향상 분석)

  • Bok, Jun-Yeong;Ryu, Heung-Gyoon
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.36 no.3A
    • /
    • pp.198-204
    • /
    • 2011
  • Acoustic communication system is a transmission technology sending sound and data simultaneously. However, data signal can be audible in this system when data is transmitted with high transmission power. The more transmission power is reduced, the more distance that can transmit data is shortened. Therefore, the study that increase the transmission distance is needed. In this paper, we would like to increase transmission distance by adapting receive diversity in acoustic communication system. We measure received performance of both proposed system and Single Input Sing Output (SISO) system according to distance with same transmission power. When SISO satisfies Bit Error Rate (BER) of $7{\times}10^{-3}$ at about 2m, Selection Combining (SC) technique satisfies 2 meters, and Equal Gain Combining (EGC) technique satisfies 4 meters.

A Study on Analysis of Clinical Data and Telemedicine System for the Treatment of Acrophobia (고소공포증 치료를 위한 원격진료 시스템 및 데이터 분석에 대한 연구)

  • Ryu, Jong-Hyun;Paek, Seung-Eun
    • The Journal of Information Technology
    • /
    • v.9 no.1
    • /
    • pp.21-32
    • /
    • 2006
  • Acrophobia is a symptom of feeling an abnormal fear of heights. Medications or cognitive-behavior methods have been mainly used to treat the acrophobia. In these days the virtua1 reality technology has been applied to treat such an anxiety disorders. In this thesis, an telemedicine assistant system for treatment of acrophobia using biomedical signals and virtual reality technique is proposed. I made two virtual reality simulations for treatment of acrophobia and telemedicine system for communication between doctor and patient using personal computer. A virtual environment provides patient with stimuli which arouses phobia, and exposition to such environment makes him have ability to overcome the fear. Recently, the patient can take diagnosis from a medical doctor in distance with the telemedicine system. Multimedia conference service, on-line questionary, signal transfer system are needed to configure such system. Virtual reality simulation system that composed of position sensor, head mount display, and audio system, is also included in this telemedicine system. I added virtual environment update system to this virtual reality telemedicine system for treatment of acrophobia. Former acrophobia treatment systems use only patient's score of the questionary to appraise. The new system developed in this thesis uses not only patient's score of the questionary but also biomedical signals such as HR, GSR amplitude, GSR RT to increase the objectivity and quantitativity. The experimental results show that HR and GSR amplitude are useful for decision of acrophobia. We will apply this system to the acrophobia patient in distance and be able to offer better medical treatment for mental illness in near future.

  • PDF

Comprehensive analysis of deep learning-based target classifiers in small and imbalanced active sonar datasets (소량 및 불균형 능동소나 데이터세트에 대한 딥러닝 기반 표적식별기의 종합적인 분석)

  • Geunhwan Kim;Youngsang Hwang;Sungjin Shin;Juho Kim;Soobok Hwang;Youngmin Choo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.42 no.4
    • /
    • pp.329-344
    • /
    • 2023
  • In this study, we comprehensively analyze the generalization performance of various deep learning-based active sonar target classifiers when applied to small and imbalanced active sonar datasets. To generate the active sonar datasets, we use data from two different oceanic experiments conducted at different times and ocean. Each sample in the active sonar datasets is a time-frequency domain image, which is extracted from audio signal of contact after the detection process. For the comprehensive analysis, we utilize 22 Convolutional Neural Networks (CNN) models. Two datasets are used as train/validation datasets and test datasets, alternatively. To calculate the variance in the output of the target classifiers, the train/validation/test datasets are repeated 10 times. Hyperparameters for training are optimized using Bayesian optimization. The results demonstrate that shallow CNN models show superior robustness and generalization performance compared to most of deep CNN models. The results from this paper can serve as a valuable reference for future research directions in deep learning-based active sonar target classification.