• Title/Summary/Keyword: Sound recognition

Search Result 311, Processing Time 0.026 seconds

DECODE: A Novel Method of DEep CNN-based Object DEtection using Chirps Emission and Echo Signals in Indoor Environment (실내 환경에서 Chirp Emission과 Echo Signal을 이용한 심층신경망 기반 객체 감지 기법)

  • Nam, Hyunsoo;Jeong, Jongpil
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.21 no.3
    • /
    • pp.59-66
    • /
    • 2021
  • Humans mainly recognize surrounding objects using visual and auditory information among the five senses (sight, hearing, smell, touch, taste). Major research related to the latest object recognition mainly focuses on analysis using image sensor information. In this paper, after emitting various chirp audio signals into the observation space, collecting echoes through a 2-channel receiving sensor, converting them into spectral images, an object recognition experiment in 3D space was conducted using an image learning algorithm based on deep learning. Through this experiment, the experiment was conducted in a situation where there is noise and echo generated in a general indoor environment, not in the ideal condition of an anechoic room, and the object recognition through echo was able to estimate the position of the object with 83% accuracy. In addition, it was possible to obtain visual information through sound through learning of 3D sound by mapping the inference result to the observation space and the 3D sound spatial signal and outputting it as sound. This means that the use of various echo information along with image information is required for object recognition research, and it is thought that this technology can be used for augmented reality through 3D sound.

Optimize the Acoustic Environment Using a Sound Masking Effects of the Audio Signal Compression Principle (음성신호의 압축원리를 이용한 사운드 마스킹 효과로 음향 환경 최적화)

  • Ann, Sook-Hyang
    • Journal of the Korean Institute of Electrical and Electronic Material Engineers
    • /
    • v.28 no.11
    • /
    • pp.748-751
    • /
    • 2015
  • Sound Masking System technology as by sound the same on all bands and artificially generates a constant sound shield People want to hear or recognize the people with the noise generated from the interior of the way. Prevent hearing or prevent recognition by using the technology to control the audible frequency band Continue to emit constant and uniform shielding sound audible frequency band Even the security content of speech (20 Hz~20 KHz). That interception laser eavesdropping, internal solicitations, during recording Or delay the decoding was a result of the effect of interference calculated Experience noise disturbance index is applied around the Stress Index is the average index is 10.16 was a luxury for the average index is then applied to the index 3.07 Noise is significantly lower stress level has improved noise conditions.

Recognition of the Direct and Reflected Sounds in an Irregulary Formed Chamber (비정방형실내에서의 직접음과 반사음 식별에 관한 연구)

  • 차일환;박규태;임광호
    • The Journal of the Acoustical Society of Korea
    • /
    • v.2 no.1
    • /
    • pp.11-19
    • /
    • 1983
  • An irregulary formed chamber was designed and constructed to recognize the direct sound radiated from the sound source and the reflected sound from the walls of the chamber. The sound signal used was tone burst in the frequency response characteristics with the signal detection after transient effect. The direct wave, transient phenomena and the primary reflected sound could be asiily distinguished each other by measurements of the arrival time of the time difference. And also noise could be easily distinguished by the same method. The result obtained can be used in industries for automatic measurement of the sound pressure reponse characteristics with respect to frequencies.

  • PDF

Heart Sound Recognition by Analysis of wavelet transform and Neural network.

  • Lee, Jung-Jun;Lee, Sang-Min;Hong, Seung-Hong
    • Proceedings of the IEEK Conference
    • /
    • 2000.07b
    • /
    • pp.1045-1048
    • /
    • 2000
  • This paper presents the application of the wavelet transform analysis and the neural network method to the phonocardiogram (PCG) signal. Heart sound is a acoustic signal generated by cardiac valves, myocardium and blood flow and is a very complex and nonstationary signal composed of many source. Heart sound can be discriminated normal heart sound and heart murmur. Murmurs have broader frequency bandwidth than the normal ones and can occur at random position of cardiac cycle. In this paper, we classified the group of heart sound as normal heart sound(NO), pre-systolic murmur(PS), early systolic murmur(ES), late systolic murmur(LS), early diastolic murmur(ED). And we used the wavelet transform to shorten artifacts and strengthen the low level signal. The ANN system was trained and tested with the back- propagation algorithm from a large data set of examples-normal and abnormal signals classified by expert. The best ANN configuration occurred with 15 hidden layer neurons. We can get the accuracy of 85.6% by using the proposed algorithm.

  • PDF

A Study on the Expression Recognition of the Experience of the Sinmyung and the Movement in the Korean Dance of College Students Majoring in Musical: A Qualitative (뮤지컬 전공대학생들의 한국 춤 신명체험(神明體驗)과 움직임 표현인식;질적 접근)

  • Jeong, Tae-seon;Ahn, Byoung-Soon
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.12
    • /
    • pp.383-393
    • /
    • 2018
  • The purpose of this paper is to study on the elements of the Sinmyung and the expression recognition of body movement in Korean dance of college students majoring in musical. The participants were 12 male and female college students in musical major who experienced in dance, song and acting. The program was composed of the experience of the Sinmyung: recognition of sound and dance, breathing and movement in the Korean dance, 8 hours twice a week for four weeks. As a qualitative approach is the discovery of the center of the process, we carried out an inductive analysis of the area on the basis of observation, in-depth interview and student report. The core of this analysis is to attempt to analyze contents concentrating on the recognition exploration of the Sinmyung sentiment and the body expression through sound and breathing. In conclusion, for college students majoring in musical, the expression recognition of the experience of the Sinmyung and the movement in the Korean dance contributes to the improvement of creative thinking through body perception, and the practical use of the capacity of image expression through concentration of sound and breathing. Finally, the results of this research could articulate with the value of body expression and the creative factors of college students majoring in musical.

A HMM-based Method of Reducing the Time for Processing Sound Commands in Computer Games (컴퓨터 게임에서 HMM 기반의 명령어 신호 처리 시간 단축을 위한 방법)

  • Park, Dosaeng;Kim, Sangchul
    • Journal of Korea Game Society
    • /
    • v.16 no.2
    • /
    • pp.119-128
    • /
    • 2016
  • In computer games, most of GUI methods are keyboards, mouses and touch screens. The total time of processing the sound commands for games is the sum of input time and recognition time. In this paper, we propose a method for taking only the prefixes of the input signals for sound commands, resulting in the reduced the total processing time, instead of taking the whole input signals. In our method, command sounds are recognized using HMM(Hidden Markov Model), where separate HMM's are built for the whole input signals and their prefix signals. We experiment our proposed method with representative commands of platform games. The experiment shows that the total processing time of input command signals reduces without decreasing recognition rate significantly. The study will contribute to enhance the versatility of GUI for computer games.

Phoneme segmentation and Recognition using Support Vector Machines (Support Vector Machines에 의한 음소 분할 및 인식)

  • Lee, Gwang-Seok;Kim, Deok-Hyun
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2010.05a
    • /
    • pp.981-984
    • /
    • 2010
  • In this paper, we used Support Vector Machines(SVMs) as the learning method, one of Artificial Neural Network, to segregated from the continuous speech into phonemes, an initial, medial, and final sound, and then, performed continuous speech recognition from it. A Decision boundary of phoneme is determined by algorithm with maximum frequency in a short interval. Speech recognition process is performed by Continuous Hidden Markov Model(CHMM), and we compared it with another phoneme segregated from the eye-measurement. From the simulation results, we confirmed that the method, SVMs, we proposed is more effective in an initial sound than Gaussian Mixture Models(GMMs).

  • PDF

Directed Identification, Synchronization by Aesthetic Recognition of Animation Field (애니메이션 분야의 심미적 인식에 의한 동일시와 동기화 연출)

  • Lee, Hyun Woo;Ryu, Chang Su
    • Journal of Korea Multimedia Society
    • /
    • v.25 no.10
    • /
    • pp.1475-1482
    • /
    • 2022
  • Mickey Mousing perfect match between animation sound and image was an aesthetic in the field of animation, but since the 2000s, works such as and released by producers such as DreamWorks and Pixar have expanded the perfection of synchronization to irony. It also influenced the identification system of sentiment. It is time to view the directing attempt of these elements as a factor that changed the new paradigm of narrative, and related research is needed. In this study, the scene of was analyzed as a case study for the synchronization of animation sound and image components and the boundary direction on the recognition of identification between reality and fiction. Aesthetic recognition of the research work is based on the premise of real time and space perception, and the audience can recognize in the conceptual world as an integrated art by playfully producing fictional time and space. The direct antithesis of synchronization and identification was drawn to maintain the curiosity of the next scene by repeating selective concealment and disclosure of information in the direction of conveying an unfamiliar and heterogeneous feeling to the audience.

Experiment on the Perception of Fire Alarm Sound of Small Construction Site Workers (소규모 건설공사현장 작업자의 화재경보음 인지 실험)

  • Pil-Jae Moon;Seo-Young Kim;Ha-Sung Kong
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.1
    • /
    • pp.153-160
    • /
    • 2023
  • This research experiments on the workers' recognition of the fire alarm sound for sirens and portable loudspeakers in a small construction site. As a result of analyzing the siren alarm sound recognition from measuring on the 1st, 2nd, and 4th floors, the sound was more unrecognizable on the 4th floor than on the 1st, and 1 person on the 1st floor was unable to recognize all sounds. In the case of the 2nd floor, one person could not notice the alarm in the last 3rd trial, and another did not realize it all three times. For the 4th floor, 3 people demonstrated unrecognition in all 3 tests. As a result of analyzing the recognition of portable loudspeaker alarm sounds, 1 person could not recognize all sounds on the first floor. In the case of the 2nd floor, 2 people were confirmed to be unable to notice, and lastly, 4 people could not recognize all trials on the 4th floor. The subjects who didn't recognize the sound were unable to distinguish between portable loudspeaker alarm sound and work noise due to the workspace and obstacles.