• Title/Summary/Keyword: Sound recognition


A Development of Ultrasonic-wave Remote Control System For Recovering a Submarine Survey Equipment (해저 탐사 및 관측 장비 회수를 위한 초음파 원격제어시스템 개발)

  • Kim, Young-Jin;Huh, Kyung-Moo;Jeong, Han-Cheol;Woo, Jong-Sik;Cho, Young-June
    • Proceedings of the KIEE Conference / 2004.11c / pp.117-119 / 2004
  • Successful exploitation of underwater resources begins with marine environmental research and exploration of the seafloor. Traditionally, a long-term experimental unit is installed on the seafloor and retrieved after a certain period of time. Reliable teleoperation and telemetry of the unit are essential to these applications. This study presents an ultrasonic-wave remote control system and an underwater sound recognition algorithm that identifies the sound signal without being affected by disturbances caused by changes in the underwater environment. The proposed method suits units that require low power dissipation and long-term underwater operation. Experiments demonstrate its stability and fast sound recognition.


A Separator system for underwater observing instrument (수중 관측 및 탐사장비 원격분리 시스템의 개발)

  • Kim, Young-Jin;Jeong, Han-Cheol;Huh, Kyung-Moo;Cho, Young-June
    • Proceedings of the KIEE Conference / 2005.05a / pp.158-160 / 2005
  • Successful exploitation of underwater resources begins with marine environmental research and exploration of the seafloor. Traditionally, a long-term experimental unit is installed on the seafloor and retrieved after a certain period of time. Reliable teleoperation and telemetry of the unit are essential to these applications. We propose an ultrasonic-wave remote control system and an underwater sound recognition algorithm that identifies the sound signal without being affected by disturbances caused by changes in the underwater environment. The proposed method suits units that require low power dissipation and long-term underwater operation. Experiments demonstrate its stability and fast sound recognition.


Context Recognition Using Environmental Sound for Client Monitoring System (피보호자 모니터링 시스템을 위한 환경음 기반 상황 인식)

  • Ji, Seung-Eun;Jo, Jun-Yeong;Lee, Chung-Keun;Oh, Siwon;Kim, Wooil
    • Journal of the Korea Institute of Information and Communication Engineering / v.19 no.2 / pp.343-350 / 2015
  • This paper presents a context recognition method using environmental sound signals, applied to a mobile-based client monitoring system. Seven acoustic contexts are defined, and the corresponding environmental sound signals are collected for the experiments. To evaluate context recognition performance, MFCC and LPCC are employed for feature extraction, and statistical pattern recognition with GMM and HMM is used for acoustic modeling. The experimental results show that LPCC and HMM improve context recognition accuracy more than MFCC and GMM, respectively. The recognition system using LPCC and HMM achieves 96.03% recognition accuracy. These results indicate that LPCC effectively represents environmental sounds, which contain more varied frequency components than human speech, and that HMM models time-varying environmental sounds more effectively than GMM.
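
The abstract does not say how its LPCC features are computed. As a minimal sketch only, the block below shows the standard recursion that converts linear prediction coefficients into cepstral coefficients, assuming an all-pole model 1 / (1 + a_1 z^-1 + ... + a_p z^-p). The function name `lpc_to_cepstrum` and the example coefficients are hypothetical and are not taken from the paper.

```python
def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC coefficients to LPC cepstral coefficients (LPCC).

    `a` holds a_1..a_p of the all-pole model 1 / (1 + sum_k a_k z^-k).
    Returns the list [c_1, ..., c_n_ceps] (the gain term c_0 is omitted).
    This is the textbook recursion, not necessarily the paper's variant.
    """
    p = len(a)
    c = []
    for n in range(1, n_ceps + 1):
        a_n = a[n - 1] if n <= p else 0.0
        acc = 0.0
        for k in range(1, n):
            # a_{n-k} is zero beyond the model order p
            if n - k <= p:
                acc += k * c[k - 1] * a[n - k - 1]
        c.append(-a_n - acc / n)
    return c


# Hypothetical first-order model with a single pole at z = 0.5 (a_1 = -0.5).
ceps = lpc_to_cepstrum([-0.5], 3)
```

For a single pole at alpha, the cepstrum has the closed form c_n = alpha^n / n, which gives a quick sanity check on the recursion.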

Design and Implementation of Vocal Sound Variation Rules for Korean Language (한국어 음운 변동 처리 규칙의 설계 및 구현)

  • Lee, Gye-Young
    • The Transactions of the Korea Information Processing Society / v.5 no.3 / pp.851-861 / 1998
  • The Korean language is characterized by rich vocal sound variation. To increase the rate of vocal sound recognition and to produce natural vocal sound synthesis, systematic and thorough research into the characteristics of Korean, including its vocal sound variation rules, is required. This paper addresses an effective approach to vocal sound recognition and synthesis by presenting the design and implementation of Korean vocal sound variation rules. The design follows the Phonetic Standard (Section 30, Chapter 7) of the Korean Orthographic Standards. We first factored out rules for each regulation, then grouped them into 27 groups, one for each final consonant. The Phonological Change Processing System proposed in the paper processes vocal sound variation quickly through a single application of the rule. The rules also cover the processing of information attached to words or to the stems of inflected words. We believe that the Phonological Change Processing System will facilitate sentence-level vocal sound recognition and synthesis. The system may also serve as a reference for similar research areas.
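
The abstract does not list the individual rules. As an illustration of what a rule keyed on the final consonant can look like, the sketch below applies one well-known variation, liaison (a final consonant moving into a following syllable that begins with silent ㅇ), using standard Unicode Hangul syllable arithmetic. The function names and the choice of rule are assumptions for illustration; this is not the paper's Phonological Change Processing System, and it ignores other rules such as palatalization.

```python
# Illustrative sketch only: one liaison rule, not the paper's rule set.
CHOSEONG = "ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ"  # 19 initial consonants
JONGSEONG = [""] + list("ㄱㄲㄳㄴㄵㄶㄷㄹㄺㄻㄼㄽㄾㄿㅀㅁㅂㅄㅅㅆㅇㅈㅊㅋㅌㅍㅎ")  # 28 finals
BASE, N_MEDIAL, N_FINAL = 0xAC00, 21, 28


def is_syllable(ch):
    """True if ch is a precomposed Hangul syllable (U+AC00..U+D7A3)."""
    return 0xAC00 <= ord(ch) <= 0xD7A3


def decompose(ch):
    """Split a syllable into (initial, medial, final) indices."""
    idx = ord(ch) - BASE
    return idx // (N_MEDIAL * N_FINAL), (idx // N_FINAL) % N_MEDIAL, idx % N_FINAL


def compose(initial, medial, final):
    """Build a syllable from (initial, medial, final) indices."""
    return chr(BASE + (initial * N_MEDIAL + medial) * N_FINAL + final)


def apply_liaison(word):
    """Move a simple final consonant onto a following silent-ㅇ initial."""
    out = list(word)
    for pos in range(len(out) - 1):
        if not (is_syllable(out[pos]) and is_syllable(out[pos + 1])):
            continue
        i1, m1, f1 = decompose(out[pos])
        i2, m2, f2 = decompose(out[pos + 1])
        final = JONGSEONG[f1]
        # Only finals that also exist as initials; ㅇ stays put, ㅎ follows other rules.
        movable = bool(final) and final in CHOSEONG and final not in "ㅇㅎ"
        if CHOSEONG[i2] == "ㅇ" and movable:
            out[pos] = compose(i1, m1, 0)
            out[pos + 1] = compose(CHOSEONG.index(final), m2, f2)
    return "".join(out)


pronounced = apply_liaison("음악")
```

Because the rule is selected by looking at the final consonant of the current syllable, a table of rules indexed by final consonant (as the abstract describes) would let each syllable boundary be resolved in one lookup.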


A Study on Phoneme Likely Units to Improve the Performance of Context-dependent Acoustic Models in Speech Recognition (음성인식에서 문맥의존 음향모델의 성능향상을 위한 유사음소단위에 관한 연구)

  • 임영춘;오세진;김광동;노덕규;송민규;정현열
    • The Journal of the Acoustical Society of Korea / v.22 no.5 / pp.388-402 / 2003
  • In this paper, we carried out word, 4-continuous-digit, continuous speech, and task-independent word recognition experiments to verify the effectiveness of re-defined phoneme-likely units (PLUs) for phonetic-decision-tree-based HM-Net (Hidden Markov Network) context-dependent (CD) acoustic modeling in Korean. In the 48 PLUs, the phonemes /ㅂ/, /ㄷ/, /ㄱ/ are separated into initial sound, medial vowel, and final consonant, and the consonants /ㄹ/, /ㅈ/, /ㅎ/ are separated into initial sound and final consonant, according to their position in the syllable, word, and sentence, respectively. We therefore re-define 39 PLUs by unifying the separated initial-sound, medial-vowel, and final-consonant variants of each phoneme in the 48 PLUs, in order to construct the CD acoustic models effectively. In word recognition experiments with context-independent (CI) acoustic models, the 48 PLUs achieve on average 7.06% higher recognition accuracy than the 39 PLUs. In speaker-independent word recognition experiments with the CD acoustic models, however, the 39 PLUs achieve on average 0.61% higher recognition accuracy than the 48 PLUs. In the 4-continuous-digit recognition experiments with liaison phenomena, the 39 PLUs also achieve on average 6.55% higher recognition accuracy, and in continuous speech recognition experiments they achieve on average 15.08% higher recognition accuracy than the 48 PLUs. Finally, although both the 48 and 39 PLUs yield lower recognition accuracy in the task-independent word recognition experiments because of unknown contextual factors, the 39 PLUs achieve on average 1.17% higher recognition accuracy than the 48 PLUs. These experiments verify the effectiveness of the re-defined 39 PLUs, compared with the 48 PLUs, for constructing the CD acoustic models.

A Quality Identification System for Molding Parts Using HTM-Based Sound Recognition (HTM 기반의 소리 인식을 이용한 부품의 양.불량 판별 시스템)

  • Bae, Sun-Gap;Han, Chang-Young;Seo, Dae-Ho;Kim, Sung-Jin;Bae, Jong-Min;Kang, Hyun-Syug
    • Journal of Korea Multimedia Society / v.13 no.10 / pp.1494-1505 / 2010
  • A variety of sounds occur in small and medium-sized factories that produce many kinds of parts in small quantities with a single press. We developed a system that identifies the quality of parts using HTM (Hierarchical Temporal Memory)-based sound recognition. HTM is a theory, proposed by Jeff Hawkins, that applies the operating principle of the human brain's neocortex to computers. It memorizes temporal and spatial patterns of the real world hierarchically and is known for cognitive power superior to previous recognition technologies in many cases. By applying the HTM model to sound recognition, we developed a quality identification system for molding parts. To verify its performance, we recorded the various sounds produced at the moment parts are made in a real factory, constructed an HTM network for the sounds, and identified the quality of parts through repeated learning and training. The results show that the system identifies part quality accurately even in a noisy factory.

Sound Model Generation using Most Frequent Model Search for Recognizing Animal Vocalization (최대 빈도모델 탐색을 이용한 동물소리 인식용 소리모델생성)

  • Ko, Youjung;Kim, Yoonjoong
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.10 no.1 / pp.85-94 / 2017
  • In this paper, we propose a sound model generation algorithm and a most frequent model search algorithm for recognizing animal vocalizations. The sound model generation algorithm generates an optimal set of models by repeating the training process, the Viterbi search process, and the most frequent model search process while adjusting the HMM (Hidden Markov Model) structure to improve the global recognition rate. The most frequent model search algorithm searches the list of models produced by the Viterbi search algorithm for the most frequent model and takes it as the final recognition decision. The system is implemented using MFCC (Mel Frequency Cepstral Coefficient) as the sound feature, HMM as the model, and the C# programming language. To evaluate the algorithm, a set of animal sounds from 27 species was prepared, and the experiment showed that the sound model generation algorithm generates 27 HMM models with a recognition rate of 97.29 percent.
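
The decision step described above, picking the model that appears most often in the list returned by the Viterbi search, can be sketched as a simple majority vote. The sketch assumes the Viterbi stage yields one model label per decoded segment; the function name, the label strings, and the tie-breaking behaviour (first label seen wins) are illustrative assumptions, not details from the paper, which was implemented in C#.

```python
from collections import Counter


def most_frequent_model(viterbi_labels):
    """Return the label that occurs most often in a list of Viterbi decisions.

    Ties go to the label that appears first in the list, which is an
    assumption of this sketch rather than a documented rule of the paper.
    """
    if not viterbi_labels:
        raise ValueError("no Viterbi decisions to vote on")
    label, _count = Counter(viterbi_labels).most_common(1)[0]
    return label


# Hypothetical per-segment decisions for one recording.
decision = most_frequent_model(["magpie", "crow", "magpie", "magpie", "crow"])
```

Voting over several segment-level decisions makes the final answer less sensitive to a single misrecognized segment than taking only the best-scoring path.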

Development of the Mechanical Timer's Gear Sound Recognition system (기계식 타이머의 치차음 인식 시스템 개발)

  • 서영호;이돈진;안중환
    • Proceedings of the Korean Society of Precision Engineering Conference / 2001.04a / pp.217-220 / 2001
  • We have developed a gear sound recognition system for mechanical timers. A mechanical timer has better endurance than an electronic timer, so it is reliable in severe operating environments. Because it is assembled from several kinds of gears, it emits mechanical gear sounds when it operates. We chose a microphone to detect the gear sound because it is an inexpensive, non-contact sensor and is more efficient and convenient than other sensors. For ease of measurement, we designed real-time processing software with a graphical user interface.


An Experimental Study on the Optimistic Recognition Level of Public Address System as a Soundscape Application Facility (사운드스케이프 적용을 위한 옥외 P.A. 시스템의 적정 인지레벨에 관한 실험적 연구)

  • Song, Min-Jeong;Jang, Gil-Soo
    • Transactions of the Korean Society for Noise and Vibration Engineering / v.17 no.11 / pp.1050-1055 / 2007
  • A P.A. (public address) system is considered a useful active soundscape appliance that can give a place identity and vitality by introducing conventional music, environmental music, birdsong, and similar sounds. The main aim of this study is to determine the optimal distance from the speaker and the optimal sound pressure level range of the introduced sound. To this end, the sound pressure level of the P.A. system was measured at various distances, and subjects' responses to level variations were checked. The main results are as follows: a level range from 64 dB to 71 dB is comfortable for subjects, and the optimal level of the introduced sound is related to the characteristics of the sound source. The results could be used for street furniture location design and for setting P.A. system output levels.
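
The study measured levels at several distances; the abstract gives no propagation model. As a rough sketch only, the block below uses the free-field point-source approximation (level drops by 20·log10 of the distance ratio, about 6 dB per doubling) to estimate where a speaker would fall inside the 64 dB to 71 dB range reported above. The reference level of 91 dB at 1 m and both function names are hypothetical; real outdoor sites with reflections and background noise will deviate from this idealization.

```python
import math


def level_at_distance(level_ref_db, ref_distance_m, distance_m):
    """Sound pressure level at distance_m, assuming free-field spherical spreading."""
    return level_ref_db - 20.0 * math.log10(distance_m / ref_distance_m)


def distance_for_level(level_ref_db, ref_distance_m, target_level_db):
    """Distance at which the level falls to target_level_db under the same assumption."""
    return ref_distance_m * 10.0 ** ((level_ref_db - target_level_db) / 20.0)


# Hypothetical speaker: 91 dB measured at 1 m.
near_m = distance_for_level(91.0, 1.0, 71.0)  # upper bound of the comfortable range
far_m = distance_for_level(91.0, 1.0, 64.0)   # lower bound of the comfortable range
```

Under these assumptions, listeners between `near_m` and `far_m` from the speaker would receive levels inside the reported comfortable range, which is the kind of estimate a street furniture layout could start from before on-site measurement.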

An Experimental Study on the Optimistic Recognition Level of Public Address System as a Soundscape Application Facility (사운드스케이프 적용을 위한 옥외 P.A. 시스템 적정 인지레벨에 관한 실험적 연구)

  • Song, Min-Jeong;Jang, Gil-Soo;Shin, Hoon;Shin, Young-Gyu;Lee, Tai-Kang
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2006.05a / pp.726-729 / 2006
  • As an active soundscape facility, a P.A. system is a useful instrument for giving a place identity and vitality by playing music, environmental music, birdsong, and similar sounds. In this study, to determine the optimal distance and sound level range of the introduced sound, sound levels were measured at various distances and subjects' responses were collected by questionnaire. Subjects recommended levels from 64 dB to 71 dB. The optimal level of the introduced sound is related to the level variance of the sound source. The results could be used for street furniture location design and for setting P.A. system output levels.
