• Title/Summary/Keyword: Sound recognition

Computerization and Application of the Korean Standard Pronunciation Rules (한국어 표준발음법의 전산화 및 응용)

  • 이계영;임재걸
    • Language and Information
    • /
    • v.7 no.2
    • /
    • pp.81-101
    • /
    • 2003
  • This paper introduces a computerized version of the Korean Standard Pronunciation Rules for use in speech engineering systems such as Korean speech synthesis and recognition systems. We build a Petri net model for each item of the Standard Pronunciation Rules and integrate the models into a sound conversion table. Applied in reverse, the Standard Pronunciation Rules govern how sounds are mapped back to grammatically correct written characters, so the paper presents not only the sound conversion table but also a character conversion table obtained by inverting it. Using these tables, we implemented a Korean character-to-sound conversion system and a Korean sound-to-character conversion system, and tested them with data sets covering every item of the Standard Pronunciation Rules to verify the soundness and completeness of the tables. The test results show that the tables also improve processing speed, in addition to being sound and complete.
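
To make the idea of a table-driven character-to-sound conversion concrete, here is a minimal sketch that is not taken from the paper. It covers a single rule only, liaison (a final consonant moving to the onset of a following vowel-initial syllable), and ignores every other item of the Standard Pronunciation Rules, including palatalization, so its output is not correct for inputs such as 굳이. The names `FINAL_TO_INITIAL` and `to_sound` are hypothetical; the index arithmetic follows the Unicode layout of precomposed Hangul syllables.

```python
# Toy sound conversion table for ONE rule (liaison); names are hypothetical.
HANGUL_BASE = 0xAC00           # first precomposed Hangul syllable (가)
HANGUL_LAST = 0xD7A3           # last precomposed Hangul syllable (힣)
NUM_MEDIALS, NUM_FINALS = 21, 28
SILENT_INITIAL = 11            # index of initial ㅇ, which carries no sound

# Final-consonant index -> initial-consonant index (single and double finals).
# Final ㅇ and ㅎ are deliberately left out: they do not simply move over.
FINAL_TO_INITIAL = {1: 0, 2: 1, 4: 2, 7: 3, 8: 5, 16: 6, 17: 7,
                    19: 9, 20: 10, 22: 12, 23: 14, 24: 15, 25: 16, 26: 17}


def is_syllable(ch):
    return HANGUL_BASE <= ord(ch) <= HANGUL_LAST


def decompose(ch):
    """Split a syllable into (initial, medial, final) indices."""
    offset = ord(ch) - HANGUL_BASE
    initial = offset // (NUM_MEDIALS * NUM_FINALS)
    medial = (offset // NUM_FINALS) % NUM_MEDIALS
    final = offset % NUM_FINALS
    return initial, medial, final


def compose(initial, medial, final):
    """Build a syllable from (initial, medial, final) indices."""
    return chr(HANGUL_BASE + (initial * NUM_MEDIALS + medial) * NUM_FINALS + final)


def to_sound(text):
    """Apply liaison left to right; non-Hangul characters pass through."""
    chars = list(text)
    for k in range(len(chars) - 1):
        if not (is_syllable(chars[k]) and is_syllable(chars[k + 1])):
            continue
        i1, m1, f1 = decompose(chars[k])
        i2, m2, f2 = decompose(chars[k + 1])
        if i2 == SILENT_INITIAL and f1 in FINAL_TO_INITIAL:
            chars[k] = compose(i1, m1, 0)                          # drop the final
            chars[k + 1] = compose(FINAL_TO_INITIAL[f1], m2, f2)   # reuse it as onset
    return "".join(chars)
```

For example, `to_sound("옷이")` yields `"오시"`. Inverting such a table for sound-to-character conversion is ambiguous in general, which is presumably why the paper treats the character conversion table as a separate artifact.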

The Method of Elevation Accuracy In Sound Source Localization System (음원 위치 추정 시스템의 정확도 향상 방법)

  • Kim, Yong-Eun;Chung, Jin-Gyun
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.46 no.2
    • /
    • pp.24-29
    • /
    • 2009
  • Sound source localization systems are used in robots, video conferencing, and CCTV (closed-circuit television) systems. Because these systems are applied to human speakers, they receive a number of sound data frames while a person is speaking. In this paper, we propose a method that reduces angle estimation error by selecting, from the input sound data frames, those frames from which the angle can be computed more precisely. After the selected frames are converted to angles, the error of the sound source localization system is reduced further by applying a median filter. Experiments with the proposed system show that the average angle estimation error can be reduced by up to 31%.
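
The filtering step can be illustrated with a short sketch. It assumes the filter in question is a sliding median over the per-frame angle estimates; the window size of 3 and the function name `median_filter` are illustrative choices, not values from the paper. The frame selection criterion is not stated in the abstract and is therefore left out.

```python
from statistics import median


def median_filter(angles, window=3):
    """Sliding median over per-frame angle estimates (degrees).

    Windows are truncated at both ends of the sequence, so the output
    has the same length as the input.
    """
    half = window // 2
    return [median(angles[max(0, i - half): i + half + 1])
            for i in range(len(angles))]


# A single outlier frame (90 degrees) is suppressed by the filter.
estimates = [30, 31, 90, 32, 30]
smoothed = median_filter(estimates)
```

A median is used here rather than a mean because one badly estimated frame then shifts the result by at most one rank position instead of dragging the average toward the outlier.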

Machine Fault Diagnosis Method based on DWT Power Spectral Density using Multi Pattern Recognition (다중 패턴 인식 기법을 이용한 DWT 전력 스펙트럼 밀도 기반 기계 고장 진단 기법)

  • Kang, Kyung-Won;Lee, Kyeong-Min;Vununu, Caleb;Kwon, Ki-Ryong
    • Journal of Korea Multimedia Society
    • /
    • v.22 no.11
    • /
    • pp.1233-1241
    • /
    • 2019
  • The goal of sound-based mechanical fault diagnosis is to automatically find abnormal signals in a machine using acoustic emission. Conventional methods based on mathematical models have been found to be inaccurate because of the complexity of industrial mechanical systems and the presence of nonlinear factors such as noise. A fault diagnosis problem can therefore be treated as a pattern recognition problem. We propose an automatic fault diagnosis method that combines the discrete wavelet transform (DWT) and power spectral density (PSD) with multi-pattern recognition. First, we perform DWT-based filtering for noise cancelling and effective feature extraction. Next, the PSD is computed on each subband of the DWT to extract feature vectors of the sound. Finally, the features extracted from each PSD are passed to a classifier using multi-pattern recognition. The results show that the proposed method can be used not only to detect faults effectively but also in various sound-based automatic diagnosis systems.
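
The first two stages (subband decomposition, then a PSD per subband) can be sketched as follows. This is an illustration under assumptions: the abstract does not name the wavelet, so the Haar wavelet is used because it is the simplest to write out, and the PSD is estimated with a plain periodogram. The function names and the two-level depth are hypothetical, and the classifier stage is not shown.

```python
import cmath
import math


def haar_dwt(signal):
    """One level of the Haar DWT: returns (approximation, detail).

    A trailing sample is dropped if the signal length is odd.
    """
    scale = 1 / math.sqrt(2)
    pairs = [(signal[i], signal[i + 1]) for i in range(0, len(signal) - 1, 2)]
    approx = [(a + b) * scale for a, b in pairs]
    detail = [(a - b) * scale for a, b in pairs]
    return approx, detail


def periodogram(x):
    """Naive periodogram: |DFT(x)[k]|^2 / N for bins k = 0 .. N // 2."""
    n = len(x)
    psd = []
    for k in range(n // 2 + 1):
        coeff = sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        psd.append(abs(coeff) ** 2 / n)
    return psd


def subband_features(signal, levels=2):
    """PSD of each detail subband, followed by the PSD of the last approximation."""
    features = []
    approx = list(signal)
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        features.append(periodogram(detail))
    features.append(periodogram(approx))
    return features
```

The naive DFT is quadratic in the subband length, which is fine for a sketch; a real pipeline would use an FFT and most likely a smoother wavelet.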

Sound's Direction Detection and Speech Recognition System for Humanoid Active Audition

  • Kim, Hyun-Don;Choi, Jong-Suk;Lee, Chang-Hoon;Park, Gwi-Tea;Kim, Mun-Sang
    • Proceedings of the Institute of Control, Robotics and Systems Conference (제어로봇시스템학회:학술대회논문집)
    • /
    • 2003.10a
    • /
    • pp.633-638
    • /
    • 2003
  • In this paper, we propose a humanoid active audition system that detects the direction of a sound and performs speech recognition using just three microphones. Compared with previous research, the system uses a simpler algorithm, fewer microphones, and a better amplifier, yet shows better performance. To verify its performance, we installed the proposed active audition system on Hombot II, a home service robot developed at KIST (Korea Institute of Science and Technology), and confirmed excellent performance through experiments.

Lip Region Extraction by Gaussian Classifier (가우스 분류기를 이용한 입술영역 추출)

  • Kim, Jeong Yeop
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.2
    • /
    • pp.108-114
    • /
    • 2017
  • Lip reading is a field of image processing that assists sound recognition. In some environments, the captured sound signal contains significant noise, so the recognition rate for the sound signal decreases; lip reading can provide a useful feature for raising recognition rates. Many conventional lip extraction methods have been proposed. Maia et al. proposed a method based on the sum of Cr and Cb. However, it has two problems: the point of maximum saturation does not always belong to the lip region, and inner parts of the mouth such as the oral cavity and teeth can be classified as lips. To solve these problems, this paper proposes a method that adopts a histogram-based classifier for extracting the lip region. The proposed method consists of two stages, learning and testing. Because it requires no color conversion, the amount of computation is kept small. The proposed method achieves a detection rate of 66.8%, compared with 28% for conventional methods.
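
A two-stage histogram classifier of the kind described can be sketched as below. The details are assumptions rather than the paper's design: pixels are taken to be raw RGB triples (one reading of "no color conversion"), each channel is quantized into 8 levels, and a pixel is labelled as lip when the normalized lip histogram gives its bin more mass than the non-lip histogram. All names are hypothetical.

```python
from collections import Counter

BINS = 8  # assumed quantization: 8 levels per RGB channel


def quantize(pixel):
    """Map an (R, G, B) pixel with 0-255 channels to its histogram bin."""
    step = 256 // BINS
    return tuple(channel // step for channel in pixel)


def learn(lip_pixels, other_pixels):
    """Learning stage: build one color histogram per class."""
    return Counter(map(quantize, lip_pixels)), Counter(map(quantize, other_pixels))


def is_lip(pixel, lip_hist, other_hist):
    """Test stage: compare the normalized histogram mass of the pixel's bin."""
    key = quantize(pixel)
    p_lip = lip_hist[key] / max(1, sum(lip_hist.values()))
    p_other = other_hist[key] / max(1, sum(other_hist.values()))
    return p_lip > p_other


# Tiny illustrative training set (made-up pixel values).
lip_hist, other_hist = learn(
    lip_pixels=[(200, 60, 80), (210, 70, 90)],
    other_pixels=[(220, 180, 160), (90, 60, 50)],
)
```

With these made-up samples, `is_lip((205, 65, 85), lip_hist, other_hist)` is true because that pixel falls in a bin seen only among the lip samples. A pixel whose bin was never seen in either class is labelled non-lip.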

VOICE CONTROL SYSTEM FOR TELEVISION SET USING MASKING MODEL AS A FRONT-END OF SPEECH RECOGNIZER

  • Usagawa, Tsuyoshi;Iwata, Makoto;Ebata, Masanao
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06a
    • /
    • pp.991-996
    • /
    • 1994
  • Surrounding noise often degrades the performance of a speech recognition system used in an office or at home. The situation is especially serious when colored, nonstationary noise, such as sound from a television or other audio equipment, is present. The authors previously proposed a voice control system for a television set using an adaptive noise canceler, which works well even when the sound of the television set is at a level comparable to that of the speech. In this paper, a new front-end for speech recognition is introduced for the voice control system. This front-end uses a simplified masking model to reduce the effect of residual noise. According to experimental results, 90% correct recognition is achieved even when the level of the television sound is almost 15 dB higher than that of the speech.

The Development of Concepts on Sound Propagation of Children (소리의 전달에 대한 아동의 개념 발달 연구)

  • Shin, Eunsoo;Kim, Eunjung
    • Korean Journal of Child Studies
    • /
    • v.34 no.4
    • /
    • pp.19-36
    • /
    • 2013
  • This study examined children's concept development with respect to sound propagation. The subjects were 240 five-, seven-, and ten-year-old children from three kindergartens and two primary schools. The instrument used was the examination instrument on sound developed by the SPACE Project in the UK. The concepts related to sound were classified into five stages: no recognition, egocentric concept, initial mental models, synthetic mental models, and scientific mental models. The results revealed significant differences in the types of concept the children held, according to age and context. Most five-, seven-, and ten-year-old children revealed egocentric concepts in the every sounds, drum, and rubber band contexts; however, most of them revealed recognition of synthetic models of sound propagation in the string telephone context. These results have implications for the content of science education for children with regard to developing concepts of sound propagation.

Voice Driven Sound Sketch for Animation Authoring Tools (애니메이션 저작도구를 위한 음성 기반 음향 스케치)

  • Kwon, Soon-Il
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.4
    • /
    • pp.1-9
    • /
    • 2010
  • Authoring tools for sketching the motion of animated characters have been studied. However, natural interfaces for sound editing have not been sufficiently studied. In this paper, I present a novel method in which a sound sample is selected by speaking a sound-imitation word (onomatopoeia). Experiments with a method based on statistical models, which are generally used for pattern recognition, showed recognition accuracy of up to 97%. In addition, to address the difficulty of collecting data for newly enrolled sound samples, a GLR test based on only one sample of each sound-imitation word showed almost the same accuracy as the previous method.

Application of Speech Recognition with Closed Caption for Content-Based Video Segmentations

  • Son, Jong-Mok;Bae, Keun-Sung
    • Speech Sciences
    • /
    • v.12 no.1
    • /
    • pp.135-142
    • /
    • 2005
  • An important aspect of video indexing is the ability to segment video into meaningful segments, i.e., content-based video segmentation. Since the audio signal in the sound track is synchronized with the image sequences in the video program, a speech signal in the sound track can be used to segment the video into meaningful segments. In this paper, we propose a new approach to content-based video segmentation. The approach uses closed captions to construct a recognition network for speech recognition. Accurate time information for video segmentation is then obtained from the speech recognition process. In a video segmentation experiment on TV news programs, we successfully produced 56 video summaries from 57 TV news stories. This demonstrates that the proposed scheme is very promising for content-based video segmentation.

Sound Based Machine Fault Diagnosis System Using Pattern Recognition Techniques

  • Vununu, Caleb;Moon, Kwang-Seok;Lee, Suk-Hwan;Kwon, Ki-Ryong
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.2
    • /
    • pp.134-143
    • /
    • 2017
  • Machine fault diagnosis covers all studies that aim to automatically detect faults or damage in machines. In general, it is very difficult to diagnose a machine fault with conventional methods based on mathematical models because of the complexity of real-world systems and the presence of nonlinear factors. This study develops an automatic machine fault diagnosis system that uses pattern recognition techniques such as principal component analysis (PCA) and artificial neural networks (ANN). The sounds emitted by the operating machine, a drill in this case, are recorded and analyzed under different operating conditions. The specific machine conditions considered are an undamaged drill and a defective drill with wear. PCA is first used to reduce the dimensionality of the original sound data. The first principal components are then used as inputs to a neural-network-based classifier that separates normal and defective drill sound data. The results show that the proposed PCA-ANN method can be used for a sound-based automated diagnosis system.
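
The dimensionality reduction step can be illustrated with a small sketch that extracts only the first principal component by power iteration on the sample covariance matrix. This is a simplification chosen to stay dependency-free, not the paper's implementation: a real system would keep several components and pass them to the neural network classifier, which is not sketched here. The function name, iteration count, and starting vector are assumptions.

```python
import math


def first_principal_component(rows, iters=200):
    """Return (direction, scores) for the first principal component.

    rows: list of equal-length feature vectors, one per sound sample.
    direction: unit vector of largest variance (its sign is arbitrary).
    scores: projection of each centered sample onto that direction.
    """
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    cov = [[sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(d)]
           for a in range(d)]

    # Slightly asymmetric start so it is unlikely to be orthogonal to the answer.
    v = [1.0 + 0.01 * j for j in range(d)]
    for _ in range(iters):
        w = [sum(cov[a][b] * v[b] for b in range(d)) for a in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0:  # zero-variance data: keep the current vector
            break
        v = [x / norm for x in w]

    scores = [sum(c[j] * v[j] for j in range(d)) for c in centered]
    return v, scores
```

Each sample is thereby reduced from `d` features to a single score. Repeating the procedure on the residual (deflation) would give further components.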