• Title/Summary/Keyword: Sound recognition

Search Result 311, Processing Time 0.143 seconds

A Merging Algorithm with the Discrete Wavelet Transform to Extract Valid Speech-Sounds (이산 웨이브렛 변환을 이용한 유효 음성 추출을 위한 머징 알고리즘)

  • Kim, Jin-Ok;Hwang, Dae-Jun;Paek, Han-Wook;Chung, Chin-Hyun
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.8 no.3
    • /
    • pp.289-294
    • /
    • 2002
  • A valid speech-sound block can be classified to provide important information for speech recognition. The classification of the speech-sound block comes from the MRA(multi-resolution analysis) property of the DWT(discrete wavelet transform), which is used to reduce the computational time for the pre-processing of speech recognition. The merging algorithm is proposed to extract valid speech-sounds in terms of position and frequency range. It needs some numerical methods for an adaptive DWT implementation and performs unvoiced/voiced classification and denoising. Since the merging algorithm can decide the processing parameters relating to voices only and is independent of system noises, it is useful for extracting valid speech-sounds. The merging algorithm has an adaptive feature for arbitrary system noises and an excellent denoising SNR(signal-to-nolle ratio).

Emotion Recognition Implementation with Multimodalities of Face, Voice and EEG

  • Udurume, Miracle;Caliwag, Angela;Lim, Wansu;Kim, Gwigon
    • Journal of information and communication convergence engineering
    • /
    • v.20 no.3
    • /
    • pp.174-180
    • /
    • 2022
  • Emotion recognition is an essential component of complete interaction between human and machine. The issues related to emotion recognition are a result of the different types of emotions expressed in several forms such as visual, sound, and physiological signal. Recent advancements in the field show that combined modalities, such as visual, voice and electroencephalography signals, lead to better result compared to the use of single modalities separately. Previous studies have explored the use of multiple modalities for accurate predictions of emotion; however the number of studies regarding real-time implementation is limited because of the difficulty in simultaneously implementing multiple modalities of emotion recognition. In this study, we proposed an emotion recognition system for real-time emotion recognition implementation. Our model was built with a multithreading block that enables the implementation of each modality using separate threads for continuous synchronization. First, we separately achieved emotion recognition for each modality before enabling the use of the multithreaded system. To verify the correctness of the results, we compared the performance accuracy of unimodal and multimodal emotion recognitions in real-time. The experimental results showed real-time user emotion recognition of the proposed model. In addition, the effectiveness of the multimodalities for emotion recognition was observed. Our multimodal model was able to obtain an accuracy of 80.1% as compared to the unimodality, which obtained accuracies of 70.9, 54.3, and 63.1%.

지역교차로 교통사고 자동검지시스템 개선을 위한 교차로 제 음향특성의 해석

  • Cho, Eul-Soo;Go, Young-Gwon;Kim, Jae-Yee
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2008.05a
    • /
    • pp.789-792
    • /
    • 2008
  • Actually, The present traffic accident detection system is subsisting limitation of accurate distinction under the crowded condition at intersection because the system depend upon mainly the image information at intersection and digital image processing techniques nearly all. To complement this insufficiency, this article aims to estimate the level of present technology and a realistic possibility by analyzing the acoustic characteristic of crash sound that we have to investigate for improvement of traffic accident detection rate at intersection. The skid sound of traffic accident is showed the special pattern at $1[kHz]{\sim}3[kHz}$ bandwidth when vehicles are almost never operated in and around intersection. Also, the frequency bandwidth of vehicle crash sound is showed sound pressure difference over 30[dB] higher than when there is no occurrence of traffic accident below 500[Hz].

  • PDF

Sound Monitoring System of Machining using the Statistical Features of Frequency Domain and Artificial Neural Network (주파수 영역의 통계적 특징과 인공신경망을 이용한 기계가공의 사운드 모니터링 시스템)

  • Lee, Kyeong-Min;Vununu, Caleb;Lee, Suk-Hwan;Kwon, Ki-Ryong
    • Journal of Korea Multimedia Society
    • /
    • v.21 no.8
    • /
    • pp.837-848
    • /
    • 2018
  • Monitoring technology of machining has a long history since unmanned machining was introduced. Despite the long history, many researchers have presented new approaches continuously in this area. Sound based machine fault diagnosis is the process consisting of detecting automatically the damages that affect the machines by analyzing the sounds they produce during their operating time. The collected sound is corrupted by the surrounding work environment. Therefore, the most important part of the diagnosis is to find hidden elements inside the data that can represent the error pattern. This paper presents a feature extraction methodology that combines various digital signal processing and pattern recognition methods for the analysis of the sounds produced by tools. The magnitude spectrum of the sound is extracted using the Fourier analysis and the band-pass filter is applied to further characterize the data. Statistical functions are also used as input to the nonlinear classifier for the final response. The results prove that the proposed feature extraction method accurately captures the hidden patterns of the sound generated by the tool, unlike the conventional features. Therefore, it is shown that the proposed method can be applied to a sound based automatic diagnosis system.

A Study on Sound Design to Improve Regional Image -Focused on the Jeonju Area- (지역이미지 활성을 위한 사운드 디자인에 관한 연구)

  • Kim, Mee-Shuk;Chung, Sung-Whan;Hyoun, Sung-Eun
    • Science of Emotion and Sensibility
    • /
    • v.10 no.4
    • /
    • pp.613-622
    • /
    • 2007
  • Recently, sound design is being made by corporations for the production as well as for marketing and web with consideration of image of productions and attributes to improve corporational image through the design of melody which would remain in users. And sound is becoming an important factor to establish the identity of each area such as life environment and public facilities. At present, our local governments are promoting active business like as CIP to improve urban image but there is a limit to establish identity as the result of its partial focus on visual sense or insufficient recognition about it. Jeonju, the place of sound, has many festivals and great meetings related with sound but it has not identity in the sense of sound. So the purpose of this study is to suggest the condition of sound which has the trait of Jeonju and to provide data for the trait to be used as a necessary element to establish identity in order to activate regional image. For the method of research, sampling Korean beautiful 100 sounds among the natural sounds of residents. most favorite as the samples of sound to search the sound of regional image. Selecting favorite samples among them and analyzed the factors through the questionnaire on the image of adjective in each sample. As the result of analysis, it has been shown that the factor of sound to reveal trait of Jeonju is the image of bright, delight, and cozy with consideration of harmony, dynamics, contrast, and culture. For this study is to provide data so it can be used to actively establish and identify the local image.

  • PDF

Decision Tree Learning Algorithms for Learning Model Classification in the Vocabulary Recognition System (어휘 인식 시스템에서 학습 모델 분류를 위한 결정 트리 학습 알고리즘)

  • Oh, Sang-Yeob
    • Journal of Digital Convergence
    • /
    • v.11 no.9
    • /
    • pp.153-158
    • /
    • 2013
  • Target learning model is not recognized in this category or not classified clearly failed to determine if the vocabulary recognition is reduced. Form of classification learning model is changed or a new learning model is added to the recognition decision tree structure of the model should be changed to a structural problem. In order to solve these problems, a decision tree learning model for classification learning algorithm is proposed. Phonological phenomenon reflected sound enough to configure the database to ensure learning a decision tree learning model for classifying method was used. In this study, the indoor environment-dependent recognition and vocabulary words for the experimental results independent recognition vocabulary of the indoor environment-dependent recognition performance of 98.3% in the experiment showed, vocabulary independent recognition performance of 98.4% in the experiment shown.

An Implementation of Security System Using Speaker Recognition Algorithm (화자인식 알고리즘을 이용한 보안 시스템 구축)

  • Shin, You-Shik;Park, Kee-Young;Kim, Chong-Kyo
    • Journal of the Korean Institute of Telematics and Electronics T
    • /
    • v.36T no.4
    • /
    • pp.17-23
    • /
    • 1999
  • This paper described a security system using text-independent speaker recognition algorithm. Security system is based on PIC16F84 and sound card. Speaker recognition algorithm applied a k-means based model and weighted cepstrum for speech features. As the experimental results, recognition rate of the training data is 100%, non-training data is 99%. Also false rejection rate is 1%, false acceptance rate is 0% and verification mean error rate is 0.5% for registered 5 persons.

  • PDF

A Study on the urban housewives wedding behavior and satisfaction - focus on the housewives who have been married for less than five years - (도시주부의 혼례행동 및 혼례만족에 관한 연구 - 결혼 5년 이내의 주부를 중심으로 -)

  • 이정우;김명나
    • Journal of Family Resource Management and Policy Review
    • /
    • v.1 no.2
    • /
    • pp.1-15
    • /
    • 1997
  • The purpose of this study is to investigate (1)the level of the urban housewives’behavior and satisfaction of wedding, (2)the influential factors related to the two dependent variables above mentioned. So that provides some fundamental materials to improve the level of sound wedding culture and the whole home living. The subjects were 356 housewives, in April, 1997, Seoul. The data obtained were analyzed by Mean, Pearson’s correlation, Stepwise Multiple Regression and Path Analysis. The major findings were as follows: 1) The general tendency of the housewives’wedding behavior and satisfaction was reasonable. 2) According to the background variables(ie: marital form, the existence of job, the recognition degree of her husband’s family’s living standards, the recognition degree of her parents’home’s living standards, the perception of marital transactions), the housewives’wedding behavior was significantly different. 3) According to (1)the background variables(ie: communication frequency in household, self-acceptance, the adequacy of household income, educational level), (2)intermediated variable(ie: articles essential to a marriage), the housewives’wedding satisfaction was significantly different. 4) The indirect variable of the positive influence for housewives’satisfaction of wedding was marital form, the existence of job. the indirect variable of the negative influence for housewives’satisfaction of wedding was the recognition degree of her husband’s family’s living standards, the recognition degree of her parents’home’s living standards, the perception of marital transfactions.

  • PDF

Interactive Game Designed for Early Child using Multimedia Interface : Physical Activities (멀티미디어 인터페이스 기술을 이용한 유아 대상의 체감형 게임 설계 : 신체 놀이 활동 중심)

  • Won, Hye-Min;Lee, Kyoung-Mi
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.3
    • /
    • pp.116-127
    • /
    • 2011
  • This paper proposes interactive game elements for children : contents, design, sound, gesture recognition, and speech recognition. Interactive games for early children must use the contents which reflect the educational needs and the design elements which are all bright, friendly, and simple to use. Also the games should consider the background music which is familiar with children and the narration which make easy to play the games. In gesture recognition and speech recognition, the interactive games must use gesture and voice data which hits to the age of the game user. Also, this paper introduces the development process for the interactive skipping game and applies the child-oriented contents, gestures, and voices to the game.

Robot System Design Capable of Motion Recognition and Tracking the Operator's Motion (사용자의 동작인식 및 모사를 구현하는 로봇시스템 설계)

  • Choi, Yonguk;Yoon, Sanghyun;Kim, Junsik;Ahn, YoungSeok;Kim, Dong Hwan
    • Journal of the Korean Society of Manufacturing Technology Engineers
    • /
    • v.24 no.6
    • /
    • pp.605-612
    • /
    • 2015
  • Three dimensional (3D) position determination and motion recognition using a 3D depth sensor camera are applied to a developed penguin-shaped robot, and its validity and closeness are investigated. The robot is equipped with an Asus Xtion Pro Live as a 3D depth camera, and a sound module. Using the skeleton information from the motion recognition data extracted from the camera, the robot is controlled so as to follow the typical three mode-reactions formed by the operator's gestures. In this study, the extraction of skeleton joint information using the 3D depth camera is introduced, and the tracking performance of the operator's motions is explained.