• Title/Summary/Keyword: Acoustic feature extraction

Search Result 56, Processing Time 0.02 seconds

Identification of Underwater Ambient Noise Sources Using MFCC (MFCC를 이용한 수중소음원의 식별)

  • Hwang, Do-Jin;Kim, Jea-Soo
    • Proceedings of the Korea Committee for Ocean Resources and Engineering Conference
    • /
    • 2006.11a
    • /
    • pp.307-310
    • /
    • 2006
  • Underwater ambient noise originating from the geophysical, biological, and man-made acoustic sources contains much information on the sources and the ocean environment affecting the performance of the sonar equipments. In this paper, a set of feature vectors of the ambient noises using MFCC is proposed and extracted to form a data base for the purpose of identifying the noise sources. The developed algorithm for the pattern recognition is applied to the observed ocean data, and the initial results are presented and discussed.

  • PDF

Analyzing the Acoustic Elements and Emotion Recognition from Speech Signal Based on DRNN (음향적 요소분석과 DRNN을 이용한 음성신호의 감성 인식)

  • Sim, Kwee-Bo;Park, Chang-Hyun;Joo, Young-Hoon
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.1
    • /
    • pp.45-50
    • /
    • 2003
  • Recently, robots technique has been developed remarkably. Emotion recognition is necessary to make an intimate robot. This paper shows the simulator and simulation result which recognize or classify emotions by learning pitch pattern. Also, because the pitch is not sufficient for recognizing emotion, we added acoustic elements. For that reason, we analyze the relation between emotion and acoustic elements. The simulator is composed of the DRNN(Dynamic Recurrent Neural Network), Feature extraction. DRNN is a learning algorithm for pitch pattern.

Tool Condition Monitoring using AE Signal in Micro Endmilling (마이크로 엔드밀링에서 AE 신호를 이용한 공구상태 감시)

  • Kang Ik Soo;Jeong Yun Sik;Kwon Dong Hee;Kim Jeon Ha;Kim Jeong Suk;Ahn Jung Hwan
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.23 no.1 s.178
    • /
    • pp.64-71
    • /
    • 2006
  • Ultraprecision machining and MEMS technology have been taken more and more important position in machining of microparts. Micro endmilling is one of the prominent technology that has wide spectrum of application field ranging from macro parts to micro products. Also, the method of micro-grooving using micro endmill is used widely owing to many merit, but has problems of precision and quality of products due to tool wear and tool fracture. This investigation deals with state monitoring using acoustic emission(AE) signal in the micro-grooving. Characteristic evaluation of AE raw signal, AE hit and frequency analysis for condition monitoring is presented. Also, the feature extraction of AE signal directly related to machining process is executed. Then, the distinctive micro endmill state according to the each tool condition is classified by the fuzzy C-means algorithm.

A Study on Auto-Classification of Acoustic Emission Signals Using Wavelet Transform and Neural Network (웨이블렛 변환과 신경망을 이용한 음향방출신호의 자동분류에 관한연구)

  • Park, Jae-Jun;Kim, Meyoun-Soo;Oh, Seung-Heon;Kang, Tae-Rim;Kim, Sung-Hong;Beak, Kwan-Hyun;Oh, Il-Duck;Song, Young-Chul;Kwon, Dong-Jin
    • Proceedings of the KIEE Conference
    • /
    • 2000.07c
    • /
    • pp.1880-1884
    • /
    • 2000
  • The discrete wavelet transform is utilized as preprocessing of Neural Network(NN) to identify aging state of internal partial discharge in transformer. The discrete traveler transform is used to produce wavelet coefficients which are used for Classification. The statistical parameters (maximum of wavelet coefficients, average value, dispersion, skewness, kurtosis) using the wavelet coefficients are input into an back-propagation neural network. The neurons whose weights have obtained through Result of Cross-Validation. The Neural Network learning stops either when the error rate achieves an appropriate minimum or when the learning time overcomes a constant value. The networks, after training, can decide if the test signal is Early Aging State or Last Aging State or normal state.

  • PDF

Automatic proficiency assessment of Korean speech read aloud by non-natives using bidirectional LSTM-based speech recognition

  • Oh, Yoo Rhee;Park, Kiyoung;Jeon, Hyung-Bae;Park, Jeon Gue
    • ETRI Journal
    • /
    • v.42 no.5
    • /
    • pp.761-772
    • /
    • 2020
  • This paper presents an automatic proficiency assessment method for a non-native Korean read utterance using bidirectional long short-term memory (BLSTM)-based acoustic models (AMs) and speech data augmentation techniques. Specifically, the proposed method considers two scenarios, with and without prompted text. The proposed method with the prompted text performs (a) a speech feature extraction step, (b) a forced-alignment step using a native AM and non-native AM, and (c) a linear regression-based proficiency scoring step for the five proficiency scores. Meanwhile, the proposed method without the prompted text additionally performs Korean speech recognition and a subword un-segmentation for the missing text. The experimental results indicate that the proposed method with prompted text improves the performance for all scores when compared to a method employing conventional AMs. In addition, the proposed method without the prompted text has a fluency score performance comparable to that of the method with prompted text.

Machine Fault Diagnosis Method based on DWT Power Spectral Density using Multi Patten Recognition (다중 패턴 인식 기법을 이용한 DWT 전력 스펙트럼 밀도 기반 기계 고장 진단 기법)

  • Kang, Kyung-Won;Lee, Kyeong-Min;Vununu, Caleb;Kwon, Ki-Ryong
    • Journal of Korea Multimedia Society
    • /
    • v.22 no.11
    • /
    • pp.1233-1241
    • /
    • 2019
  • The goal of the sound-based mechanical fault diagnosis technique is to automatically find abnormal signals in the machine using acoustic emission. Conventional methods of using mathematical models have been found to be inaccurate due to the complexity of industrial mechanical systems and the existence of nonlinear factors such as noise. Therefore, any fault diagnosis issue can be treated as a pattern recognition problem. We propose an automatic fault diagnosis method using discrete wavelet transform and power spectrum density using multi pattern recognition. First, we perform DWT-based filtering analysis for noise cancelling and effective feature extraction. Next, the power spectral density(PSD) is performed on each subband of the DWT in order to effectively extract feature vectors of sound. Finally, each PSD data is extracted with the features of the classifier using multi pattern recognition. The results show that the proposed method can not only be used effectively to detect faults as well as apply to various automatic diagnosis system based on sound.

A Study on Diagnosis of Partial Discharge Type Using Wavelet Transform-Neural Network (웨이블렛-신경망을 이용한 부분방전 종류와 진단에 관한연구)

  • Park, Jae-Jun;Jeon, Hyun-Gu;Jeon, Byung-Hoon;Kim, Sung-Hong;Kwon, Dong-Jin
    • Proceedings of the Korean Institute of Electrical and Electronic Material Engineers Conference
    • /
    • 2002.07b
    • /
    • pp.894-899
    • /
    • 2002
  • In this papers, we proposed the new method in order to diagnosis partial discharge type of transformers. For wavelet transform, Daubechies filter is used, we can obtain wavelet coefficients which is used to extract feature of statistical parameters (maximum value, average value, dispersion, skewness, kurtosis) about high frequency current signal per 3-electrode type (needle-plane electrode, IEC electrode and Void electrode.). Also. these coefficients are used to identify Signal of internal partial discharge in transformer. As a result. from compare of high frequency current signal amplitude and average value. we are obtained results of IEC electrode> Void electrode> Needle-Plane electrode. otherwise. In case of skewness and kurtosis, we are obtained results of Void electrode> IEC electrode > Needle-Plane electrode. As Improved method in order to diagnosis partial discharge type of transformers, we use neural network.

  • PDF

Implementation of HMM-Based Speech Recognizer Using TMS320C6711 DSP

  • Bae Hyojoon;Jung Sungyun;Bae Keunsung
    • MALSORI
    • /
    • no.52
    • /
    • pp.111-120
    • /
    • 2004
  • This paper focuses on the DSP implementation of an HMM-based speech recognizer that can handle several hundred words of vocabulary size as well as speaker independency. First, we develop an HMM-based speech recognition system on the PC that operates on the frame basis with parallel processing of feature extraction and Viterbi decoding to make the processing delay as small as possible. Many techniques such as linear discriminant analysis, state-based Gaussian selection, and phonetic tied mixture model are employed for reduction of computational burden and memory size. The system is then properly optimized and compiled on the TMS320C6711 DSP for real-time operation. The implemented system uses 486kbytes of memory for data and acoustic models, and 24.5 kbytes for program code. Maximum required time of 29.2 ms for processing a frame of 32 ms of speech validates real-time operation of the implemented system.

  • PDF

Design of a Korean Speech Recognition Platform (한국어 음성인식 플랫폼의 설계)

  • Kwon Oh-Wook;Kim Hoi-Rin;Yoo Changdong;Kim Bong-Wan;Lee Yong-Ju
    • MALSORI
    • /
    • no.51
    • /
    • pp.151-165
    • /
    • 2004
  • For educational and research purposes, a Korean speech recognition platform is designed. It is based on an object-oriented architecture and can be easily modified so that researchers can readily evaluate the performance of a recognition algorithm of interest. This platform will save development time for many who are interested in speech recognition. The platform includes the following modules: Noise reduction, end-point detection, met-frequency cepstral coefficient (MFCC) and perceptually linear prediction (PLP)-based feature extraction, hidden Markov model (HMM)-based acoustic modeling, n-gram language modeling, n-best search, and Korean language processing. The decoder of the platform can handle both lexical search trees for large vocabulary speech recognition and finite-state networks for small-to-medium vocabulary speech recognition. It performs word-dependent n-best search algorithm with a bigram language model in the first forward search stage and then extracts a word lattice and restores each lattice path with a trigram language model in the second stage.

  • PDF

Performance Comparison of Guitar Chords Classification Systems Based on Artificial Neural Network (인공신경망 기반의 기타 코드 분류 시스템 성능 비교)

  • Park, Sun Bae;Yoo, Do-Sik
    • Journal of Korea Multimedia Society
    • /
    • v.21 no.3
    • /
    • pp.391-399
    • /
    • 2018
  • In this paper, we construct and compare various guitar chord classification systems using perceptron neural network and convolutional neural network without pre-processing other than Fourier transform to identify the optimal chord classification system. Conventional guitar chord classification schemes use, for better feature extraction, computationally demanding pre-processing techniques such as stochastic analysis employing a hidden markov model or an acoustic data filtering and hence are burdensome for real-time chord classifications. For this reason, we construct various perceptron neural networks and convolutional neural networks that use only Fourier tranform for data pre-processing and compare them with dataset obtained by playing an electric guitar. According to our comparison, convolutional neural networks provide optimal performance considering both chord classification acurracy and fast processing time. In particular, convolutional neural networks exhibit robust performance even when only small fraction of low frequency components of the data are used.