• Title/Summary/Keyword: LPC Coefficients

Search Result 79, Processing Time 0.032 seconds

Neural-network-based Driver Drowsiness Detection System Using Linear Predictive Coding Coefficients and Electroencephalographic Changes (선형예측계수와 뇌파의 변화를 이용한 신경회로망 기반 운전자의 졸음 감지 시스템)

  • Chong, Ui-Pil;Han, Hyung-Seob
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.13 no.3
    • /
    • pp.136-141
    • /
    • 2012
  • One of the main reasons for serious road accidents is driving while drowsy. For this reason, drowsiness detection and warning system for drivers has recently become a very important issue. Monitoring physiological signals provides the possibility of detecting features of drowsiness and fatigue of drivers. One of the effective signals is to measure electroencephalogram (EEG) signals and electrooculogram (EOG) signals. The aim of this study is to extract drowsiness-related features from a set of EEG signals and to classify the features into three states: alertness, drowsiness, sleepiness. This paper proposes a neural-network-based drowsiness detection system using Linear Predictive Coding (LPC) coefficients as feature vectors and Multi-Layer Perceptron (MLP) as a classifier. Samples of EEG data from each predefined state were used to train the MLP program by using the proposed feature extraction algorithms. The trained MLP program was tested on unclassified EEG data and subsequently reviewed according to manual classification. The classification rate of the proposed system is over 96.5% for only very small number of samples (250ms, 64 samples). Therefore, it can be applied to real driving incident situation that can occur for a split second.

Electroencephalogram-based Driver Drowsiness Detection System Using AR Coefficients and SVM (AR계수와 SVM을 이용한 뇌파 기반 운전자의 졸음 감지 시스템)

  • Han, Hyungseob;Chong, Uipil
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.22 no.6
    • /
    • pp.768-773
    • /
    • 2012
  • One of the main reasons for serious road accidents is driving while drowsy. For this reason, drowsiness detection and warning system for drivers has recently become a very important issue. Monitoring physiological signals provides the possibility of detecting features of drowsiness and fatigue of drivers. One of the effective signals is to measure electroencephalogram (EEG) signals and electrooculogram (EOG) signals. The aim of this study is to extract drowsiness-related features from a set of EEG signals and to classify the features into three states: alertness, drowsiness, sleepiness. This paper proposes a drowsiness detection system using Linear Predictive Coding (LPC) coefficients and Support Vector Machine (SVM). Samples of EEG data from each predefined state were used to train the SVM program by using the proposed feature extraction algorithms. The trained SVM program was tested on unclassified EEG data and subsequently reviewed according to manual classification. The classification rate of the proposed system is over 96.5% for only very small number of samples (250ms, 64 samples). Therefore, it can be applied to real driving incident situation that can occur for a split second.

A new method of Extracting the Filter Characteristics of the Nasal Cavity Using Homorganic Nasal-Stop Sequences: A Preliminary Report (동기관음의 스펙트럼 차이를 이용한 비강 특성 산출: 예비 연구)

  • Park, Han-Sang
    • MALSORI
    • /
    • no.53
    • /
    • pp.17-35
    • /
    • 2005
  • A New Method of Extracting the Filter Characteristics of the Nasal Cavity Using Homorganic Nasal-Stop Sequences: A Preliminary R eportHansang ParkThis study provides a new method of extracting the filter characteristics of the nasal cavity. Korean lenis stops are realized as voiced in the homorganic nasal-lenis stop sequences between vowels. Since the only difference between the two members of the homorganic nasal- lenis stop sequences, such as [mb], [nd], and [ g], is whether the passage to the nasal cavity is open or not, the subtraction of the LPC spectrum of the voiced stop from that of the preceding nasal leads to the filter characteristics of the nasal cavity of an individual speaker regardless of place of articulation. The results suggest that various attempts should be made to extract a robust filter characteristics of the nasal cavity by giving variation to LPC coefficients and by paying particular attention to speech samples. This study is significant in that it provides a preliminary report about a new method of extracting the filter characteristics of the nasal cavity.

  • PDF

Classification of PVC(Premature Ventricular Contraction) using Radial Basis Function network (Radial Basis Function 네트워크를 이용한 PVC 분류)

  • Lee, J.;Lee, K.J.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1997 no.11
    • /
    • pp.439-442
    • /
    • 1997
  • In our research, we will extract diagnostic parameters by LPC method and wavelet transform. Then, we will design artificial neural network which is based on RBF that can express input features in terms of fuzzy. Because PVC(Premature Ventricular Contraction) has possibility to cause heart attack, the detection of PVC is a very significant problem. To deal with this problem, LPC method which gives different coefficients or different morphologies and wavelet transform which has superior localization nature of time-frequency, are used to extract effective parameters or classification of normal and PVC. Because RBF network can allocate an input feature to the membership degree of each category, total system will be more flexible.

  • PDF

Voice conversion using low dimensional vector mapping (낮은 차원의 벡터 변환을 통한 음성 변환)

  • Lee, Kee-Seung;Doh, Won;Youn, Dae-Hee
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.35S no.4
    • /
    • pp.118-127
    • /
    • 1998
  • In this paper, we propose a voice personality transformation method which makes one person's voice sound like another person's voice. In order to transform the voice personality, vocal tract transfer function is used as a transformation parameter. Comparing with previous methods, the proposed method can obtain high-quality transformed speech with low computational complexity. Conversion between the vocal tract transfer functions is implemented by a linear mapping based on soft clustering. In this process, mean LPC cepstrum coefficients and mean removed LPC cepstrum modeled by the low dimensional vector are used as transformation parameters. To evaluate the performance of the proposed method, mapping rules are generated from 61 Korean words uttered by two male and one female speakers. These rules are then applied to 9 sentences uttered by the same persons, and objective evaluation and subjective listening tests for the transformed speech are performed.

  • PDF

A Study on Speech Recognition by One Stage MSVQ/DP (One stage MSVQ/DP를 이용한 음성 인식에 관한연구)

  • Jeoung, Eui-Bung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.2
    • /
    • pp.5-12
    • /
    • 1994
  • This paper proposes One Stage MSVQ/DP method for word recognition system university administration branch names are selected for the recognition experiment and 10 LPC cepstrum coefficients is used as the feature parameter. Besides the speech recognition experiments by proposed method, for comparision with it, we perform the experiments on the same data by Level Building DTW and One Stage DP method. The Recognition rates with the LBDTW and the One Stage method are $83.3\%$ and $87.5\%$, but the recognition rate with the proposed method is $91.6\%$.

  • PDF

Design and Implementation of Simple Text-to-Speech System using Phoneme Units (음소단위를 이용한 소규모 문자-음성 변환 시스템의 설계 및 구현)

  • Park, Ae-Hee;Yang, Jin-Woo;Kim, Soon-Hyob
    • The Journal of the Acoustical Society of Korea
    • /
    • v.14 no.3
    • /
    • pp.49-60
    • /
    • 1995
  • This paper is a study on the design and implementation of the Korean Text-to-Speech system which is used for a small and simple system. In this paper, a parameter synthesis method is chosen for speech syntheiss method, we use PARCOR(PARtial autoCORrelation) coefficient which is one of the LPC analysis. And we use phoneme for synthesis unit which is the basic unit for speech synthesis. We use PARCOR, pitch, amplitude as synthesis parameter of voice, we use residual signal, PARCOR coefficients as synthesis parameter of unvoice. In this paper, we could obtain the 60% intelligibility by using the residual signal as excitation signal of unvoiced sound. The result of synthesis experiment, synthesis of a word unit is available. The controlling of phoneme duration is necessary for synthesizing of a sentence unit. For setting up the synthesis system, PC 486, a 70[Hz]-4.5[KHz] band pass filter for speech input/output, amplifier, and TMS320C30 DSP board was used.

  • PDF

Pattern Classification of Four Emotions using EEG (뇌파를 이용한 감정의 패턴 분류 기술)

  • Kim, Dong-Jun;Kim, Young-Soo
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.3 no.4
    • /
    • pp.23-27
    • /
    • 2010
  • This paper performs emotion classification test to find out the best parameter of electroencyphalogram(EEG) signal. Linear predictor coefficients, band cross-correlation coefficients of fast Fourier transform(FFT) and autoregressive model spectra are used as the parameters of 10-channel EEG signal. A multi-layer neural network is used as the pattern classifier. Four emotions for relaxation, joy, sadness, irritation are induced by four university students of an acting circle. Electrode positions are Fp1, Fp2, F3, F4, T3, T4, P3, P4, O1, O2. As a result, the Linear predictor coefficients showed the best performance.

  • PDF

A Study on the recognition of local name using Spatio-Temporal method (Spatio-temporal방법을 이용한 지역명 인식에 관한 연구)

  • 지원우
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1993.06a
    • /
    • pp.121-124
    • /
    • 1993
  • This paper is a study on the word recognition using neural network. A limited vocabulary, speaker independent, isolated word recognition system has been built. This system recognizes isolated word without performing segmentation, phoneme identification, or dynamic time wrapping. It needs a static pattern approach to recognize a spatio-temporal pattern. The preprocessing only includes preceding and tailing silence removal, and word length determination. A LPC analysis is performed on each of 24 equally spaced frames. The PARCOR coefficients plus 3 other features from each frame is extracted. In order to simplify a structure of neural network, we composed binary code form to decrease output nodes.

  • PDF

Korean vowel recognition in noise using auditory model

  • Shim, Jae-Seong;Lee, Jae-Hyuk;Yoon, Tae-Sung;Beack, Seung-Hwa;Park, Sang-Hui
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1988.10b
    • /
    • pp.1037-1040
    • /
    • 1988
  • In this study, we performed the recognition test on Korean vowel using peripheral auditory model. In addition, for the purpose of objective comparision, the recognition test is performed by extracting LPC cepstrum coefficients from the same data. And the same speech data are mixed with the Guaussian white noise quantitatively, then we repeated the same test, too. So we verified that this auditory model has a adaptability on noise.

  • PDF