• 제목/요약/키워드: linear predictive

검색결과 509건 처리시간 0.028초

주행중인 자동차 환경에서의 음성인식 연구 (A Study on Speech Recognition in a running automobile)

  • 유봉근
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1998년도 학술발표대회 논문집 제17권 1호
    • /
    • pp.47-50
    • /
    • 1998
  • 본 논문은 자동차의 편의성 및 안전성의 동시 확보를 위하여, 보조적 스위치의 조작없이 상시 음성의 입,출력이 가능하도록 하며, band pass filter를 이용하여 잡음환경에서 자동으로 정확하게 음성구간 검출(End Point Detection)을 하게 하였다. Reference Pattern은 Dynamic Multi-Section(DMS)[1] 모델을 사용하였고 차량의 속도에 따라 자동으로 잡음환경에 강인한 모델을 선택하도록 하였으며, 음성의 특징 파라미터와 인식 알고리즘은 Perceptual Linear Predictive(PLP) 13차와 One Stage Dynamic Programming(OSDP)를 사용하였다. 주행중인 자동차 환경(30~70km/h)에서 자주 사용되는 차량제어 명령 33개에 대하여 화자독립 92.98%, 화자종속 94.44% 인식율을 구하였다. 또한 주행중인 차량에서 카폰, 핸드폰 사용으로 인한 사고를 줄이기 위하여 음성으로 전화를 걸 수 있도록 하는 Voice Dialing 기능도 구현하였다.

  • PDF

음성인식을 이용한 Windows 95 제어 시스템의 구현 (The Implementation of Windows 95 Control System with Speech Recognition)

  • 남동선
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1998년도 학술발표대회 논문집 제17권 1호
    • /
    • pp.43-46
    • /
    • 1998
  • 본 논문은 컴퓨터 사용에 미숙한 초보자나 키보드나 마우스를 사용할 수 없는 신체적인 조건을 가진 장애인 또는 PC사용에 미숙한 사용자들을 위해 기존의 인터페이스에 추가적으로 음성을 사용하여 더 효율적인 작업 환경을 만들기 위한 음성을 이용한 Window95 환경에서의 음성 인식 시스템 구현에 관한 것이다. 인터페이스 구현을 위해 사용되는 인식 알고리즘으로는 연결어 인식에 사용되는 OSDP[1] 알고리즘을 단독어 인식에 적용하여 사용하였다. 특징 벡터는 화자 독립적인 특성을 지닌 Perceptual Linear Predictive(PLP)[2] 13차 계수를 사용하였다. 인식 대상 어휘는 윈도우 사용자에게 자주 사용되는 60개의 명령어로 설정하였다. 인식된 후 그 결과는 구현된 시스템의 명령 실행 모듈로 전달되어 윈도우 상에서 실제 수행된다. 구현된 시스템에서는 노트북 내장 마이크를 사용하여 음성을 검출하였고 이를 위한 음성 구간 검출 알고리즘을 사용하였다. 기준 패턴은 20대 남성화자 9인이 2회 발성한 데이터를 이용하였고, 화자 독립으로 온라인 인식률은 91.71%이고, 오프라인 인식률은 96.4%의 인식률을 얻었다.

  • PDF

마할라노비스-다구치 시스템과 로지스틱 회귀의 성능비교 : 사례연구 (Performance Comparison of Mahalanobis-Taguchi System and Logistic Regression : A Case Study)

  • 이승훈;임근
    • 대한산업공학회지
    • /
    • 제39권5호
    • /
    • pp.393-402
    • /
    • 2013
  • The Mahalanobis-Taguchi System (MTS) is a diagnostic and predictive method for multivariate data. In the MTS, the Mahalanobis space (MS) of reference group is obtained using the standardized variables of normal data. The Mahalanobis space can be used for multi-class classification. Once this MS is established, the useful set of variables is identified to assist in the model analysis or diagnosis using orthogonal arrays and signal-to-noise ratios. And other several techniques have already been used for classification, such as linear discriminant analysis and logistic regression, decision trees, neural networks, etc. The goal of this case study is to compare the ability of the Mahalanobis-Taguchi System and logistic regression using a data set.

Quantitative structure activity relationship (QSAR) between chlorinated alkene ELUMO and their chlorine

  • Tang, Walter Z.;Wang, Fang
    • Advances in environmental research
    • /
    • 제1권4호
    • /
    • pp.257-276
    • /
    • 2012
  • QSAR models for chlorinated alkenes were developed between $E_{HOMO}$ and their chlorine and carbon content. The aim is to provide valid QSAR model which is statistically validated for $E_{LUMO}$ prediction. Different molecular descriptors, $N_{Cl}$, $N_C$ and $E_{HOMO}$ have been used to take into account relevant information provided by molecular features and physicochemical properties. The best model were selected using Partial Least Square (PLS) and Multiple Linear Regression (MLR) led to models with satisfactory predictive ability for a data set of 15 chlorinated alkene compounds.

반사계수를 이용한 이동물체 식별에 관한 연구 (Identification of moving targets based on reflection coefficients)

  • 박진욱;유근호;황춘식;이수동;강오영;호광춘
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1988년도 추계학술대회 논문집 학회본부
    • /
    • pp.469-472
    • /
    • 1988
  • This paper deals with signal processing and pattern recognition for the pulsed doppler radar. In order to identify the class of moving targets detected by radar, linear predictive analysis is utilized to extract reflection coefficients of each radar signal as features, and Bayes decision theory is applied to classify them.

  • PDF

전폐절제술시 폐관류스캔을 이용한 폐기능의 예측에 대한 평가 (Evaluation of the Predictive Pulmonary Function after Pneumonectomy Using Perfusion Lung Scan)

  • 김길동;정경영
    • Journal of Chest Surgery
    • /
    • 제28권4호
    • /
    • pp.371-375
    • /
    • 1995
  • Surgical resection of lung cancer or other disease is recently required in patients with severely impaired lung function resulting from chronic obstructive pulmonary disease or disease extension. So prediction of pulmonary function after lung resection is very important in thoracic surgeon. We studied the accuracy of the prediction of postoperative pulmonary function using perfusion lung scan with 99m technetium macroaggregated albumin in 22 patients who received the pneumonectomy. The linear regression line derived from correlation between predicting[X and postoperative measured[Y values of FEV1 and FVC in patients are as follows: 1 Y[ml =0.713X + 381 in FEV1 [r=0.719 ,[P<0.01 2 Y[ml =0.645X + 556 in FVC [r=0.675 ,[P<0.01 In conclusion,the perfusion lung scan is noninvasive and very accurate for predicting postpneumonectomy pulmonary function.

  • PDF

Crown Ratio Models for Tectona grandis (Linn. f) Stands in Osho Forest Reserve, Oyo State, Nigeria

  • Popoola, F.S.;Adesoye, P.O.
    • Journal of Forest and Environmental Science
    • /
    • 제28권2호
    • /
    • pp.63-67
    • /
    • 2012
  • Crown ratio is the ratio of live crown length to tree height. It is often used as an important predictor variable for tree growth equation. It indicates tree vigor and is a useful parameter in forest health assessment. The objective of the study was to develop crown ratio prediction models for Tectona grandis. Based on the data set from the temporary sample plots, several non linear equations including logistics, Chapman Richard and exponential functions were tested. These functions were evaluated in terms of coefficient of determination ($R^2$) and standard error of the estimate (SEE). The significance of the estimated parameters was also verified. Plot of residuals against estimated crown ratios were observed. Although the logistic model had the highest $R^2$ and the least SEE, Chapman-Richard and Exponential functions were observed to be more consistent in their predictive ability; and were therefore recommended for predicting crown ratio in the stand.

자유표면에서의 수중함 심도제어 시스템 성능 개선 (Performance Enhancement of Auto-Depth Control System for Submersed Body in Near Surface Environment)

  • 이석필;윤형식;박상희
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1991년도 한국자동제어학술회의논문집(국내학술편); KOEX, Seoul; 22-24 Oct. 1991
    • /
    • pp.637-641
    • /
    • 1991
  • One of the most difficult problems in depth control for underwater vehicle is the effect of seaway disturbance. When a underwater vehicle operates in a near surface environment, the seaway generates essentially two types of stochastic disturbances that influence the boat notion. One component of the seaway forces is of large magnitude with a relatively narrow-band, first order component. The other component is generally of somewhat smaller magnitude, second order component. Since the magnitude of the first order component is generally such greater than the compensating force that can be generating by the planes, it is undesirable for the controller to generate a control command. In this paper, we used LPC(Linear Predictive Coding) processing to uncontrollable seaway disturbance. This method can be used extensively in sensor signal processing of underwater vehicles.

  • PDF

자동차 소음 환경에서 음성 인식 (Speech Recognition in the Car Noise Environment)

  • 김완구;차일환;윤대희
    • 전자공학회논문지B
    • /
    • 제30B권2호
    • /
    • pp.51-58
    • /
    • 1993
  • This paper describes the development of a speaker-dependent isolated word recognizer as applied to voice dialing in a car noise environment. for this purpose, several methods to improve performance under such condition are evaluated using database collected in a small car moving at 100km/h The main features of the recognizer are as follow: The endpoint detection error can be reduced by using the magnitude of the signal which is inverse filtered by the AR model of the background noise, and it can be compensated by using variants of the DTW algorithm. To remove the noise, an autocorrelation subtraction method is used with the constraint that residual energy obtainable by linear predictive analysis should be positive. By using the noise rubust distance measure, distortion of the feature vector is minimized. The speech recognizer is implemented using the Motorola DSP56001(24-bit general purpose digital signal processor). The recognition database is composed of 50 Korean names spoken by 3 male speakers. The recognition error rate of the system is reduced to 4.3% using a single reference pattern for each word and 1.5% using 2 reference patterns for each word.

  • PDF

Dual MAC을 이용한 음성 부호화기용 피치 매개변수 검색 구조 설계 (Design of pitch parameter search architecture for a speech coder using dual MACs)

  • 박주현;심재술;김영민
    • 전자공학회논문지A
    • /
    • 제33A권5호
    • /
    • pp.172-179
    • /
    • 1996
  • In the paper, QCELP (qualcomm code excited linear predictive), CDMA (code division multiple access)'s vocoder algorithm, was analyzed. And then, a ptich parameter seaarch architecture for 16-bit programmable DSP(digital signal processor) for QCELP was designed. Because we speed up the parameter search through high speed DSP using two MACs, we can satisfy speech codec specifiction for the digital celluar. Also, we implemented in FIFO(first-in first-out) memory using register file to increase the access time of data. This DSP was designed using COMPASS, ASIC design tool, by top-down design methodology. Therefore, it is possible to cope with rapid change at mobile communication market.

  • PDF