• 제목/요약/키워드: Recognition time reduction

검색결과 125건 처리시간 0.022초

절주프로그램의 인지도 및 이용의도에 영향을 미치는 요인 연구 (Study on the Recognition and Behavioral Intention for Alcohol-reduction Programs)

  • 장혜정;심재선;박종애
    • 보건교육건강증진학회지
    • /
    • 제21권1호
    • /
    • pp.243-257
    • /
    • 2004
  • Alcohol consumption is a major source of health problems, for example, alchol consumption is related to liver diseases. In addition, the social and economic costs related to alcohol consumption are enormous. This study was conducted to evaluate the current status and influencing factors related to the recognition and behavioral intention for both drinking and alcohol-reduction programs. Three effective alcohol-reduction programs of clinic program, mass education, and alliance were considered. To explain the health behavior for drinking and alcohol-reduction programs, a five-stage behavioral intention model was built and 500 questionnaires were completed through a telephone survey. Stages of the model composed of recognition of the programs, past experiences, present drinking status, intention for drinking, and behavioral intention for alcohol-reduction programs. As a result, recognition rates of the programs were low in general, therefore the strategies of education, public relations, and advertisement need to be pursued. The alcohol dependency resulted in the fact that success rate was 30% although trial rate of alcohol-reducing was 23%. The necessity of alcohol-reduction programs were suggested. In addition, significant factors related to the intention for alcohol-reducing were individual attitude and reluctancy to pay their time and money. An insignificant factor was the attitude to their alcohol-reduction by other people. Behavioral intention rates for alcohol-reducing clinics were 4%, and those for mass education were 8%. There were very low purchase rates for clinic program, mass education, and alliance. In conclusion, evidenced-based and effective alcohol-reduction programs need to be encouraged to drinkers by medical doctors, and the strategies of education, public relations, and advertisement are also recommended. In addition, continuing legal and systematic support for alcohol-reducing would lower the drinking rate and ultimately contribute to the nation's health promotion.

고차통계 정규화를 이용한 강인한 음성인식 (Robust Speech Recognition Using Real-Time Higher Order Statistics Normalization)

  • 정주현;송화전;김형순
    • 대한음성학회지:말소리
    • /
    • 제54호
    • /
    • pp.63-72
    • /
    • 2005
  • The performance of speech recognition system is degraded by the mismatch between training and test environments. Many studies have been presented to compensate for noise components in the cepstral domain. Recently, higher order cepstral moment normalization method has been introduced to improve recognition accuracy. In this paper, we present real-time high order moment normalization method with post-processing smoothing filter to reduce the parameter estimation error in higher order moment computation. In experiments using Aurora2 database, we obtained error rate reduction of 44.7% with proposed algorithm in comparison with baseline system.

  • PDF

Vehicle Image Recognition Using Deep Convolution Neural Network and Compressed Dictionary Learning

  • Zhou, Yanyan
    • Journal of Information Processing Systems
    • /
    • 제17권2호
    • /
    • pp.411-425
    • /
    • 2021
  • In this paper, a vehicle recognition algorithm based on deep convolutional neural network and compression dictionary is proposed. Firstly, the network structure of fine vehicle recognition based on convolutional neural network is introduced. Then, a vehicle recognition system based on multi-scale pyramid convolutional neural network is constructed. The contribution of different networks to the recognition results is adjusted by the adaptive fusion method that adjusts the network according to the recognition accuracy of a single network. The proportion of output in the network output of the entire multiscale network. Then, the compressed dictionary learning and the data dimension reduction are carried out using the effective block structure method combined with very sparse random projection matrix, which solves the computational complexity caused by high-dimensional features and shortens the dictionary learning time. Finally, the sparse representation classification method is used to realize vehicle type recognition. The experimental results show that the detection effect of the proposed algorithm is stable in sunny, cloudy and rainy weather, and it has strong adaptability to typical application scenarios such as occlusion and blurring, with an average recognition rate of more than 95%.

A Study on the Optimal Mahalanobis Distance for Speech Recognition

  • Lee, Chang-Young
    • 음성과학
    • /
    • 제13권4호
    • /
    • pp.177-186
    • /
    • 2006
  • In an effort to enhance the quality of feature vector classification and thereby reduce the recognition error rate of the speaker-independent speech recognition, we employ the Mahalanobis distance in the calculation of the similarity measure between feature vectors. It is assumed that the metric matrix of the Mahalanobis distance be diagonal for the sake of cost reduction in memory and time of calculation. We propose that the diagonal elements be given in terms of the variations of the feature vector components. Geometrically, this prescription tends to redistribute the set of data in the shape of a hypersphere in the feature vector space. The idea is applied to the speech recognition by hidden Markov model with fuzzy vector quantization. The result shows that the recognition is improved by an appropriate choice of the relevant adjustable parameter. The Viterbi score difference of the two winners in the recognition test shows that the general behavior is in accord with that of the recognition error rate.

  • PDF

비선형 특징투영 기법을 이용한 웨이블렛 기반 근전도 패턴인식 (A Wavelet-Based EMG Pattern Recognition with Nonlinear Feature Projection)

  • 추준욱;문인혁
    • 전자공학회논문지SC
    • /
    • 제42권2호
    • /
    • pp.39-48
    • /
    • 2005
  • 본 논문에서는 다기능 근전의수를 제어하기 위해 전완에서 취득한 4 채널의 근전도로부터 9 가지 동작을 인식하는 새로운 방법을 제안한다. 비정상 신호특성을 가진 근전도를 해석하기 위해서 시간-주파수 영역에서 표현되는 특징벡터를 웨이블렛 패킷변환을 통해 추출한다. 높은 차원을 가지는 시간-주파수 특징벡터에 대하여 차원축소와 비선형변환을 수행하기 위해 PCA와 SOFM으로 구성된 특징투영 방법을 제안한다. PCA를 이용한 차원축소는 패턴분류기의 구조를 단순화하고 패턴인식을 위한 계산시간을 단축할 수 있다. SOFM을 이용한 비선형변환은 PCA에 의해 차원이 축소된 특징벡터를 새로운 공간으로 투영함으로써 클래스 분리도를 향상시킨다. 마지막으로 각 동작은 패턴분류기인 다층 신경회로망에 의해 인식된다. 실험 결과로부터 제안한 방법이 높은 인식률을 보임과 동시에 연속적인 패턴인식을 위한 실시간 구현이 가능함을 보인다.

한국어 음성인식 플랫폼의 설계 (Design of a Korean Speech Recognition Platform)

  • 권오욱;김회린;유창동;김봉완;이용주
    • 대한음성학회지:말소리
    • /
    • 제51호
    • /
    • pp.151-165
    • /
    • 2004
  • For educational and research purposes, a Korean speech recognition platform is designed. It is based on an object-oriented architecture and can be easily modified so that researchers can readily evaluate the performance of a recognition algorithm of interest. This platform will save development time for many who are interested in speech recognition. The platform includes the following modules: Noise reduction, end-point detection, met-frequency cepstral coefficient (MFCC) and perceptually linear prediction (PLP)-based feature extraction, hidden Markov model (HMM)-based acoustic modeling, n-gram language modeling, n-best search, and Korean language processing. The decoder of the platform can handle both lexical search trees for large vocabulary speech recognition and finite-state networks for small-to-medium vocabulary speech recognition. It performs word-dependent n-best search algorithm with a bigram language model in the first forward search stage and then extracts a word lattice and restores each lattice path with a trigram language model in the second stage.

  • PDF

대용량 음성인식을 위한 인식기간 감축 알고리즘 (A Recognition Time Reduction Algorithm for Large-Vocabulary Speech Recognition)

  • 구준모;은종관
    • 한국음향학회지
    • /
    • 제10권3호
    • /
    • pp.31-36
    • /
    • 1991
  • 본 논문에서는 대용량 음성인식 시스템의 인식시간을 감축하기 위하여 후보단어를 선정하는 효과적인 방법을 제안하고 이 방법의 성능을 향상시키기 위하여 spectral smoothing과 temporal smoothing을 사용하는 것에 관하여 연구하였다. 제안된 방법은 사전내의 각 단어에 대하여 음성인식 단위의 음성 spectrum관찰확률과 길이정보를 이용하여 대강의 관찰확률을 계산하여 후보단어를 선정한다. 제안된 방법을 음소단위의 HMM을 이용하는 1160단어 인식 시스템에 적용한 결과, 전체 계산량의 74% 가량을 감축할 수 있었으며 이때 인식율의 감소는 매우 작았다. 또한 제안된 대감의 likelihood점수 계산방법은 Viterbi방법에 의하여 계산되는 likelihood 점수를 잘 추정함을 알 수 있었다.

  • PDF

TMS320C6201 DSP를 이용한 HMM 기반의 음성인식기 구현 (Implementation of HMM Based Speech Recognizer with Medium Vocabulary Size Using TMS320C6201 DSP)

  • 정성윤;손종목;배건성
    • The Journal of the Acoustical Society of Korea
    • /
    • 제25권1E호
    • /
    • pp.20-24
    • /
    • 2006
  • In this paper, we focused on the real time implementation of a speech recognition system with medium size of vocabulary considering its application to a mobile phone. First, we developed the PC based variable vocabulary word recognizer having the size of program memory and total acoustic models as small as possible. To reduce the memory size of acoustic models, linear discriminant analysis and phonetic tied mixture were applied in the feature selection process and training HMMs, respectively. In addition, state based Gaussian selection method with the real time cepstral normalization was used for reduction of computational load and robust recognition. Then, we verified the real-time operation of the implemented recognition system on the TMS320C6201 EVM board. The implemented recognition system uses memory size of about 610 kbytes including both program memory and data memory. The recognition rate was 95.86% for ETRI 445DB, and 96.4%, 97.92%, 87.04% for three kinds of name databases collected through the mobile phones.

타언어권 화자 음성 인식을 위한 혼잡도에 기반한 다중발음사전의 최적화 기법 (Optimizing Multiple Pronunciation Dictionary Based on a Confusability Measure for Non-native Speech Recognition)

  • 김민아;오유리;김홍국;이연우;조성의;이성로
    • 대한음성학회지:말소리
    • /
    • 제65호
    • /
    • pp.93-103
    • /
    • 2008
  • In this paper, we propose a method for optimizing a multiple pronunciation dictionary used for modeling pronunciation variations of non-native speech. The proposed method removes some confusable pronunciation variants in the dictionary, resulting in a reduced dictionary size and less decoding time for automatic speech recognition (ASR). To this end, a confusability measure is first defined based on the Levenshtein distance between two different pronunciation variants. Then, the number of phonemes for each pronunciation variant is incorporated into the confusability measure to compensate for ASR errors due to words of a shorter length. We investigate the effect of the proposed method on ASR performance, where Korean is selected as the target language and Korean utterances spoken by Chinese native speakers are considered as non-native speech. It is shown from the experiments that an ASR system using the multiple pronunciation dictionary optimized by the proposed method can provide a relative average word error rate reduction of 6.25%, with 11.67% less ASR decoding time, as compared with that using a multiple pronunciation dictionary without the optimization.

  • PDF

지정맥 인식을 위한 특징 검출 알고리즘 개발 (Development of Feature Extraction Algorithm for Finger Vein Recognition)

  • 김태훈;이상준
    • 정보처리학회논문지:소프트웨어 및 데이터공학
    • /
    • 제7권9호
    • /
    • pp.345-350
    • /
    • 2018
  • 본 연구는 지정맥 인식에 중요한 정맥 패턴 특징검출을 위한 알고리즘이다. 특징검출 알고리즘은 패턴인식 시 인식결과에 많은 영향을 끼치므로 중요하다. 인식률은 손가락 위치 변화에 따라 기준도 변화되므로 저하되는 특징을 가지고 있다. 또한, 손가락에 적외선 광을 조사하여 획득한 영상은 영상 배경과 혈관 패턴을 분리하기에 어렵고, 영상 전처리과정을 수행하므로 검출시간이 증대되는 특징을 가지고 있다. 이를 위해, 제시하는 알고리즘은 영상 전처리과정이 없이 수행되어 검출 시간을 줄일 수 있고, 지정맥 영상에 SWDA(Shifted Waveform Data Analysis) 알고리즘을 적용하여 손가락 마디 위치 및 정맥 패턴 검출이 가능한 특징을 가지고 있다. 적외선 투과율이 낮아 상대적으로 어두운 정맥 영상도 검출 오류 최소화가 가능한 특징을 보였다. 또한, 손가락 마디 위치는 분류 단계에서 기준으로 활용하면 인식률 저하를 보완할 수 있는 특징을 가지고 있다. 추후 손바닥, 손목 등 신체 여러 인식분야에 제안하는 알고리즘을 적용한다면 생체 특징 검출 정확도 향상 및 인식 수행 시간 감소에 기여할 것으로 기대된다.