• 제목/요약/키워드: Recognition Evaluation

검색결과 1,298건 처리시간 0.027초

Hidden Markov Network 음성인식 시스템의 성능평가에 관한 연구 (A Study on Performance Evaluation of Hidden Markov Network Speech Recognition System)

  • 오세진;김광동;노덕규;위석오;송민규;정현열
    • 융합신호처리학회논문지
    • /
    • 제4권4호
    • /
    • pp.30-39
    • /
    • 2003
  • 본 논문에서는 한국어 음성 데이터를 대상으로 HM-Net(Hidden Markov Network) 음성인식 시스템의 성능평가를 수행하였다. 음향모델 작성은 음성인식에서 널리 사용되고 있는 통계적인 모델링 방법인 HMM(Hidden Markov Model)을 개량한 HM-Net을 도입하였다. HM-Net은 기존의 SSS(Successive State Splitting) 알고리즘을 개량한 PDT(Phonetic Decision Tree)-SSS 알고리즘에 의해 문맥방향과 시간방향의 상태분할을 수행하여 생성되는데, 특히 문맥방향 상태분할의 경우 학습 음성데이터에 출현하지 않는 문맥정보를 효과적으로 표현하기 위해 음소결정트리를 채용하고 있으며, 시간방향 상태분할의 경우 학습 음성데이터에서 각 음소별 지속시간 정보를 효과적으로 표현하기 위한 상태분할을 수행하며, 마지막으로 파라미터의 공유를 통해 triphone 형태의 최적인 모델 네트워크를 작성하게 된다. 인식에 사용된 알고리즘은 음소 및 단어인식의 경우에는 One-Pass Viterbi 빔 탐색을 사용하며 트리 구조 형태의 사전과 phone/word-pair 문법을 채용하고 있다. 연속음성인식의 경우에는 단어 bigram과 단어 trigram 언어모델과 목구조 형태의 사전을 채용한 Multi-Pass 빔 탐색을 사용하고 있다. 전체적으로 본 논문에서는 다양한 조건에서 HM-Net 음성인식 시스템의 성능평가를 수행하였으며, 지금까지 소개된 음성인식 시스템과 비교하여 매우 우수한 인식성능을 보임을 실험을 통해 확인할 수 있었다.

  • PDF

SOM 이용한 각성수준의 자동인식 (Automatic Recognition in the Level of Arousal using SOM)

  • 정찬순;함준석;고일주
    • 감성과학
    • /
    • 제14권2호
    • /
    • pp.197-206
    • /
    • 2011
  • 본 논문에서는 신경망 SOM학습을 이용하여 피험자의 각성수준을 높은각성과 낮은각성으로 자동인식하는 것을 제안한다. 각성수준의 자동인식 단계는 세 단계로 구성된다 첫 번째는 ECG 측정 및 분석단계로 슈팅게임을 플레이하는 피험자를 ECG로 측정하고, SOM 학습을 하기 위해 특징을 추출한다. 두 번째는 SOM 학습 단계로 특징이 추출된 입력벡터들을 학습한다. 마지막으로 각성인식 단계는 SOM 학습이 완료된 후에 새로운 입력벡터가 들어왔을 때, 피험자의 각성수준을 인식한다. 실험결과는 각성수준의 SOM 학습결과와 새로운 입력벡터가 들어왔을 때 각성수준의 인식결과, 그리고 각성수준을 수치와 그래프로 보여준다. 마지막으로 SOM의 평가는 기존연구의 감성평가 결과와 SOM의 자동인식 결과를 순차적으로 비교하여 평균 86%로 분석되었다. 본 연구를 통해서 SOM을 이용하여 피험자마다 다른 각성수준을 자동인식 할 수 있었다.

  • PDF

Transformation Based Walking Speed Normalization for Gait Recognition

  • Kovac, Jure;Peer, Peter
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제7권11호
    • /
    • pp.2690-2701
    • /
    • 2013
  • Humans are able to recognize small number of people they know well by the way they walk. This ability represents basic motivation for using human gait as the means for biometric identification. Such biometric can be captured at public places from a distance without subject's collaboration, awareness or even consent. Although current approaches give encouraging results, we are still far from effective use in practical applications. In general, methods set various constraints to circumvent the influence factors like changes of view, walking speed, capture environment, clothing, footwear, object carrying, that have negative impact on recognition results. In this paper we investigate the influence of walking speed variation to different visual based gait recognition approaches and propose normalization based on geometric transformations, which mitigates its influence on recognition results. With the evaluation on MoBo gait dataset we demonstrate the benefits of using such normalization in combination with different types of gait recognition approaches.

특징 선택과 융합 방법을 이용한 음성 감정 인식 (Speech Emotion Recognition using Feature Selection and Fusion Method)

  • 김원구
    • 전기학회논문지
    • /
    • 제66권8호
    • /
    • pp.1265-1271
    • /
    • 2017
  • In this paper, the speech parameter fusion method is studied to improve the performance of the conventional emotion recognition system. For this purpose, the combination of the parameters that show the best performance by combining the cepstrum parameters and the various pitch parameters used in the conventional emotion recognition system are selected. Various pitch parameters were generated using numerical and statistical methods using pitch of speech. Performance evaluation was performed on the emotion recognition system using Gaussian mixture model(GMM) to select the pitch parameters that showed the best performance in combination with cepstrum parameters. As a parameter selection method, sequential feature selection method was used. In the experiment to distinguish the four emotions of normal, joy, sadness and angry, fifteen of the total 56 pitch parameters were selected and showed the best recognition performance when fused with cepstrum and delta cepstrum coefficients. This is a 48.9% reduction in the error of emotion recognition system using only pitch parameters.

Recognition of Identifiers from Shipping Container Image by Using Fuzzy Binarization and ART2-based RBF Network

  • Kim, Kwang-Baek
    • 지능정보연구
    • /
    • 제9권2호
    • /
    • pp.1-18
    • /
    • 2003
  • The automatic recognition of transport containers using image processing is very hard because of the irregular size and position of identifiers, diverse colors of background and identifiers, and the impaired shapes of identifiers caused by container damages and the bent surface of container, etc. We proposed and evaluated the novel recognition algorithm of container identifiers that overcomes effectively the hardness and recognizes identifiers from container images captured in the various environments. The proposed algorithm, first, extracts the area including only all identifiers from container images by using CANNY masking and bi-directional histogram method. The extracted identifier area is binarized by the fuzzy binarization method newly proposed in this paper and by applying contour tracking method to the binarized area, container identifiers which are targets of recognition are extracted. We proposed and applied the ART2-based RBF network for recognition of container identifiers. The results of experiment for performance evaluation on the real container images showed that the proposed algorithm has more improved performance in the extraction and recognition of container identifiers than the previous algorithms.

  • PDF

An Efficient Face Recognition using Feature Filter and Subspace Projection Method

  • Lee, Minkyu;Choi, Jaesung;Lee, Sangyoun
    • Journal of International Society for Simulation Surgery
    • /
    • 제2권2호
    • /
    • pp.64-66
    • /
    • 2015
  • Purpose : In this paper we proposed cascade feature filter and projection method for rapid human face recognition for the large-scale high-dimensional face database. Materials and Methods : The relevant features are selected from the large feature set using Fast Correlation-Based Filter method. After feature selection, project them into discriminant using Principal Component Analysis or Linear Discriminant Analysis. Their cascade method reduces the time-complexity without significant degradation of the performance. Results : In our experiments, the ORL database and the extended Yale face database b were used for evaluation. On the ORL database, the processing time was approximately 30-times faster than typical approach with recognition rate 94.22% and on the extended Yale face database b, the processing time was approximately 300-times faster than typical approach with recognition rate 98.74 %. Conclusion : The recognition rate and time-complexity of the proposed method is suitable for real-time face recognition system on the large-scale high-dimensional face database.

심전도 신호의 신택틱 패턴인식 (Syntatic Pattern recognition of the ECG)

  • 남승우;이병채;신건수;이재준;이명호
    • 대한의용생체공학회:학술대회논문집
    • /
    • 대한의용생체공학회 1991년도 추계학술대회
    • /
    • pp.129-132
    • /
    • 1991
  • This paper describes the ECG pattern recognition using the syntatic pattern recognition algorithm. The algorithm uses the BNF rule wi th the semantic evaluation which has the structural Information of the ECG. This algorithm is constructed with (1) removing the baseline drift by the Cubic spline function and exract the significant point by the line-approximation algorithm, (2) syntatic peak recognition algorithm with the extracted significant point, (3) produce the token which is used pattern recognition, (4) pattern recognition of the ECG by the syntatic pattern recognition algorithm, (5) extract the parameter with the pattern recognized ECG signal.

  • PDF

Wiener Filtering을 이용한 잡음환경에서의 음성인식 (Speech Recognition in Noisy Environments using Wiener Filtering)

  • 김진영;엄기완;최홍섭
    • 음성과학
    • /
    • 제1권
    • /
    • pp.277-283
    • /
    • 1997
  • In this paper, we present a robust recognition algorithm based on the Wiener filtering method as a research tool to develop the Korean Speech recognition system. We especially used Wiener filtering method in cepstrum-domain, because the method in frequency-domain is computationally expensive and complex. Evaluation of the effectiveness of this method has been conducted in speaker-independent isolated Korean digit recognition tasks using discrete HMM speech recognition systems. In these tasks, we used 12th order weighted cepstral as a feature vector and added computer simulated white gaussian noise of different levels to clean speech signals for recognition experiments under noisy conditions. Experimental results show that the presented algorithm can provide an improvement in recognition of as much as from $5\%\;to\;\20\%$ in comparison to spectral subtraction method.

  • PDF

주민참여에 의한 농촌경관자원조사 방법 연구 - 경관맵 사례 분석을 중심으로 - (A study on a research method measuring rural landscape resources by inhabitants participation - Focused on a case study using Landscape Evaluation Map)

  • 이정원;윤진옥;임승빈
    • 농촌계획
    • /
    • 제16권4호
    • /
    • pp.13-22
    • /
    • 2010
  • Rural landscape is an outcome of residents' life activity based on natural environment. Unlike city, rural residents make their own landscape over a period of time interacting with nature through cultivating and building houses and huts based on the background. Therefore, residents' role in rural area is of greater importance than city's and their recognition of landscape is a key factor to evaluate and manage rural landscape. Landscape Evaluation Map which utilizing Feeling Map method is a evaluation tool to [md out residents' recognition of landscape. In this tool, responses evaluate landscape around their living space and mark color dots which mean landscape grade on a map. This research is to examine effectiveness and applicability of the tool, Landscape Evaluation Map, which is recommended to estimate residents' evaluation of landscape. Through analyzing 7 cases of field application, the effectiveness of Landscape Evaluation Map has been verified and also demerits have been drawn. After modifying detailed techniques and developing resident education, Landscape evaluation map could be applied to [md out landscape resources rather than to evaluate whole rural landscape.

인지-감정요소에 의한 공간이미지 평가성 분석 (Analysis on Space Image Evaluation through Recognitive-Emotional Factor)

  • 송영민;이동기
    • 한국실내디자인학회논문집
    • /
    • 제20권6호
    • /
    • pp.71-78
    • /
    • 2011
  • Although the recognition and emotion about space is subjective and individual, if standard is proposed through common factor, objective, quantified space image evaluation will be available. In addition, space image evaluation standard caused by recognitive-emotional factor can meet requests of space users and increase psychological satisfactions. The purpose of this study is to grasp the space image caused by recognitive-emotional factor in space with PAD model and analyze the evaluation of space image giving visual, recognitive and emotional effects. The analysis result revealed that 'joyfulness' and access-avoidance had a very similar distribution. The result means that space is evaluated with the degree of 'joyfulness' for space and it is led by approach-avoidance behavior. The recognition factor that forms and evaluates space image and decides approach-avoidance is expressed as adjective images such as 'fresh, joyful, light and static and its emotional factors are adjective images such as 'calm, allowable, joyful and quiet'.