• 제목/요약/키워드: automatic recognition

검색결과 1,066건 처리시간 0.04초

Facial Expression Recognition through Self-supervised Learning for Predicting Face Image Sequence

  • Yoon, Yeo-Chan;Kim, Soo Kyun
    • 한국컴퓨터정보학회논문지
    • /
    • 제27권9호
    • /
    • pp.41-47
    • /
    • 2022
  • 본 논문에서는 자동표정인식을 위하여 얼굴 이미지 배열의 가운데 이미지를 예측하는 새롭고 간단한 자기주도학습 방법을 제안한다. 자동표정인식은 딥러닝 모델을 통해 높은 성능을 달성할 수 있으나 일반적으로 큰 비용과 시간이 투자된 대용량의 데이터 세트가 필요하고, 데이터 세트의 크기와 알고리즘의 성능이 비례한다. 제안하는 방법은 추가적인 데이터 세트 구축 없이 기존의 데이터 세트를 활용하여 자기주도학습을 통해 얼굴의 잠재적인 심층표현방법을 학습하고 학습된 파라미터를 전이시켜 자동표정인식의 성능을 향상한다. 제안한 방법은 CK+와 AFEW 8.0 두가지 데이터 세트에 대하여 높은 성능 향상을 보여주었고, 간단한 방법으로 큰 효과를 얻을 수 있음을 보여주었다.

버섯 전후면과 꼭지부 상태의 자동 인식 (Automatic Recognition of the Front/Back Sides and Stalk States for Mushrooms(Lentinus Edodes L.))

  • 황헌;이충호
    • Journal of Biosystems Engineering
    • /
    • 제19권2호
    • /
    • pp.124-137
    • /
    • 1994
  • Visual features of a mushroom(Lentinus Edodes, L.) are critical in grading and sorting as most agricultural products are. Because of its complex and various visual features, grading and sorting of mushrooms have been done manually by the human expert. To realize the automatic handling and grading of mushrooms in real time, the computer vision system should be utilized and the efficient and robust processing of the camera captured visual information be provided. Since visual features of a mushroom are distributed over the front and back sides, recognizing sides and states of the stalk including the stalk orientation from the captured image is a prime process in the automatic task processing. In this paper, the efficient and robust recognition process identifying the front and back side and the state of the stalk was developed and its performance was compared with other recognition trials. First, recognition was tried based on the rule set up with some experimental heuristics using the quantitative features such as geometry and texture extracted from the segmented mushroom image. And the neural net based learning recognition was done without extracting quantitative features. For network inputs the segmented binary image obtained from the combined type automatic thresholding was tested first. And then the gray valued raw camera image was directly utilized. The state of the stalk seriously affects the measured size of the mushroom cap. When its effect is serious, the stalk should be excluded in mushroom cap sizing. In this paper, the stalk removal process followed by the boundary regeneration of the cap image was also presented. The neural net based gray valued raw image processing showed the successful results for our recognition task. The developed technology through this research may open the new way of the quality inspection and sorting especially for the agricultural products whose visual features are fuzzy and not uniquely defined.

  • PDF

한국어 자동 발음열 생성을 위한 예외발음사전 구축 (Building an Exceptional Pronunciation Dictionary For Korean Automatic Pronunciation Generator)

  • 김선희
    • 음성과학
    • /
    • 제10권4호
    • /
    • pp.167-177
    • /
    • 2003
  • This paper presents a method of building an exceptional pronunciation dictionary for Korean automatic pronunciation generator. An automatic pronunciation generator is an essential element of speech recognition system and a TTS (Text-To-Speech) system. It is composed of a part of regular rules and an exceptional pronunciation dictionary. The exceptional pronunciation dictionary is created by extracting the words which have exceptional pronunciations from text corpus based on the characteristics of the words of exceptional pronunciation through phonological research and text analysis. Thus, the method contributes to improve performance of Korean automatic pronunciation generator as well as the performance of speech recognition system and TTS system.

  • PDF

인간의 언어와 얼굴 표정에 통하여 자동적으로 감정 인식 시스템 새로운 접근법 (Automatic Human Emotion Recognition from Speech and Face Display - A New Approach)

  • 딩�E령;이영구;이승룡
    • 한국정보과학회:학술대회논문집
    • /
    • 한국정보과학회 2011년도 한국컴퓨터종합학술대회논문집 Vol.38 No.1(B)
    • /
    • pp.231-234
    • /
    • 2011
  • Audiovisual-based human emotion recognition can be considered a good approach for multimodal humancomputer interaction. However, the optimal multimodal information fusion remains challenges. In order to overcome the limitations and bring robustness to the interface, we propose a framework of automatic human emotion recognition system from speech and face display. In this paper, we develop a new approach for fusing information in model-level based on the relationship between speech and face expression to detect automatic temporal segments and perform multimodal information fusion.

Detection of Stator Winding Inter-Turn Short Circuit Faults in Permanent Magnet Synchronous Motors and Automatic Classification of Fault Severity via a Pattern Recognition System

  • CIRA, Ferhat;ARKAN, Muslum;GUMUS, Bilal
    • Journal of Electrical Engineering and Technology
    • /
    • 제11권2호
    • /
    • pp.416-424
    • /
    • 2016
  • In this study, automatic detection of stator winding inter-turn short circuit fault (SWISCFs) in surface-mounted permanent magnet synchronous motors (SPMSMs) and automatic classification of fault severity via a pattern recognition system (PRS) are presented. In the case of a stator short circuit fault, performance losses become an important issue for SPMSMs. To detect stator winding short circuit faults automatically and to estimate the severity of the fault, an artificial neural network (ANN)-based PRS was used. It was found that the amplitude of the third harmonic of the current was the most distinctive characteristic for detecting the short circuit fault ratio of the SPMSM. To validate the proposed method, both simulation results and experimental results are presented.

A Single Channel Speech Enhancement for Automatic Speech Recognition

  • 이진규;서현손;강홍구
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2011년도 하계학술대회
    • /
    • pp.85-88
    • /
    • 2011
  • This paper describes a single channel speech enhancement as the pre-processor of automatic speech recognition system. The improvements are based on using optimally modified log-spectra (OM-LSA) gain function with a non-causal a priori signal-to-noise ratio (SNR) estimation. Experimental results show that the proposed method gives better perceptual evaluation of speech quality score (PESQ) and lower log-spectral distance, and also better word accuracy. In the enhancement system, parameters was turned for automatic speech recognition.

  • PDF

MATHEMATICAL IMAGE PROCESSING FOR AUTOMATIC NUMBER PLATE RECOGNITION SYSTEM

  • Kim, Sun-Hee;Oh, Seung-Mi;Kang, Myung-Joo
    • Journal of the Korean Society for Industrial and Applied Mathematics
    • /
    • 제14권1호
    • /
    • pp.57-66
    • /
    • 2010
  • In this paper, we develop the Automatic Number Plate Recognition (ANPR) System. ANPR is generally composed of the following four steps: i) The acquisition of the image; ii) The extraction of the region of the number plate; iii) The partition of the number and iv) The recognition. The second and third steps incorporate image processing technique. We propose to resolve this by using Partial Differential Equation(PDE) based segmentation method. This method is computationally efficient and robust. Results indicate that our methods are capable to recognize the plate number on difficult situations.

Ensemble convolutional neural networks for automatic fusion recognition of multi-platform radar emitters

  • Zhou, Zhiwen;Huang, Gaoming;Wang, Xuebao
    • ETRI Journal
    • /
    • 제41권6호
    • /
    • pp.750-759
    • /
    • 2019
  • Presently, the extraction of hand-crafted features is still the dominant method in radar emitter recognition. To solve the complicated problems of selection and updation of empirical features, we present a novel automatic feature extraction structure based on deep learning. In particular, a convolutional neural network (CNN) is adopted to extract high-level abstract representations from the time-frequency images of emitter signals. Thus, the redundant process of designing discriminative features can be avoided. Furthermore, to address the performance degradation of a single platform, we propose the construction of an ensemble learning-based architecture for multi-platform fusion recognition. Experimental results indicate that the proposed algorithms are feasible and effective, and they outperform other typical feature extraction and fusion recognition methods in terms of accuracy. Moreover, the proposed structure could be extended to other prevalent ensemble learning alternatives.

젠지미어 압연기 제어시스템에서 형상인식에 관한 성능분석 (Performance analysis of shape recognition in Senzimir mill control systems)

  • 이문희;신종민;한성익;김종식
    • 동력기계공학회지
    • /
    • 제15권5호
    • /
    • pp.83-90
    • /
    • 2011
  • In general, 20-high Sendzimir mills(ZRM) use small diameter work rolls to provide massive rolling force. Because of small diameter of work rolls, steel strip has a complex shape mixed with quarter, edge and center waves. Especially when the shape of the strip is controlled automatically, the actuator saturation occurs. These problems affect the productivity and quality of products. In this paper, the problems in automatic shape control of ZRM were analyzed. In order to evaluate the problems for the automatic shape control in ZRM, recognition performance was analyzed by comparing the measured shape and the recognized shape. The actuator positions by the shape recognition and the manual operation were compared. From the analysis results, the necessity of the improvement of recognition performance in ZRM is suggested.

적외선 영상을 이용한 Gradient Vector Field 기반의 표적 및 화염 자동인식 연구 (A Study of Automatic Recognition on Target and Flame Based Gradient Vector Field Using Infrared Image)

  • 김춘호;이주영
    • 한국항공우주학회지
    • /
    • 제49권1호
    • /
    • pp.63-73
    • /
    • 2021
  • 본 논문은 공중 혹은 해상배경에 표적과 화염이 동시에 존재할 때, 무인항공기에 장착된 EOTS(Electro-Optical Targeting System; 전자광학 추적장비)가 표적을 추적하기 위해 화염의 영향에 강건하도록 표적을 자동 인식하는 기법을 제안한다. 제안한 기법은 표적과 화염의 적외선 영상을 Gradient Vector Field로 변환하고, 각 Gradient magnitude를 Polynomial Curve Fitting 도구에 적용하여 다항식 계수를 추출 및 얕은 신경망 모델에 학습함으로써, 표적과 화염을 자동으로 인식한다. 확보한 표적 및 화염의 다양한 적외선 영상 DB를 학습데이터, 검증데이터, 시험데이터로 분류하여 제안한 기법의 표적 및 화염 자동 인식 성능을 확인하였다. 본 알고리듬을 활용하여 무인항공기의 자동비행 중 충돌회피, 산불탐지, 공중 및 해상의 목표물을 자동탐지 및 인식하는 분야에 적용될 수 있다.