• Title/Summary/Keyword: automatic recognition

Search Result 1,066, Processing Time 0.026 seconds

Design of Robust Face Recognition System Realized with the Aid of Automatic Pose Estimation-based Classification and Preprocessing Networks Structure

  • Kim, Eun-Hu;Kim, Bong-Youn;Oh, Sung-Kwun;Kim, Jin-Yul
    • Journal of Electrical Engineering and Technology
    • /
    • v.12 no.6
    • /
    • pp.2388-2398
    • /
    • 2017
  • In this study, we propose a robust face recognition system to pose variations based on automatic pose estimation. Radial basis function neural network is applied as one of the functional components of the overall face recognition system. The proposed system consists of preprocessing and recognition modules to provide a solution to pose variation and high-dimensional pattern recognition problems. In the preprocessing part, principal component analysis (PCA) and 2-dimensional 2-directional PCA ($(2D)^2$ PCA) are applied. These functional modules are useful in reducing dimensionality of the feature space. The proposed RBFNNs architecture consists of three functional modules such as condition, conclusion and inference phase realized in terms of fuzzy "if-then" rules. In the condition phase of fuzzy rules, the input space is partitioned with the use of fuzzy clustering realized by the Fuzzy C-Means (FCM) algorithm. In conclusion phase of rules, the connections (weights) are realized through four types of polynomials such as constant, linear, quadratic and modified quadratic. The coefficients of the RBFNNs model are obtained by fuzzy inference method constituting the inference phase of fuzzy rules. The essential design parameters (such as the number of nodes, and fuzzification coefficient) of the networks are optimized with the aid of Particle Swarm Optimization (PSO). Experimental results completed on standard face database -Honda/UCSD, Cambridge Head pose, and IC&CI databases demonstrate the effectiveness and efficiency of face recognition system compared with other studies.

A Study on the Improvement of Automatic Text Recognition of Road Signs Using Location-based Similarity Verification (위치기반 유사도 검증을 이용한 도로표지 안내지명 자동인식 개선방안 연구)

  • Chong, Kyusoo
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.18 no.6
    • /
    • pp.241-250
    • /
    • 2019
  • Road signs are guide facilities for road users, and the Ministry of Land, Infrastructure and Transport has established and operated a system to enhance the convenience of managing these road signs. The role of road signs will decrease in the future autonomous driving, but they will continue to be needed. For the accurate mechanical recognition of texts on road signs, automatic road sign recognition equipment has been developed and it has applied image-based text recognition technology. Yet there are many cases of misrecognition due to irregular specifications and external environmental factors such as manual manufacturing, illumination, light reflection, and rainfall. The purpose of this study is to derive location-based destination names for finding misrecognition errors that cannot be overcome by image analysis, and to improve the automatic recognition of road signs destination names by using Levenshtein similarity verification method based on phoneme separation.

A Comparative Study of Speech Parameters for Speech Recognition Neural Network (음성 인식 신경망을 위한 음성 파라키터들의 성능 비교)

  • Kim, Ki-Seok;Im, Eun-Jin;Hwang, Hee-Yung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.11 no.3
    • /
    • pp.61-66
    • /
    • 1992
  • There have been many researches that uses neural network models for automatic speech recognition, but the main trend was finding the neural network models and learning rules appropriate to automatic speech recognition. However, the choice of the input speech parameter for the neural network as well as neural network model itself is a very important factor for the improvement of performance of the automatic speech recognition system using neural network. In this paper we select 6 speech parameters from surveys of the speech recognition papers which uses neural networks, and analyze the performance for the same data and the same neural network model. We use 8 sets of 9 Korean plosives and 18 sets of 8 Korean vowels. We use recurrent neural network and compare the performance of the 6 speech parameters while the number of nodes is constant. The delta cepstrum of linear predictive coefficients showed best result and the recognition rates are 95.1% for the vowels and 100.0% for plosives.

  • PDF

A Study of Automatic Evaluation Platform for Speech Recognition Engine in the Vehicle Environment (자동차 환경내의 음성인식 자동 평가 플랫폼 연구)

  • Lee, Seong-Jae;Kang, Sun-Mee
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.37 no.7C
    • /
    • pp.538-543
    • /
    • 2012
  • The performance of the speech recognition engine is one of the most critical elements of the in-vehicle speech recognition interface. The objective of this paper is to develop an automated platform for running performance tests on the in-vehicle speech recognition engine. The developed platform comprise of main program, agent program, database management module, and statistical analysis module. A simulation environment for performance tests which mimics the real driving situations was constructed, and it was tested by applying pre-recorded driving noises and a speaker's voice as inputs. As a result, the validity of the results from the speech recognition tests was proved. The users will be able to perform the performance tests for the in-vehicle speech recognition engine effectively through the proposed platform.

Driving three kinds of Course Test with RC car by Color Recognition (색깔 인식에 의한 RC car의 3가지 코스 시험 주행)

  • Lee, Jong-Min;Sohn, Young-Sun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.24 no.1
    • /
    • pp.33-39
    • /
    • 2014
  • Automatic driving needs many functions such as the obstacle recognition, the lane recognition, and the lane change, etc. In this paper, we realized a system which automatically drove the three-kinds of vehicle driving course, to introduce and apply the concept of 'color recognition' that expands the scope of 'lane recognition' for vehicle driving. We made the reduced each course compared with RC(Radio Control) car size, and controlled the steering considering the position and the slope of the detection line and the speed. Because the RC car does not have the brake function, we consider the speed and the position of the detection line to stop the RC car.

Modulation Recognition of MIMO Systems Based on Dimensional Interactive Lightweight Network

  • Aer, Sileng;Zhang, Xiaolin;Wang, Zhenduo;Wang, Kailin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.10
    • /
    • pp.3458-3478
    • /
    • 2022
  • Automatic modulation recognition is the core algorithm in the field of modulation classification in communication systems. Our investigations show that deep learning (DL) based modulation recognition techniques have achieved effective progress for multiple-input multiple-output (MIMO) systems. However, network complexity is always an additional burden for high-accuracy classifications, which makes it impractical. Therefore, in this paper, we propose a low-complexity dimensional interactive lightweight network (DilNet) for MIMO systems. Specifically, the signals received by different antennas are cooperatively input into the network, and the network calculation amount is reduced through the depth-wise separable convolution. A two-dimensional interactive attention (TDIA) module is designed to extract interactive information of different dimensions, and improve the effectiveness of the cooperation features. In addition, the TDIA module ensures low complexity through compressing the convolution dimension, and the computational burden after inserting TDIA is also acceptable. Finally, the network is trained with a penalized statistical entropy loss function. Simulation results show that compared to existing modulation recognition methods, the proposed DilNet dramatically reduces the model complexity. The dimensional interactive lightweight network trained by penalized statistical entropy also performs better for recognition accuracy in MIMO systems.

An Error Examination of 3D Face Automatic Recognition (3차원 안면자동인식기의 형상복원 오차검사)

  • Suk, Jae-Hwa;Cho, Kyung-Rae;Cho, Yong-Beum;Yoo, Jung-Hee;Kwak, Chang-Kyu;Lee, Soo-Kyung;Kho, Byung-Hee;Kim, Jong-Won;Kim, Kyu-Kon;Lee, Eui-Ju
    • Journal of Sasang Constitutional Medicine
    • /
    • v.18 no.2
    • /
    • pp.41-49
    • /
    • 2006
  • 1. Objectives The Face is an important standard for the classification of Sasang Contitutions. We are developing 3D Face Automatic Recognition Apparatus to analyse the facial characteristics. So We should examine a shape demobilization error of 3D Face Automatic Recognition Apparatus. 2. Methods We compared facial shape data be demobilized by 3D Face Automatic Recognition Apparatus with facial shape data that be demobilized by 3D laser scanner. The subject was two korean men. And We analysed the average error and the maximum error of two data. In this process, We used one datum point(the peak of nose) and two datum line(vertical section and horizontal section). 3. Results and Conclusions In each this comparison, the average error of vertical section was 1.962574mm and 2.703814mm. and the maximum error of vertical section was 16.968249mm and 18.61464mm. the average error of horizontal section was 4.173203mm and 21.487479mm. and the maximum error of horizontal section was 3.571210mm and 17.13255mm. Also We complemented this apparatus a little and We reexamined a shape demobilization error of 3D Face Automatic Recognition Apparatus again. Accuracy of a shape demobilization was improved a little. From now on We complement accuracy of a shape demobilization in 3D Face Recognition Apparatus.

  • PDF

Automatic Correction of Word-spacing Errors using by Syllable Bigram (음절 bigram를 이용한 띄어쓰기 오류의 자동 교정)

  • Kang, Seung-Shik
    • Speech Sciences
    • /
    • v.8 no.2
    • /
    • pp.83-90
    • /
    • 2001
  • We proposed a probabilistic approach of using syllable bigrams to the word-spacing problem. Syllable bigrams are extracted and the frequencies are calculated for the large corpus of 12 million words. Based on the syllable bigrams, we performed three experiments: (1) automatic word-spacing, (2) detection and correction of word-spacing errors for spelling checker, and (3) automatic insertion of a space at the end of line in the character recognition system. Experimental results show that the accuracy ratios are 97.7 percent, 82.1 percent, and 90.5%, respectively.

  • PDF

Development of a Novel System for Measuring Sizing Degree Based on Contact Angle (II) - Reliability and Reproducibility of the New Automatic Measuring System for Contact Angle - (접촉각 측정 원리를 이용한 새로운 사이즈도 측정기 (제2보) -자동 접촉각 측정 시스템의 신뢰성 및 재현성 -)

  • 이찬용;김철환;최경민;박종열;권오철
    • Journal of Korea Technical Association of The Pulp and Paper Industry
    • /
    • v.35 no.3
    • /
    • pp.53-58
    • /
    • 2003
  • The newly developed system for measuring a contact angle on a sheet was examined to investigate reliability and reproducibility of the measured results. It was clearly confirmed that the automatic contact angle measuring system was much faster and more reliable way to determine the water resistance of a sheet, comparing with Cobb and Stockigt sizing tests. Cobb test showed less significant results with stringently sized sheets, and Stockigt test exhibited the big deviations by discrepancy of the recognition point of coloring according to different testers in spite of explicit test results. On the other hand, the contact angles measured by the automatic system were reproduced with less deviations, irrespectively of different testers. It was interesting to note that the contact angle might be able to used to predict Cobb and Stockigt sizing degree, based upon the high correlation coefficients of 0.95 and 0.97. Hereafter the automatic system will be upgraded to predict Cobb and stockigt sizing degree through the measurement of contact angle.

A New Temporal Filtering Method for Improved Automatic Lipreading (향상된 자동 독순을 위한 새로운 시간영역 필터링 기법)

  • Lee, Jong-Seok;Park, Cheol-Hoon
    • The KIPS Transactions:PartB
    • /
    • v.15B no.2
    • /
    • pp.123-130
    • /
    • 2008
  • Automatic lipreading is to recognize speech by observing the movement of a speaker's lips. It has received attention recently as a method of complementing performance degradation of acoustic speech recognition in acoustically noisy environments. One of the important issues in automatic lipreading is to define and extract salient features from the recorded images. In this paper, we propose a feature extraction method by using a new filtering technique for obtaining improved recognition performance. The proposed method eliminates frequency components which are too slow or too fast compared to the relevant speech information by applying a band-pass filter to the temporal trajectory of each pixel in the images containing the lip region and, then, features are extracted by principal component analysis. We show that the proposed method produces improved performance in both clean and visually noisy conditions via speaker-independent recognition experiments.