• Title/Summary/Keyword: recognition distance

Search Result 1,004, Processing Time 0.031 seconds

The Analysis of Face Recognition Rate according to Distance and Interpolation using PCA in Surveillance System (감시카메라 시스템에서 PCA에 의한 보간법과 거리별 얼굴인식률 분석)

  • Moon, Hae-Min;Kwak, Keun-Chang;Pan, Sung-Bum
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.21 no.6
    • /
    • pp.153-160
    • /
    • 2011
  • Recently, the use of security surveillance system including CCTV is increasing due to the increase of terrors and crimes. At the same time, interest of face recognition at a distance using surveillance cameras has been increasing. Accordingly, we analyzed the performance of face recognition according to distance using PCA-based face recognition and interpolation. In this paper, we used Nearest, Bilinear, Bicubic, Lanczos3 interpolations to interpolate face image. As a result, we confirmed that existing interpolation have an few effect on performance of PCA-based face recognition and performance of PCA-based face recognition is improved by including face image according to distance in traning data.

Signal Processing for Speech Recognition in Noisy Environment (잡음 환경에서 음성 인식을 위한 신호처리)

  • Kim, Weon-Goo;Lim, Yong-Hoon;Cha, Il-Whan;Youn, Dae-Hee
    • The Journal of the Acoustical Society of Korea
    • /
    • v.11 no.2
    • /
    • pp.73-84
    • /
    • 1992
  • This paper studies noise subtraction methods and distance measures for speech recognition in a noisy environment, and investigates noise robustness of the distance measures applied to the problem of isolated word recognition in white Gaussian and colored noise (vehicle noise) environments. Noise subtraction methods which can be used as a pre-processor for the speech recognition system, such as the spectral subtraction method, autocorrelation subtraction method, adaptive noise cancellation and acoustic beamforming are studied, and distance measures such and Log Likelihood Ratio ($d_{LLR}$), cepstral distance measure ($d_{CEP}$), weighted cepstral distance measure ($d_{WCEP}$), spectral slope distance measure ($d_{RPS}$) and cepstral projection distance measure ($d_{CP},\;d_{BCP},\;d_{WCP},\;d_{BWCP}$) are also investigated. Testing of the distance measures for speaker-dependent isolated word recognition in a noisy environment indicate that $d_{RPS}\;and\;d_{WCEP}$ which weigh higher order cepstral coefficients more heavily give considerable performance improvement over $d_{CEP}and\;d_{LLR}$. In addition, when no pre-emphasis is performed, the recognizer can maintain higher performance under high noise conditions.

  • PDF

RowAMD Distance: A Novel 2DPCA-Based Distance Computation with Texture-Based Technique for Face Recognition

  • Al-Arashi, Waled Hussein;Shing, Chai Wuh;Suandi, Shahrel Azmin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.11
    • /
    • pp.5474-5490
    • /
    • 2017
  • Although two-dimensional principal component analysis (2DPCA) has been shown to be successful in face recognition system, it is still very sensitive to illumination variations. To reduce the effect of these variations, texture-based techniques are used due to their robustness to these variations. In this paper, we explore several texture-based techniques and determine the most appropriate one to be used with 2DPCA-based techniques for face recognition. We also propose a new distance metric computation in 2DPCA called Row Assembled Matrix Distance (RowAMD). Experiments on Yale Face Database, Extended Yale Face Database B, AR Database and LFW Database reveal that the proposed RowAMD distance computation method outperforms other conventional distance metrics when Local Line Binary Pattern (LLBP) and Multi-scale Block Local Binary Pattern (MB-LBP) are used for face authentication and face identification, respectively. In addition to this, the results also demonstrate the robustness of the proposed RowAMD with several texture-based techniques.

A Study on the Impact of Speech Data Quality on Speech Recognition Models

  • Yeong-Jin Kim;Hyun-Jong Cha;Ah Reum Kang
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.1
    • /
    • pp.41-49
    • /
    • 2024
  • Speech recognition technology is continuously advancing and widely used in various fields. In this study, we aimed to investigate the impact of speech data quality on speech recognition models by dividing the dataset into the entire dataset and the top 70% based on Signal-to-Noise Ratio (SNR). Utilizing Seamless M4T and Google Cloud Speech-to-Text, we examined the text transformation results for each model and evaluated them using the Levenshtein Distance. Experimental results revealed that Seamless M4T scored 13.6 in models using data with high SNR, which is lower than the score of 16.6 for the entire dataset. However, Google Cloud Speech-to-Text scored 8.3 on the entire dataset, indicating lower performance than data with high SNR. This suggests that using data with high SNR during the training of a new speech recognition model can have an impact, and Levenshtein Distance can serve as a metric for evaluating speech recognition models.

Performance Comparison of Template-based Face Recognition under Robotic Environments (로봇 환경의 템플릿 기반 얼굴인식 알고리즘 성능 비교)

  • Ban, Kyu-Dae;Kwak, Keun-Chang;Chi, Su-Young;Chung, Yun-Koo
    • The Journal of Korea Robotics Society
    • /
    • v.1 no.2
    • /
    • pp.151-157
    • /
    • 2006
  • This paper is concerned with the template-based face recognition from robot camera images with illumination and distance variations. The approaches used in this paper consist of Eigenface, Fisherface, and Icaface which are the most representative recognition techniques frequently used in conjunction with face recognition. These approaches are based on a popular unsupervised and supervised statistical technique that supports finding useful image representations, respectively. Thus we focus on the performance comparison from robot camera images with unwanted variations. The comprehensive experiments are completed for a databases with illumination and distance variations.

  • PDF

Speaker-Independent Korean Digit Recognition Using HCNN with Weighted Distance Measure (가중 거리 개념이 도입된 HCNN을 이용한 화자 독립 숫자음 인식에 관한 연구)

  • 김도석;이수영
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.18 no.10
    • /
    • pp.1422-1432
    • /
    • 1993
  • Nonlinear mapping function of the HCNN( Hidden Control Neural Network ) can change over time to model the temporal variability of a speech signal by combining the nonlinear prediction of conventional neural networks with the segmentation capability of HMM. We have two things in this paper. first, we showed that the performance of the HCNN is better than that of HMM. Second, the HCNN with its prediction error measure given by weighted distance is proposed to use suitable distance measure for the HCNN, and then we showed that the superiority of the proposed system for speaker-independent speech recognition tasks. Weighted distance considers the differences between the variances of each component of the feature vector extraced from the speech data. Speaker-independent Korean digit recognition experiment showed that the recognition rate of 95%was obtained for the HCNN with Euclidean distance. This result is 1.28% higher than HMM, and shows that the HCNN which models the dynamical system is superior to HMM which is based on the statistical restrictions. And we obtained 97.35% for the HCNN with weighted distance, which is 2.35% better than the HCNN with Euclidean distance. The reason why the HCNN with weighted distance shows better performance is as follows : it reduces the variations of the recognition error rate over different speakers by increasing the recognition rate for the speakers who have many misclassified utterances. So we can conclude that the HCNN with weighted distance is more suit-able for speaker-independent speech recognition tasks.

  • PDF

Control System for Smart Medical Illumination Based on Voice Recognition (음성인식기반 스마트 의료조명 제어시스템)

  • Kim, Min-Kyu;Lee, Soo-In;Cho, Hyun-Kil
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.8 no.3
    • /
    • pp.179-184
    • /
    • 2013
  • A voice recognition technology as a technology fundament plays an important role in medical devices with smart functions. This paper describes the implementation of a control system that can be utilized as a part of illumination equipment for medical applications (IEMA) based on a voice recognition. The control system can essentially be divided into five parts, the microphone, training part, recognition part, memory part, and control part. The system was implemented using the RSC-4x evaluation board which is included the micro-controller for voice recognition. To investigate the usefulness of the implemented control system, the experiments of the recognition rate was carried out according to the input distance for voice recognition. As a result, the recognition rate of the control system was more than 95% within a distance between 0.5 and 2m. The result verified that the implemented control system performs well as the smart control system based for an IEMA.

Distance measure between intuitionistic fuzzy sets and its application to pattern recognition

  • Park, Jin-Han;Lim, Ki-Moon;Kwun, Young-Chel
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.19 no.4
    • /
    • pp.556-561
    • /
    • 2009
  • In this paper, we propose new method to calculate the distance between intuitionistic fuzzy sets(IFSs) based on the three dimensional representation of IFSs and analyze the relations of similarity measure and distance measure of IFSs. Finally, we apply the proposed measures to pattern recognitions.

Development Small Size RGB Sensor for Providing Long Detecting Range (원거리 검출범위를 제공하는 소형 RGB 센서 개발)

  • Seo, Jae Yong;Lee, Si Hyun
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.52 no.12
    • /
    • pp.174-182
    • /
    • 2015
  • In this paper, we developed the small size RGB sensor that recognizes a long distance using a low-cost color sensor. Light receiving portion of the sensor was used as a camera lens for far distance recognition, and illuminating unit was increased the strength of the light by using a high-power white LED and a lens mounted on the reflector. RGB color recognition algorithm consists of the learning process and the realtime recognition process. We obtain a normalized RGB color reference data in the learning process using the specimens painted with target colors, and classifies the three colors using the Mahalanobis distance in recognition process. We apply the developed the RGB color recognition sensor to a prototype of the part classification system and evaluate the performance of its.

The Recognition and Distance Estimation of a Golf Ball using a WebCam (웹캠을 이용한 골프공 인식 및 위치추정 시스템)

  • Zhu, Jiaqi;Chong, Jiang;Kim, Kang-Chul
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.8 no.12
    • /
    • pp.1833-1840
    • /
    • 2013
  • A golf ball collecting robot in a golf ball driving range has been recently required because it is safer and more economic than a human being. In this paper, the golf ball recognition and distance estimation system based on a neural network and OpenCV is developed for the robot. The simulation results show that the recognition ratio is over 87% for the distance of less than 120cm and accurate rate for distance estimation is over 85% for golf balls in 30-180cm from a webcam.