• 제목/요약/키워드: Recognition and Detection

검색결과 2,252건 처리시간 0.037초

멀티밴드 스펙트럼 차감법과 엔트로피 하모닉을 이용한 잡음환경에 강인한 분산음성인식 (Robust Distributed Speech Recognition under noise environment using MESS and EH-VAD)

  • 최갑근;김순협
    • 전자공학회논문지CI
    • /
    • 제48권1호
    • /
    • pp.101-107
    • /
    • 2011
  • 음성인식의 실용화에 가장 저해되는 요소는 배경잡음과 채널에 의한 왜곡이다. 일반적으로 잡음은 음성인식 시스템의 성능을 저하시키고 이로 인해 사용 장소의 제약을 많이 받고 있다. DSR(Distributed Speech Recognition) 기반의 음성인식 역시 이 같은 문제로 성능 향상에 어려움을 겪고 있다. 이 논문은 잡음환경에서 DSR기반의 음성인식률 향상을 위해 정확한 음성구간을 검출하고, 잡음을 제거하여 잡음에 강인한 특징추출을 하도록 설계하였다. 제안된 방법은 엔트로피와 음성의 하모닉을 이용해 음성구간을 검출하며 멀티밴드 스펙트럼 차감법을 이용하여 잡음을 제거한다. 음성의 스펙트럼 에너지에 대한 엔트로피를 사용하여 음성검출을 하게 되면 비교적 높은 SNR 환경 (SNR 15dB) 에서는 성능이 우수하나 잡음환경의 변화에 따라 음성과 비음성의 문턱 값이 변화하여 낮은 SNR환경(SNR 0dB)에시는 정확한 음성 검출이 어렵다. 이 논문은 낮은 SNR 환경(0dB)에서도 정확한 음성을 검출할 수 있도록 음성의 스펙트럴 엔트로피와 하모닉 성분을 이용하였으며 정확한 음성 구간 검출에 따라 잡음을 제거하여 잡음에 강인한 특정을 추출하도록 하였다. 실험결과 잡음환경에 따른 인식조건에서 개선된 인식성능을 보였다.

AdaBoost와 ASM을 활용한 얼굴 검출 (Face Detection using AdaBoost and ASM)

  • 이용환;김흥준
    • 반도체디스플레이기술학회지
    • /
    • 제17권4호
    • /
    • pp.105-108
    • /
    • 2018
  • Face Detection is an essential first step of the face recognition, and this is significant effects on face feature extraction and the effects of face recognition. Face detection has extensive research value and significance. In this paper, we present and analysis the principle, merits and demerits of the classic AdaBoost face detection and ASM algorithm based on point distribution model, which ASM solves the problems of face detection based on AdaBoost. First, the implemented scheme uses AdaBoost algorithm to detect original face from input images or video stream. Then, it uses ASM algorithm converges, which fit face region detected by AdaBoost to detect faces more accurately. Finally, it cuts out the specified size of the facial region on the basis of the positioning coordinates of eyes. The experimental result shows that the method can detect face rapidly and precisely, with a strong robustness.

Study On Masked Face Detection And Recognition using transfer learning

  • Kwak, NaeJoung;Kim, DongJu
    • International Journal of Advanced Culture Technology
    • /
    • 제10권1호
    • /
    • pp.294-301
    • /
    • 2022
  • COVID-19 is a crisis with numerous casualties. The World Health Organization (WHO) has declared the use of masks as an essential safety measure during the COVID-19 pandemic. Therefore, whether or not to wear a mask is an important issue when entering and exiting public places and institutions. However, this makes face recognition a very difficult task because certain parts of the face are hidden. As a result, face identification and identity verification in the access system became difficult. In this paper, we propose a system that can detect masked face using transfer learning of Yolov5s and recognize the user using transfer learning of Facenet. Transfer learning preforms by changing the learning rate, epoch, and batch size, their results are evaluated, and the best model is selected as representative model. It has been confirmed that the proposed model is good at detecting masked face and masked face recognition.

Dominant Color Transform and Circular Pattern Vector: Applications to Traffic Sign Detection and Symbol Recognition

  • An, Jung-Hak;Park, Tae-Young
    • Journal of Electrical Engineering and information Science
    • /
    • 제3권1호
    • /
    • pp.73-79
    • /
    • 1998
  • In this paper, a new traffic sign detection algorithm.. and a symbol recognition algorithm are proposed. For traffic sign detection, a dominant color transform is introduced, which serves as a tool of highlighting a dominant primary color, while discarding the other two primary colors. For symbol recognition, the curvilinear shape distribution on a circle centered on the centroid of symbol, called a circular pattern vector, is used as a spatial feature of symbol. The circular pattern vector is invariant to scaling, translation, and rotation. As simulation results, the effectiveness of traffic sign detection and recognition algorithms are confirmed, and it is shown that group of circular patter vectors based on concentric circles is more effective than circular pattern vector of a single circle for a given equivalent number of elements of vectors.

  • PDF

현금 인출기 적용을 위한 얼굴인식 알고리즘 (Face Detection Algorithm for Automatic Teller Machine(ATM))

  • 이혁범;유지상
    • 한국통신학회논문지
    • /
    • 제25권6B호
    • /
    • pp.1041-1049
    • /
    • 2000
  • A face recognition algorithm for the user identification procedure of automatic teller machine(ATM), as an application of the still image processing techniques is proposed in this paper. In the proposed algorithm, face recognition techniques, especially, face region detection, eye and mouth detection schemes, which can distinguish abnormal faces from normal faces, are proposed. We define normal face, which is acceptable, as a face without sunglasses or a mask, and abnormal face, which is non-acceptable, as that wearing both, or either one of them. The proposed face recognition algorithm is composed of three stages: the face region detection stage, the preprocessing stage for facial feature detection and the eye and mouth detection stage. Experimental results show that the proposed algorithm can distinguish abnormal faces from normal faces accurately from restrictive sample images.

  • PDF

A Novel Text Sample Selection Model for Scene Text Detection via Bootstrap Learning

  • Kong, Jun;Sun, Jinhua;Jiang, Min;Hou, Jian
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권2호
    • /
    • pp.771-789
    • /
    • 2019
  • Text detection has been a popular research topic in the field of computer vision. It is difficult for prevalent text detection algorithms to avoid the dependence on datasets. To overcome this problem, we proposed a novel unsupervised text detection algorithm inspired by bootstrap learning. Firstly, the text candidate in a novel form of superpixel is proposed to improve the text recall rate by image segmentation. Secondly, we propose a unique text sample selection model (TSSM) to extract text samples from the current image and eliminate database dependency. Specifically, to improve the precision of samples, we combine maximally stable extremal regions (MSERs) and the saliency map to generate sample reference maps with a double threshold scheme. Finally, a multiple kernel boosting method is developed to generate a strong text classifier by combining multiple single kernel SVMs based on the samples selected from TSSM. Experimental results on standard datasets demonstrate that our text detection method is robust to complex backgrounds and multilingual text and shows stable performance on different standard datasets.

Modeling of Positive Selection for the Development of a Computer Immune System and a Self-Recognition Algorithm

  • Sim, Kwee-Bo;Lee, Dong-Wook
    • International Journal of Control, Automation, and Systems
    • /
    • 제1권4호
    • /
    • pp.453-458
    • /
    • 2003
  • The anomaly-detection algorithm based on negative selection of T cells is representative model among self-recognition methods and it has been applied to computer immune systems in recent years. In immune systems, T cells are produced through both positive and negative selection. Positive selection is the process used to determine a MHC receptor that recognizes self-molecules. Negative selection is the process used to determine an antigen receptor that recognizes antigen, or the nonself cell. In this paper, we propose a novel self-recognition algorithm based on the positive selection of T cells. We indicate the effectiveness of the proposed algorithm by change-detection simulation of some infected data obtained from cell changes and string changes in the self-file. We also compare the self-recognition algorithm based on positive selection with the anomaly-detection algorithm.

3D영상 객체인식을 통한 얼굴검출 파라미터 측정기술에 대한 연구 (Object Recognition Face Detection With 3D Imaging Parameters A Research on Measurement Technology)

  • 최병관;문남미
    • 한국컴퓨터정보학회논문지
    • /
    • 제16권10호
    • /
    • pp.53-62
    • /
    • 2011
  • 본 논문에서는 첨단 IT융,복합기술의 발달로 특수 기술로만 여겨졌던 영상객체인식 기술분야가 스마트-폰 기술의 발전과 더불어 개인 휴대용 단말기기로 발전하고 있다. 3D기반의 얼굴인식 검출기술은 객체인식 기술을 통하여 지능형 영상검출 인식기술기술로 진화되고 있음에 따라 영상인식을 통한 얼굴검출기술과 더불어 개발속도가 급속히 발전하고 있다. 본 논문에서는 휴먼인식기술을 기반으로 한 얼굴객체인식 영상검출을 통한 얼굴인식처리 기술의 인지 적용기술을 IP카메라에 적용하여 인가자의 입,출입등의 식별능력을 적용한 휴먼인식을 적용한 얼굴측정 기술에 대한 연구방안을 제안한다. 연구방안은 1)얼굴모델 기반의 얼굴 추적기술을 개발 적용하였고 2)개발된 알고리즘을 통하여 PC기반의 휴먼인식 측정 연구를 통한 기본적인 파라미터 값을 CPU부하에도 얼굴 추적이 가능하며 3)양안의 거리 및 응시각도를 실시간으로 추적할 수 있는 효과를 입증하였다.

연령별 짠맛 역치, 짠맛 미각판정치와 짜게 먹는 식행동과의 상관성 분석 (Correlations Among Threshold and Assessment for Salty Taste and High-salt Dietary Behavior by Age)

  • 지앙린;정윤영;이연경
    • 대한지역사회영양학회지
    • /
    • 제21권1호
    • /
    • pp.75-83
    • /
    • 2016
  • Objectives: The purpose of this study was to analyze correlation thresholds and assessment for salty taste and high-salt dietary behaviors by age. Methods: A total of 524 subjects including 100 each of elementary school students, middle school students, college students, and elderly as well as 124 adults were surveyed for detection and recognition thresholds, salty taste assessments, and high-salt dietary behaviors. Results: Elementary students had a lower detection threshold (p<0.05) and recognition threshold (p<0.01) than did the other groups. Salty taste assessments were lowest among elementary students, followed by middle school students, while college students, adults, and elderly had higher assessment score (p<0.001). Elementary students had significantly lower scores for high-salt dietary behavior than did middle school students, college students, adults and elderly (p<0.001). Middle school students had higher scores for high-salt dietary behavior than did elementary school students and elderly (p<0.001) but no meaningful difference was found in dietary behavior scores between college students, adults, and elderly. There were positive correlations between high-salt dietary behavior and detection thresholds (p<0.001), recognition thresholds (p<0.001), and salty taste assessment (p<0.001). High-salt dietary behavior was more positively correlated with salty taste assessment than detection and recognition thresholds for salty taste. Conclusions: This study suggested that salty taste assessments were positively associated with scores for the detection and recognition thresholds and high-salt dietary behavior.

비디오 영상 기반의 얼굴 검색 (Face Detection based on Video Sequence)

  • 안효창;이상범
    • 반도체디스플레이기술학회지
    • /
    • 제7권3호
    • /
    • pp.45-49
    • /
    • 2008
  • Face detection and tracking technology on video sequence has developed indebted to commercialization of teleconference, telecommunication, front stage of surveillance system using face recognition, and video-phone applications. Complex background, color distortion by luminance effect and condition of luminance has hindered face recognition system. In this paper, we have proceeded to research of face recognition on video sequence. We extracted facial area using luminance and chrominance component on $YC_bC_r$ color space. After extracting facial area, we have developed the face recognition system applied to our improved algorithm that combined PCA and LDA. Our proposed algorithm has shown 92% recognition rate which is more accurate performance than previous methods that are applied to PCA, or combined PCA and LDA.

  • PDF