• Title/Abstract/Keywords: Vision-based recognition

633 search results (processing time: 0.03 s)

A Study on Rotational Alignment Algorithm for Improving Character Recognition

  • 진고환
    • Journal of the Korea Convergence Society (한국융합학회논문지) / Vol. 10, No. 11 / pp.79-84 / 2019
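  • Image-based technologies continue to advance and are used in many fields, and demand is growing rapidly for vision systems that analyze and classify objects in images acquired by a camera. Image processing, the core technology of vision systems, is used for defect inspection in semiconductor production and for recognition inspection of objects such as numbers and symbols on tire surfaces, and research on tasks such as license plate recognition is ongoing, so objects need to be recognized quickly and accurately. To recognize tilted objects such as numbers or symbols marked on curved surfaces, this paper proposes a recognition model that determines the tilt angle of the object in the input image and rotationally aligns the object. The proposed model extracts the object region using a contour algorithm, computes the object's angle, and then performs object recognition on the rotationally aligned image. Future work should study template matching using machine learning.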
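
The angle-then-rotate step described in this abstract can be illustrated with a minimal sketch. It does not reproduce the paper's contour-based method: as a stand-in, it estimates the tilt from the second-order central moments of the object's pixel coordinates. The function names `principal_angle_deg` and `rotate_points` are hypothetical, and the synthetic `stroke` is an assumption that replaces a real extracted object region.

```python
import math

def centroid(points):
    """Mean (x, y) of a list of points."""
    n = len(points)
    return sum(x for x, _ in points) / n, sum(y for _, y in points) / n

def principal_angle_deg(points):
    """Orientation of the principal axis of (x, y) points, in degrees.

    Uses second-order central moments (a swapped-in technique, not the
    contour algorithm named in the abstract).
    """
    n = len(points)
    cx, cy = centroid(points)
    mu20 = sum((x - cx) ** 2 for x, _ in points) / n
    mu02 = sum((y - cy) ** 2 for _, y in points) / n
    mu11 = sum((x - cx) * (y - cy) for x, y in points) / n
    return math.degrees(0.5 * math.atan2(2 * mu11, mu20 - mu02))

def rotate_points(points, angle_deg):
    """Rotate points about their centroid by angle_deg (x right, y up)."""
    cx, cy = centroid(points)
    a = math.radians(angle_deg)
    cos_a, sin_a = math.cos(a), math.sin(a)
    return [
        (cx + (x - cx) * cos_a - (y - cy) * sin_a,
         cy + (x - cx) * sin_a + (y - cy) * cos_a)
        for x, y in points
    ]

# Synthetic stroke tilted by 30 degrees, standing in for an object's pixels.
stroke = [(t * math.cos(math.radians(30)), t * math.sin(math.radians(30)))
          for t in range(20)]
tilt = principal_angle_deg(stroke)      # estimated tilt of the object
aligned = rotate_points(stroke, -tilt)  # undo the tilt before recognition
```

In image coordinates the y axis usually points down, so the sign of the estimated angle flips relative to this sketch.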

Pose and Expression Invariant Alignment based Multi-View 3D Face Recognition

  • Ratyal, Naeem;Taj, Imtiaz;Bajwa, Usama;Sajid, Muhammad
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 12, No. 10 / pp.4903-4929 / 2018
  • In this study, a fully automatic pose- and expression-invariant 3D face alignment algorithm is proposed to handle frontal and profile face images, based on a two-pass coarse-to-fine alignment strategy. The first pass of the algorithm coarsely aligns the face images to an intrinsic coordinate system (ICS) through a single 3D rotation, and the second pass aligns them at a fine level using a minimum nose tip-scanner distance (MNSD) approach. For facial recognition, multi-view faces are synthesized to exploit real 3D information and test the efficacy of the proposed system. Because it finds an optimal separating hyperplane (OSH), a Support Vector Machine (SVM) is employed in the multi-view face verification (FV) task. In addition, a multi-stage unified-classifier-based face identification (FI) algorithm is employed, which combines results from seven base classifiers, two parallel face recognition algorithms, and an exponential rank combiner, all in a hierarchical manner. The performance figures of the proposed methodology are corroborated by extensive experiments performed on four benchmark datasets: GavabDB, Bosphorus, UMB-DB and FRGC v2.0. Results show marked improvement in alignment accuracy and recognition rates. Moreover, a computational complexity analysis carried out for the proposed algorithm reveals its superiority in terms of computational efficiency as well.

Recognition of Patterns and Marks on the Glass Panel of Computer Monitor

  • 안인모;이기상
    • The Transactions of the Korean Institute of Electrical Engineers P (전기학회논문지P) / Vol. 52, No. 1 / pp.35-41 / 2003
  • In this paper, a machine vision system for recognizing and classifying the patterns and marks engraved by die molding or laser marking on the glass panels of computer monitors is suggested and evaluated experimentally. The vision system is equipped with a neural network and an NGC pattern classifier, including a search process based on normalized grayscale correlation and adaptive binarization. This system is found to be applicable even to cases in which segmenting the pattern area from the background using an ordinary blob coloring technique is quite difficult. The inspection process is accomplished by the use of the NGC hypothesis and ANN verification. The proposed pattern recognition system is composed of three parts: the NGC matching process and preprocessing unit for acquiring the best quality of binary image data, a neural network-based recognition algorithm, and the learning algorithm for the neural network. Another contribution of this paper is a method of generating the training patterns from only a few typical product samples in place of real images of all types of good products.
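
A search based on normalized grayscale correlation, as mentioned in this abstract, can be sketched as below. This is a generic zero-mean normalized correlation with an exhaustive window search, not the paper's implementation. The names `ngc` and `best_match` and the tiny test image are assumptions made for illustration.

```python
import math

def ngc(patch, template):
    """Zero-mean normalized correlation of two equal-sized grayscale patches.

    Both arguments are lists of rows. The score lies in [-1, 1] and is
    unaffected by uniform gain or offset changes in brightness.
    """
    a = [v for row in patch for v in row]
    b = [v for row in template for v in row]
    if len(a) != len(b):
        raise ValueError("patch and template must have the same size")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a) *
                    sum((y - mean_b) ** 2 for y in b))
    return num / den if den else 0.0  # flat patches carry no pattern

def best_match(image, template):
    """Slide the template over the image; return (score, row, col) of the best window."""
    th, tw = len(template), len(template[0])
    best = (-2.0, -1, -1)
    for r in range(len(image) - th + 1):
        for c in range(len(image[0]) - tw + 1):
            window = [row[c:c + tw] for row in image[r:r + th]]
            score = ngc(window, template)
            if score > best[0]:
                best = (score, r, c)
    return best

# Hypothetical 5x5 image with a 2x2 mark placed at row 2, column 1.
template = [[10, 200], [200, 10]]
image = [[0] * 5 for _ in range(5)]
image[2][1:3] = [10, 200]
image[3][1:3] = [200, 10]
score, row, col = best_match(image, template)
```

Because the score is normalized, a threshold on it can serve as the hypothesis step before a separate verification stage.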

A Method for Improving Object Recognition Using Pattern Recognition Filtering

  • 박진렬;이승기
    • Journal of the Institute of Electronics and Information Engineers (전자공학회논문지) / Vol. 53, No. 6 / pp.122-129 / 2016
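  • Many algorithms for object recognition are being studied in the field of computer vision. Among them, the feature-based SURF (Speeded Up Robust Features) algorithm outperforms other algorithms in speed and accuracy. However, SURF can fail at object recognition because of mismatched correspondences during correspondence detection. To improve the object recognition rate, this paper implements an object recognition system based on SURF and the RANSAC (Random Sample Consensus) algorithm and proposes pattern recognition filtering. Experimental results showing the improved object recognition rate are also presented.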
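
How RANSAC rejects mismatched correspondences can be shown with a minimal sketch. It assumes the simplest possible motion model, a pure 2D translation, whereas real systems usually fit a homography or similar transform. The function name `ransac_translation`, its parameters, and the sample matches are all hypothetical.

```python
import random

def ransac_translation(matches, threshold=3.0, iterations=100, seed=0):
    """Estimate a 2D translation from point matches and discard mismatches.

    matches: list of ((x1, y1), (x2, y2)) correspondences.
    Returns (dx, dy, inliers). The translation-only model is an
    assumption made to keep the sketch short.
    """
    rng = random.Random(seed)
    best_inliers = []
    for _ in range(iterations):
        # One match is the minimal sample for a translation model.
        (x1, y1), (x2, y2) = rng.choice(matches)
        dx, dy = x2 - x1, y2 - y1
        inliers = [
            m for m in matches
            if abs(m[1][0] - m[0][0] - dx) <= threshold
            and abs(m[1][1] - m[0][1] - dy) <= threshold
        ]
        if len(inliers) > len(best_inliers):
            best_inliers = inliers
    if not best_inliers:
        return 0.0, 0.0, []
    # Refine the estimate by averaging over the consensus set.
    n = len(best_inliers)
    dx = sum(b[0] - a[0] for a, b in best_inliers) / n
    dy = sum(b[1] - a[1] for a, b in best_inliers) / n
    return dx, dy, best_inliers

# Eight consistent matches shifted by (5, -2) plus three mismatches.
good = [((x, y), (x + 5, y - 2)) for x, y in
        [(0, 0), (10, 4), (7, 9), (3, 12), (15, 1), (8, 8), (2, 6), (11, 11)]]
bad = [((1, 1), (41, 14)), ((6, 2), (-16, 9)), ((9, 5), (18, 35))]
dx, dy, inliers = ransac_translation(good + bad)
```

Only the matches in `inliers` would be passed on to the recognition decision; the three mismatches fall outside the consensus set.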

Real time instruction classification system

  • Sang-Hoon Lee;Dong-Jin Kwon
    • International Journal of Internet, Broadcasting and Communication / Vol. 16, No. 3 / pp.212-220 / 2024
  • With the recent advancement of society, AI technology has made significant strides, especially in the fields of computer vision and voice recognition. This study introduces a system that leverages these technologies to recognize users through a camera and relay commands within a vehicle based on voice commands. The system uses the YOLO (You Only Look Once) machine learning algorithm, widely used for object and entity recognition, to identify specific users. For voice command recognition, a machine learning model based on spectrogram voice analysis is employed to identify specific commands. This design aims to enhance security and convenience by preventing unauthorized access to vehicles and IoT devices by anyone other than registered users. The system converts camera input data into YOLO system inputs to determine whether the subject is a person. Additionally, it collects voice data through a microphone embedded in the device or computer, converting it into time-domain spectrogram data to be used as input for the voice recognition machine learning system. The input camera image data and voice data undergo inference through pre-trained models, enabling the recognition of simple commands within a limited space based on the inference results. This study demonstrates the feasibility of constructing a device management system within a confined space that enhances security and user convenience through a simple real-time system model. Finally, our work aims to provide practical solutions in various application fields, such as smart homes and autonomous vehicles.

3D Facial Landmark Tracking and Facial Expression Recognition

  • Medioni, Gerard;Choi, Jongmoo;Labeau, Matthieu;Leksut, Jatuporn Toy;Meng, Lingchao
    • Journal of information and communication convergence engineering / Vol. 11, No. 3 / pp.207-215 / 2013
  • In this paper, we address the challenging computer vision problem of obtaining a reliable facial expression analysis from a naturally interacting person. We propose a system that combines a 3D generic face model, 3D head tracking, and a 2D tracker to track facial landmarks and recognize expressions. First, we extract facial landmarks from a neutral frontal face, and then we deform a 3D generic face to fit the input face. Next, we use our real-time 3D head tracking module to track a person's head in 3D and predict facial landmark positions in 2D using the projection from the updated 3D face model. Finally, we use tracked 2D landmarks to update the 3D landmarks. This integrated tracking loop enables efficient tracking of the non-rigid parts of a face in the presence of large 3D head motion. We conducted experiments for facial expression recognition using both frame-based and sequence-based approaches. Our method provides a 75.9% recognition rate in 8 subjects with 7 key expressions. Our approach provides a considerable step forward toward new applications including human-computer interactions, behavioral science, robotics, and game applications.
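
The step of predicting 2D landmark positions by projecting the 3D face model can be sketched as follows. This assumes a simple pinhole camera and a head pose limited to yaw, which is far simpler than the tracker in the paper. The function names, the focal length of 800 pixels, and the principal point (320, 240) are hypothetical values chosen for illustration.

```python
import math

def rotate_yaw(point, yaw_deg):
    """Rotate a 3D point about the vertical (y) axis by yaw_deg."""
    x, y, z = point
    a = math.radians(yaw_deg)
    return (x * math.cos(a) + z * math.sin(a),
            y,
            -x * math.sin(a) + z * math.cos(a))

def project(point, focal, cx, cy):
    """Pinhole projection of a camera-space 3D point to pixel coordinates."""
    x, y, z = point
    return (focal * x / z + cx, focal * y / z + cy)

def predict_landmarks_2d(model_points, yaw_deg, head_center,
                         focal=800.0, cx=320.0, cy=240.0):
    """Predict 2D landmark positions from a posed 3D face model.

    model_points are landmarks in head coordinates; head_center is the
    head position in camera coordinates (z is depth). All camera
    parameters here are assumed values, not taken from the paper.
    """
    hx, hy, hz = head_center
    predicted = []
    for p in model_points:
        rx, ry, rz = rotate_yaw(p, yaw_deg)
        predicted.append(project((rx + hx, ry + hy, rz + hz), focal, cx, cy))
    return predicted

# A nose tip 100 units in front of the head center, head 1000 units away.
nose_tip = (0.0, 0.0, -100.0)
frontal = predict_landmarks_2d([nose_tip], 0.0, (0.0, 0.0, 1000.0))[0]
profile = predict_landmarks_2d([nose_tip], 90.0, (0.0, 0.0, 1000.0))[0]
```

In a tracking loop of the kind the abstract describes, the predicted 2D positions would seed a 2D tracker, whose output would then be used to update the 3D landmarks.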

Correlation Extraction from KOSHA to enable the Development of Computer Vision based Risks Recognition System

  • Khan, Numan;Kim, Youjin;Lee, Doyeop;Tran, Si Van-Tien;Park, Chansik
    • International Conference Proceedings (국제학술발표논문집) / The 8th International Conference on Construction Engineering and Project Management / pp.87-95 / 2020
  • Generally, occupational safety, and construction safety in particular, is an intricate phenomenon. Over the last three decades, industry professionals have devoted considerable attention to enforcing Occupational Safety and Health (OHS) to enhance safety management in construction. Despite the efforts of safety professionals and government agencies, current safety management still relies on manual inspections, which are infrequent, time-consuming, and prone to error. Extensive research has been carried out to deal with the high fatality rates confronting the construction industry. Sensor systems, visualization-based technologies, and tracking techniques have been deployed by researchers in the last decade. Recently, computer vision has attracted significant attention worldwide in the construction industry. However, the literature reveals that computer vision technology has been applied to safety management only in a narrow scope; hence, broad-scope research on safety monitoring is needed to attain fully automatic job site monitoring. In this regard, the development of a broader-scope computer vision-based risk recognition system for correlation detection between construction entities is inevitable. For this purpose, a detailed analysis was conducted, and related rules that depict the correlations (positive and negative) between construction entities were extracted. A deep-learning-based Mask R-CNN algorithm is applied to train the model. As proof of concept, a prototype is developed based on real scenarios. The proposed approach is expected to enhance the effectiveness of safety inspection and reduce the burden on safety managers. It is anticipated that this approach may enable a reduction in injuries and fatalities by implementing the exact relevant safety rules and will contribute to enhancing overall safety management and monitoring performance.

A Development and Application of Vision System for the Serial Number Recognition of Nuclear Fuel Tube

  • 이찬호;최원혁;허종성
    • Proceedings of the KIEE Conference (대한전기학회:학술대회논문집) / Proceedings of the KIEE 1998 Autumn Conference, Society Headquarters B (대한전기학회 1998년도 추계학술대회 논문집 학회본부 B) / pp.520-522 / 1998
  • The development and application of a machine vision system that automatically recognizes the serial number of a nuclear fuel tube are introduced. For the recognition, an indirect backlight illumination system is designed and a pattern matching algorithm based on a neural network is applied. Various operation and management functions are also developed on a PC under Windows OS for easy operation and data management, respectively. Through the successful application of the vision system, the productivity of the nuclear fuel tube recognition process is greatly improved.

The Effect of Visual Feedback on One-hand Gesture Performance in Vision-based Gesture Recognition System

  • Kim, Jun-Ho;Lim, Ji-Hyoun;Moon, Sung-Hyun
    • Journal of the Ergonomics Society of Korea (대한인간공학회지) / Vol. 31, No. 4 / pp.551-556 / 2012
  • Objective: This study examines the effect of visual feedback on one-hand gesture performance in a vision-based gesture recognition system when people use gestures to control a screen device remotely. Background: Gesture interaction is receiving growing attention because it uses advanced sensor technology and allows users to interact naturally through their own body motion. In generating motion, visual feedback has been considered a critical factor affecting speed and accuracy. Method: Three types of visual feedback (arrow, star, and animation) were selected and 20 gestures were listed. Twelve participants performed each of the 20 gestures while given the three types of visual feedback in turn. Results: People made longer hand traces and took longer to make a gesture when given arrow-shaped feedback than when given star-shaped feedback. The animation-type feedback was the most preferred. Conclusion: The type of visual feedback had a statistically significant effect on the length of the hand trace, elapsed time, and speed of motion in performing a gesture. Application: This study could be applied to any device that needs visual feedback for device control. Larger feedback produced shorter motion traces, shorter times, and faster motion than smaller feedback when people performed gestures to control a device. Large visual feedback would therefore be recommended for situations requiring fast actions, while smaller visual feedback would be recommended for situations requiring elaborate actions.

Deep Learning Model Selection Platform for Object Detection

  • 이한솔;김영관;홍지만
    • Smart Media Journal (스마트미디어저널) / Vol. 8, No. 2 / pp.66-73 / 2019
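  • Object recognition using computer vision has recently drawn attention as a replacement for sensor-based object recognition. Sensor-based object recognition generally requires expensive sensors, which has made commercialization difficult. In contrast, computer-vision-based object recognition can use relatively inexpensive cameras instead of expensive sensors. In addition, since advances in CNNs made real-time object recognition possible, it has been actively adopted in other fields such as IoT and autonomous vehicles. However, choosing and training an object recognition model suited to a given situation requires specialized knowledge of deep learning, which makes such models difficult for non-experts to use. This paper therefore analyzes the structure and performance of deep-learning-based object recognition models and proposes a platform with which users can themselves select the optimal deep-learning-based object recognition model for their requirements. Experiments also demonstrate why statistics-based selection of object recognition models is needed.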