• Title/Summary/Keyword: 카메라 기반 인식

Search Result 702, Processing Time 0.029 seconds

Implementation of Virtual Touch Service Using Hand Gesture Recognition (손동작 인식을 이용한 가상 터치 서비스 구현)

  • A-Ra Cho;Seung-Bae Yoo;Byeong-Hun Yun;Hyung-Ju Cho
    • The Transactions of the Korea Information Processing Society
    • /
    • v.13 no.10
    • /
    • pp.505-512
    • /
    • 2024
  • As the need for hygiene management increases due to COVID-19, the importance of non-contact services is gaining attention. Hands, a tool for expressing intentions and conveying information, are emerging as an alternative to computer input devices such as the keyboard and mouse. In this study, we propose a method to address public health problems that arise when using unmanned ordering machines by controlling a computer using hand gestures detected through a camera. The focus is on identifying frequently used hand gestures, especially the bending of the index finger. To this end, we develop a non-contact input device using the MediaPipe framework and the long short-term memory (LSTM) model. This approach can identify hand gestures in three-dimensional space and provides scenarios that can be applied to the fields of virtual reality (VR) and augmented reality (AR). It offers improved public health and user experience by presenting methods that can be applied to various situations such as navigation systems and unmanned ordering machines.

Object Detection based on Mask R-CNN from Infrared Camera (적외선 카메라 영상에서의 마스크 R-CNN기반 발열객체검출)

  • Song, Hyun Chul;Knag, Min-Sik;Kimg, Tae-Eun
    • Journal of Digital Contents Society
    • /
    • v.19 no.6
    • /
    • pp.1213-1218
    • /
    • 2018
  • Recently introduced Mask R - CNN presents a conceptually simple, flexible, general framework for instance segmentation of objects. In this paper, we propose an algorithm for efficiently searching objects of images, while creating a segmentation mask of heat generation part for an instance which is a heating element in a heat sensed image acquired from a thermal infrared camera. This method called a mask R - CNN is an algorithm that extends Faster R - CNN by adding a branch for predicting an object mask in parallel with an existing branch for recognition of a bounding box. The mask R - CNN is added to the high - speed R - CNN which training is easy and fast to execute. Also, it is easy to generalize the mask R - CNN to other tasks. In this research, we propose an infrared image detection algorithm based on R - CNN and detect heating elements which can not be distinguished by RGB images. As a result of the experiment, a heat-generating object which can not be discriminated from Mask R-CNN was detected normally.

Implementation of smart security CCTV system based on wireless sensor networks and GPS data (무선 센서 네트워크와 GPS정보를 이용한 스마트 보안 CCTV 시스템 구현)

  • Yoon, Kyung-Hyo;Park, Jin-Hong;Kim, Jungjoon;Seo, Dae-Hwa
    • Journal of Advanced Marine Engineering and Technology
    • /
    • v.37 no.8
    • /
    • pp.918-931
    • /
    • 2013
  • The conventional object tracking techniques using PTZ camera detects object movements by analyzing acquired image. However, this technique requires expensive hardware devices to perform a complex image processing. And it is occasionally hard to detect object movements, if an acquired image is low quality or image acquisition is impossible. In this paper, we proposes a smart security CCTV system applying to wireless sensor network technique based on IEEE 802.15.4 standard to overcome the problems of conventional object tracking technique, which enables to track suspicious objects by detecting object movements and GPS data in sensor node. This system enables an efficient control of PTZ camera to observe a wide area, decreasing image processing complexity. Also, wireless sensor network is implemented using mesh networks to increase the efficiency of installing sensor node.

Development of Gas Type Identification Deep-learning Model through Multimodal Method (멀티모달 방식을 통한 가스 종류 인식 딥러닝 모델 개발)

  • Seo Hee Ahn;Gyeong Yeong Kim;Dong Ju Kim
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.12
    • /
    • pp.525-534
    • /
    • 2023
  • Gas leak detection system is a key to minimize the loss of life due to the explosiveness and toxicity of gas. Most of the leak detection systems detect by gas sensors or thermal imaging cameras. To improve the performance of gas leak detection system using single-modal methods, the paper propose multimodal approach to gas sensor data and thermal camera data in developing a gas type identification model. MultimodalGasData, a multimodal open-dataset, is used to compare the performance of the four models developed through multimodal approach to gas sensors and thermal cameras with existing models. As a result, 1D CNN and GasNet models show the highest performance of 96.3% and 96.4%. The performance of the combined early fusion model of 1D CNN and GasNet reached 99.3%, 3.3% higher than the existing model. We hoped that further damage caused by gas leaks can be minimized through the gas leak detection system proposed in the study.

Understanding characteristics of Korean dance performance by image analysis (영상 분석을 통한 우리 춤동작의 특성 이해)

  • Uhm, Tae-Young;Park, Han-Hoon;Park, Jong-Il;Kim, Un-Mi
    • 한국HCI학회:학술대회논문집
    • /
    • 2006.02a
    • /
    • pp.547-554
    • /
    • 2006
  • 우리 춤은 우리 고유의 정서를 담고 있는 종합예술이므로 우리 춤을 분석하고 이해하는 것은 큰 의미가 있다. 본 논문에서는 기존의 춤 동작의 정량적인 분석을 통한 감정인식 기술을 이용하여 우리 춤에 내포된 감정 패턴의 변화를 살펴본다. 먼저 한국 전통춤으로부터 무용전문가들의 정성적 분석에 기반하여 추출된 우리 춤사위를 정해진 각 감정별로 재구성하여 창작하고 창작된 우리 춤을 무용전문가가 시연한다. 이를 카메라를 이용하여 획득하고, 영상처리를 통해서 시연자의 실루엣을 뽑아낸 후, 정량적 특징량들을 추출한다. 이어 신경회로망을 이용하여 각 감정별 춤사위를 학습 시킨 후, 임의의 춤사위에 내포된 감정을 인식 한다. 본 논문에서는 정면, 좌, 우 세 시점에서 획득된 다시점 영상을 이용하여 학습시킴으로써 보다 안정적으로 동작하는 인식 시스템을 제안한다. 그리고, 시스템에 의해 인식된 감정 패턴과 변화의 정성적 의미를 이해하기 위해 무용전문가들에 의해 정립된 정성적 분석 결과와 비교, 분석한다. 이는 정성적인 분석에만 국한되던 우리 춤의 특성에 대한 이해를 객관적이고 정량화된 분석을 통한 이해의 차원으로 확장시키는 것으로, 우리 춤의 특성을 새롭게 정의하는 계기를 마련할 수 있다. 다양한 장르의 한국 전통춤 가운데 우리 춤을 대표할 수 있는 춤사위를 선정하고, 정성적/정량적으로 분석함으로써 우리 춤의 특성을 이해하기 위한 체계적인 틀을 제공하고자 한다.

  • PDF

Implementation of hand motion recognition-based rock-paper-scissors game using ResNet50 transfer learning (ResNet50 전이학습을 활용한 손동작 인식 기반 가위바위보 게임 구현)

  • Park, Changjoon;Kim, Changki;Son, Seongkyu;Lee, Kyoungjin;Yoo, Heekyung;Gwak, Jeonghwan
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2022.01a
    • /
    • pp.77-82
    • /
    • 2022
  • GUI(Graphical User Interface)를 대신하는 차세대 인터페이스로서 NUI(Natural User Interace)에 기대가 모이는 것은 자연스러운 흐름이다. 본 연구는 NUI의 손가락 관절을 포함한 손동작 전체를 인식시키기 위해 웹캠과 카메라를 활용하여 다양한 배경과 각도의 손동작 데이터를 수집한다. 수집된 데이터는 전처리를 거쳐 데이터셋을 구축하며, ResNet50 모델을 활용하여 전이학습한 합성곱 신경망(Convolutional Neural Network) 알고리즘 분류기를 설계한다. 구축한 데이터셋을 입력시켜 분류학습 및 예측을 진행하며, 실시간 영상에서 인식되는 손동작을 설계한 모델에 입력시켜 나온 결과를 통해 가위바위보 게임을 구현한다.

  • PDF

Mobile Iris Recognition System Based on the Near Infrared Light Illuminator of Long Wavelength and Band Pass Filter and Performance Evaluations (장파장 근적외선 조명 및 밴드 패스 필터 기반 이동형 홍채 인식 시스템 및 성능 평가)

  • Cho, So-Ra;Nam, Gi-Pyo;Jeong, Dae-Sik;Shin, Kwang-Yong;Park, Kang-Ryoung;Shin, Jae-Ho
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.9
    • /
    • pp.1125-1137
    • /
    • 2011
  • Recently, there have been previous research about the iris recognition in mobile device to increase portability, whose accuracy is affected by the quality of iris image. Iris image is affected by illumination environment during the image acquisition. The existing system has high accuracy in indoor environment. However the accuracy is degraded in outdoor environment, because the gray levels of iris patterns in image are changed, and ghost and eyelash shading regions are produced by the sunlight of various wavelengths into iris region. To overcome these problems, we propose new mobile iris camera system which uses the near-infrared (NIR) light illuminator of 850 nm and band pass filter (BPF) of 850 nm. To measure the performance of the proposed system, we compared it to the existing one with the iris images captured in indoor and outdoor sunlight environments in terms of the equal error rates (EER) based on false acceptance rate (FAR) and false rejection rate (FRR). The experimental result showed that the proposed system had the lower EERs than those of previous system by 0.96% (with frontal light in indoors), 4.94% (with frontal light in outdoor), 9.24% (with side light in outdoor), and 7% (with back light in outdoor), respectively.

Video-based Facial Emotion Recognition using Active Shape Models and Statistical Pattern Recognizers (Active Shape Model과 통계적 패턴인식기를 이용한 얼굴 영상 기반 감정인식)

  • Jang, Gil-Jin;Jo, Ahra;Park, Jeong-Sik;Seo, Yong-Ho
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.14 no.3
    • /
    • pp.139-146
    • /
    • 2014
  • This paper proposes an efficient method for automatically distinguishing various facial expressions. To recognize the emotions from facial expressions, the facial images are obtained by digital cameras, and a number of feature points were extracted. The extracted feature points are then transformed to 49-dimensional feature vectors which are robust to scale and translational variations, and the facial emotions are recognized by statistical pattern classifiers such Naive Bayes, MLP (multi-layer perceptron), and SVM (support vector machine). Based on the experimental results with 5-fold cross validation, SVM was the best among the classifiers, whose performance was obtained by 50.8% for 6 emotion classification, and 78.0% for 3 emotions.

Korean Text Image Super-Resolution for Improving Text Recognition Accuracy (텍스트 인식률 개선을 위한 한글 텍스트 이미지 초해상화)

  • Junhyeong Kwon;Nam Ik Cho
    • Journal of Broadcast Engineering
    • /
    • v.28 no.2
    • /
    • pp.178-184
    • /
    • 2023
  • Finding texts in general scene images and recognizing their contents is a very important task that can be used as a basis for robot vision, visual assistance, and so on. However, for the low-resolution text images, the degradations, such as noise or blur included in text images, are more noticeable, which leads to severe performance degradation of text recognition accuracy. In this paper, we propose a new Korean text image super-resolution based on a Transformer-based model, which generally shows higher performance than convolutional neural networks. In the experiments, we show that text recognition accuracy for Korean text images can be improved when our proposed text image super-resolution method is used. We also propose a new Korean text image dataset for training our model, which contains massive HR-LR Korean text image pairs.

Design and implementation of a 3-axis Motion Sensor based SWAT Hand-signal Motion-recognition System (3축 모션 센서 기반 SWAT 수신호 모션 인식 시스템 설계 및 구현)

  • Yun, June;Pyun, Kihyun
    • Journal of Internet Computing and Services
    • /
    • v.15 no.4
    • /
    • pp.33-42
    • /
    • 2014
  • Hand-signal is an effective communication means in the situation where voice cannot be used for expression especially for soldiers. Vision-based approaches using cameras as input devices are widely suggested in the literature. However, these approaches are not suitable for soldiers that have unseen visions in many cases. in addition, existing special-glove approaches utilize the information of fingers only. Thus, they are still lack for soldiers' hand-signal recognition that involves not only finger motions, but also additional information such as the rotation of a hand. In this paper, we have designed and implemented a new recognition system for six military hand-signal motions, i. e., 'ready', 'move', quick move', 'crawl', 'stop', and 'lying-down'. For this purpose, we have proposed a finger-recognition method and motion-recognition methods. The finger-recognition method discriminate how much each finger is bended, i. e., 'completely flattened', 'slightly flattened', 'slightly bended', and 'completely bended'. The motion-recognition algorithms are based on the characterization of each hand-signal motion in terms of the three axes. Through repetitive experiments, our system have shown 91.2% of correct recognition.