Implementation of hand motion recognition-based rock-paper-scissors game using ResNet50 transfer learning (ResNet50 전이학습을 활용한 손동작 인식 기반 가위바위보 게임 구현)

  • Park, Changjoon;Kim, Changki;Son, Seongkyu;Lee, Kyoungjin;Yoo, Heekyung;Gwak, Jeonghwan
    • Proceedings of the Korean Society of Computer Information Conference
    • 2022.01a
    • pp.77-82
    • 2022
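  • It is a natural trend that expectations are gathering around the NUI (Natural User Interface) as a next-generation interface to replace the GUI (Graphical User Interface). To recognize entire hand gestures, including the finger joints, for an NUI, this study collects hand gesture data with varied backgrounds and angles using a webcam and a camera. The collected data are preprocessed to build a dataset, and a convolutional neural network (CNN) classifier is designed by transfer learning from the ResNet50 model. The constructed dataset is fed to the classifier for training and prediction, and a rock-paper-scissors game is implemented from the results obtained by feeding hand gestures recognized in real-time video into the designed model.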

Performance Comparison for Exercise Motion classification using Deep Learing-based OpenPose (OpenPose기반 딥러닝을 이용한 운동동작분류 성능 비교)

  • Nam Rye Son;Min A Jung
    • Smart Media Journal
    • v.12 no.7
    • pp.59-67
    • 2023
  • Recently, research on behavior analysis that tracks human posture and movement has been actively conducted. In particular, OpenPose, open-source software developed by CMU in 2017, is a representative method for estimating human appearance and behavior. OpenPose can detect and estimate various body parts of a person, such as height, face, and hands, in real time, making it applicable to fields such as smart healthcare, exercise training, security systems, and medicine. In this paper, we propose a method for classifying four exercise movements - Squat, Walk, Wave, and Fall-down - which are most commonly performed by users in the gym, using two OpenPose-based deep learning models, a DNN and a CNN. The training data are collected by capturing the user's movements from recorded videos and real-time camera captures. The collected dataset is preprocessed with OpenPose and then used to train the proposed DNN and CNN models for exercise movement classification. The errors of the proposed models are evaluated using MSE, RMSE, and MAE. The evaluation results show that the proposed DNN model outperformed the proposed CNN model.
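
The three error measures named in this abstract (MSE, RMSE, MAE) can be sketched as follows. This is a minimal illustration rather than the authors' evaluation code; the function name `error_metrics` and the idea of comparing one-hot labels with predicted class scores are assumptions made for the example.

```python
import math


def error_metrics(y_true, y_pred):
    """Return MSE, RMSE and MAE for two equal-length sequences of numbers.

    Hypothetical helper: the abstract only names the metrics, it does not
    describe how the predictions were encoded.
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    errors = [p - t for t, p in zip(y_true, y_pred)]
    mse = sum(e * e for e in errors) / len(errors)   # mean squared error
    mae = sum(abs(e) for e in errors) / len(errors)  # mean absolute error
    return {"mse": mse, "rmse": math.sqrt(mse), "mae": mae}


# Example: a one-hot label for the first of four classes (e.g. Squat)
# against a hypothetical vector of predicted class scores.
metrics = error_metrics([1, 0, 0, 0], [0.5, 0.5, 0.0, 0.0])
```

RMSE is the square root of MSE, so it is expressed in the same units as the predictions, while MAE is less sensitive to a few large errors.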

Illumination Environment Adaptive Real-time Video Surveillance System for Security of Important Area (중요지역 보안을 위한 조명환경 적응형 실시간 영상 감시 시스템)

  • An, Sung-Jin;Lee, Kwan-Hee;Kwon, Goo-Rak;Kim, Nam-Hyung;Ko, Sung-Jea
    • Journal of the Institute of Electronics Engineers of Korea SP
    • v.44 no.2 s.314
    • pp.116-125
    • 2007
  • In this paper, we propose an illumination-adaptive real-time surveillance system for the security of important areas such as military bases, prisons, and strategic infrastructure. The proposed system recognizes the movement of objects in bright environments as well as under dark illumination. The procedure may be summarized as follows. First, the system discriminates between bright and dark scenes from the distribution of the input image. If the input image is dark, the system preprocesses it with Multi-scale Retinex with Color Restoration (MSRCR) to enhance the contrast of images captured in dark environments. Second, the revised background image is subtracted from the enhanced input image. Morphological image processing is then applied to extract the objects correctly. Finally, the bounding box enclosing each object is tracked. The center point of each bounding box obtained by the proposed algorithm provides more accurate tracking information. Experimental results show that the proposed system performs well even when an object moves very fast and the background is quite dark.

Eye Region Detection Method in Rotated Face using Global Orientation Information (전역적인 에지 오리엔테이션 정보를 이용한 기울어진 얼굴 영상에서의 눈 영역 추출)

  • Jang, Chang-Hyuk;Park, An-Jin;Kurata Takeshi;Jain Anil K.;Park, Se-Hyun;Kim, Eun-Yi;Yang, Jong-Yeol;Jung, Kee-Chul
    • Journal of Korea Society of Industrial Information Systems
    • v.11 no.4
    • pp.82-92
    • 2006
  • In the field of image recognition, research on face recognition has recently attracted a lot of attention. Automatic eye detection, studied as a prerequisite stage, is the most important step in face recognition. Existing eye detection methods, which focus on frontal faces, can be classified mainly into two categories: active infrared (IR)-based approaches and image-based approaches. This paper proposes an eye region detection method for non-frontal faces. The proposed method builds on the edge-based approach, which has the fastest computation time. To extract the eye region in non-frontal faces, the method uses the edge orientation histogram of the global face region. Problems caused by noise and unfavorable ambient light are solved by using the width-to-height proportion as local information and the relationship between components as global information within the approximately extracted region. In experiments, the proposed method improved precision by solving three problems caused by edge information, and it achieved a detection accuracy of 83.5% and a computation time of 0.5 sec per face image on 300 face images provided by the Weizmann Institute of Science.

Development of Cloud Detection Method with Geostationary Ocean Color Imagery for Land Applications (GOCI 영상의 육상 활용을 위한 구름 탐지 기법 개발)

  • Lee, Hwa-Seon;Lee, Kyu-Sung
    • Korean Journal of Remote Sensing
    • v.31 no.5
    • pp.371-384
    • 2015
  • Although GOCI has potential for land surface monitoring, there have been only a few cases of land applications. This may be due to the lack of reliable land products derived from GOCI data for end users. For land applications, it is often essential to provide cloud-free composites over land surfaces. In this study, we propose a cloud detection method, which is essential for making cloud-free composites of GOCI reflectance and vegetation index. Since GOCI does not have the SWIR and TIR spectral bands, which are very effective for separating clouds from other land cover types, we developed a multi-temporal approach to detect clouds. The proposed method consists of three sequential spectral tests. In the first step, a band 1 reflectance threshold is applied to separate confident clear pixels. In the second step, thick cloud is detected by the ratio (b1/b8) of band 1 to band 8 reflectance. In the third step, the average of the b1/b8 ratio values over three consecutive days is used to detect thin cloud, which has mixed spectral characteristics of both cloud and land surfaces. The method provides four classes of cloudiness (thick cloud, thin cloud, probably clear, confident clear). It was validated against MODIS cloud mask products obtained at the same time as the GOCI data acquisition. The percentages of cloudy and cloud-free pixels from GOCI and MODIS are about the same, with less than 10% RMSE. The spatial distributions of clouds detected from the GOCI images were also similar to the MODIS cloud mask products.
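
The three sequential tests and four output classes described in this abstract can be sketched per pixel as below. This is only an illustration under stated assumptions: the abstract gives neither threshold values nor the direction of each comparison, so the thresholds `b1_clear_max`, `thick_ratio_min`, and `thin_margin`, the choice to compare the current b1/b8 ratio with the three-day mean, and the function name `classify_pixel` are all hypothetical.

```python
def classify_pixel(b1, b8, ratio_history,
                   b1_clear_max=0.10, thick_ratio_min=0.80, thin_margin=0.10):
    """Assign one of four cloudiness classes to a single pixel.

    b1, b8        -- band 1 and band 8 reflectance for the current image
    ratio_history -- b1/b8 ratios of the same pixel on three consecutive days
    All threshold values and comparison directions are assumptions,
    not taken from the paper.
    """
    # Step 1: low band 1 reflectance is treated as confidently clear.
    if b1 < b1_clear_max:
        return "confident clear"
    if b8 <= 0 or not ratio_history:
        raise ValueError("b8 must be positive and ratio_history non-empty")
    # Step 2: a high b1/b8 ratio is treated as thick cloud.
    ratio = b1 / b8
    if ratio > thick_ratio_min:
        return "thick cloud"
    # Step 3: a ratio well above the multi-day mean is treated as thin cloud.
    mean_ratio = sum(ratio_history) / len(ratio_history)
    if ratio - mean_ratio > thin_margin:
        return "thin cloud"
    return "probably clear"


label = classify_pixel(0.20, 0.40, [0.30, 0.30, 0.30])
```

Running the tests in sequence means that each later, more expensive test only sees pixels the earlier tests could not decide.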

Fast Image Pre-processing Algorithms Using SSE Instructions (SSE 명령어를 이용한 영상의 고속 전처리 알고리즘)

  • Park, Eun-Soo;Cui, Xuenan;Kim, Jun-Chul;Im, Yu-Cheong;Kim, Hak-Il
    • Journal of the Institute of Electronics Engineers of Korea SP
    • v.46 no.2
    • pp.65-77
    • 2009
  • This paper proposes fast image processing algorithms using SSE (Streaming SIMD Extensions) instructions. CPUs that support SSE instructions have 128-bit XMM registers, and the data held in these registers are processed simultaneously in SIMD (Single Instruction Multiple Data) mode. This paper develops new SIMD image processing algorithms for the mean filter, the Sobel horizontal edge detector, and morphological erosion, which are the operations most widely used in automated optical inspection systems, and compares their processing times. To evaluate the processing time objectively, the developed algorithms are compared with OpenCV 1.0 operating in SISD (Single Instruction Single Data) mode, and with Intel's IPP 5.2 and MIL 8.0, which are fast image processing libraries supporting SIMD mode. The experimental results show that the proposed algorithms are on average 8 times faster than the SISD-mode image processing library and 1.4 times faster than the SIMD fast image processing libraries. The proposed algorithms are therefore applicable to practical high-speed image processing systems without commercial image processing libraries or additional hardware.

Multicomponent RVSP Survey for Imaging Thin Layer Bearing Oil Sand (박층 오일샌드 영상화를 위한 다성분 역VSP 탐사)

  • Jeong, Soo-Cheol;Byun, Joong-Moo
    • Geophysics and Geophysical Exploration
    • v.14 no.3
    • pp.234-241
    • 2011
  • Recently, exploration and development of oil sands have been thriving due to the high oil price. Because an oil sand reservoir usually exists as a thin layer, multicomponent VSP, which offers high resolution around the borehole, is more effective than a surface seismic survey for exploring it. In addition, prestack phase-screen migration is effective for multicomponent seismic data because it is based on a one-way wave equation. In this study, we examined the applicability of prestack phase-screen migration to multicomponent RVSP data for imaging a thin oil sand reservoir. As a preprocessing tool, we presented a method for separating the P-wave and PS-wave in multicomponent RVSP data by using the incidence angle and a rotation matrix. To verify it, we applied the wavefield separation method to synthetic data obtained from a velocity model that includes a horizontal layer and dipping layers. We also compared the image migrated using the P-wave with that using the PS-wave. The PS-wave migrated image has higher resolution and wider coverage than the P-wave migrated image. Finally, we applied prestack phase-screen migration to synthetic data from a velocity model simulating an oil sand reservoir in Canada. The results show that the PS-wave migrated image describes the top and bottom boundaries of the thin oil sand reservoir more clearly than the P-wave migrated image.
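
Separating wave types with an incidence angle and a rotation matrix can be illustrated with a two-component rotation. The sketch below is not the authors' method: it assumes a 2D geometry with vertical (Z) and horizontal (X) components, an incidence angle measured from the vertical, P-wave motion along the ray, and shear motion perpendicular to it. The function name `rotate_to_ray` is hypothetical.

```python
import math


def rotate_to_ray(z, x, incidence_deg):
    """Rotate vertical (z) and horizontal (x) samples into ray coordinates.

    Returns (p, s): motion along the ray (P-like) and perpendicular to it
    (S-like). Assumes a single, known incidence angle for all samples,
    measured from the vertical.
    """
    theta = math.radians(incidence_deg)
    c, s_ = math.cos(theta), math.sin(theta)
    # 2x2 rotation matrix [[c, s], [-s, c]] applied sample by sample.
    p = [c * zi + s_ * xi for zi, xi in zip(z, x)]
    s = [-s_ * zi + c * xi for zi, xi in zip(z, x)]
    return p, s


# A unit-amplitude arrival polarized along a ray at 30 degrees from vertical
# should map entirely onto the P component.
theta = math.radians(30.0)
p, s = rotate_to_ray([math.cos(theta)], [math.sin(theta)], 30.0)
```

In practice the incidence angle varies with source and receiver position, so it would have to be estimated for each trace before rotating.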

A Study on Optical Condition and preprocessing for Input Image Improvement of Dented and Raised Characters of Rubber Tires (고무타이어 문자열 입력영상 개선을 위한 전처리와 광학조건에 관한 연구)

  • 류한성;최중경;권정혁;구본민;박무열
    • Journal of the Korea Institute of Information and Communication Engineering
    • v.6 no.1
    • pp.124-132
    • 2002
  • In this paper, we present a vision algorithm and method for improving and preprocessing input images of dented and raised characters on the sidewall of tires. We define the optical condition relating the reflection coefficient and reflectance through physical vector calculation. This work aims to recognize the engraved characters using computer vision techniques. In tire input images, the characters and the background have almost the same grey levels, and the reflectance of a tire surface is low. Therefore, it is very difficult to segment the characters from the background. Moreover, one side of the character string is raised and the other is dented, so the captured images vary with the angle of the camera and the illumination. For optimum input images, the angle between the camera and the illumination was found to be within 90°. In addition, we used complex filtering with low-pass and high-pass band filters to obtain clearer input images. Finally, we define an equation for the reflection coefficient and reflectance. By doing this, we obtained good tire images for pattern recognition.

Metamorphosis Hierarchical Motion Vector Estimation Algorithm for Multidimensional Image System (다차원 영상 시스템을 위한 변형계층 모션벡터 추정알고리즘)

  • Kim Jeong-Woong;Yang Hae-Sool
    • The KIPS Transactions:PartB
    • v.13B no.2 s.105
    • pp.105-114
    • 2006
  • In a ubiquitous environment, where various kinds of computers are embedded in people, objects, and surroundings, are interconnected, and can be used wherever needed, different types of data must be exchanged between heterogeneous machines over a home network. In such an environment, the efficient processing, transmission, and monitoring of image data are essential technologies. Research is needed not only on traditional image processing topics such as spatial and visual resolution, color expression, and methods of measuring image quality, but also on the transmission rate over a home network with limited bandwidth. This study proposes a new motion vector estimation algorithm for transmitting, processing, and controlling image data, which is the core content in a home network, and uses the algorithm to implement a real-time monitoring system for multidimensional images transmitted from multiple cameras. Image data from stereo cameras, captured under different angles, distances, and other conditions, are preprocessed through reduction, magnification, shift, or correction, and are then compressed and sent using the proposed metamorphosis hierarchical motion vector estimation algorithm for motion correction. The proposed algorithm adopts the advantages and compensates for the disadvantages of existing motion vector estimation algorithms such as full search, three-step search, and hierarchical search, and it efficiently estimates the motion of images with high brightness variation using an atypical small-size macroblock. The proposed algorithm and the implemented image systems can be utilized in various ways in a ubiquitous environment.

Feasibility Study on Producing 1:25,000 Digital Map Using KOMPSAT-5 SAR Stereo Images (KOMPSAT-5 레이더 위성 스테레오 영상을 이용한 1:25,000 수치지형도제작 가능성 연구)

  • Lee, Yong-Suk;Jung, Hyung-Sup
    • Korean Journal of Remote Sensing
    • v.34 no.6_3
    • pp.1329-1350
    • 2018
  • Synthetic aperture radar (SAR) has been used in many Earth observation applications because it can acquire data regardless of weather or local time. However, research on digital map generation using SAR has hardly been performed because of the complex raw data processing. In this study, we assess the feasibility of producing a digital map from SAR stereo images. We collected two KOMPSAT-5 stereo datasets, one from ascending and one from descending orbit acquisitions. To assess the feasibility of digital map generation from SAR stereo images, we performed 1) rational polynomial coefficient transformation from radar geometry, 2) digital restitution using KOMPSAT-5 stereo images, and 3) validation using reference points and check points derived from a digital map. For both models, the root mean squared errors in the XY and Z directions were less than 1 m. We conclude that KOMPSAT-5 stereo images can generate a 1:25,000 digital map that meets the digital map standard. The results would contribute to generating and updating digital maps for inaccessible areas and areas where weather conditions are unstable, such as North Korea or the polar regions.