Search | Korea Science

Real-time 3D Pose Estimation of Both Human Hands via RGB-Depth Camera and Deep Convolutional Neural Networks (RGB-Depth 카메라와 Deep Convolution Neural Networks 기반의 실시간 사람 양손 3D 포즈 추정)

Park, Na Hyeon;Ji, Yong Bin;Gi, Geon;Kim, Tae Yeon;Park, Hye Min;Kim, Tae-Seong
- Proceedings of the Korea Information Processing Society Conference / 2018.10a / pp.686-689 / 2018
3D hand pose estimation (HPE) is an important technology for smart human-computer interfaces. This study presents a deep-learning-based hand pose estimation system that recognizes the 3D poses of both hands in real time from a single RGB-Depth camera. The system consists of four stages. First, Skin Detection and Depth cutting algorithms detect and extract both hands from the RGB and depth images. Second, a Convolutional Neural Network (CNN) classifier distinguishes the right hand from the left hand. The CNN classifier consists of three convolution layers and two fully connected layers and takes the extracted depth image as input. Third, a trained CNN regressor, composed of multiple convolutional, pooling, and fully connected layers, estimates the hand joints from the extracted depth images of the left and right hands. The CNN classifier and regressor are trained on a dataset of 22,000 depth images. Finally, the 3D pose of each hand is reconstructed from the estimated hand joint information. In testing, the CNN classifier distinguished right and left hands with 96.9% accuracy, and the CNN regressor estimated 3D hand joint information with an average error of 8.48 mm. The proposed hand pose estimation system can be used in a variety of applications, including virtual reality (VR), augmented reality (AR), and mixed reality (MR).
https://doi.org/10.3745/PKIPS.y2018m10a.686
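The first stage of the pipeline in the abstract above (extracting the hands from the depth image) can be illustrated with a minimal sketch. The abstract does not describe how Depth cutting is implemented, so the thresholding rule, the function names `depth_cut` and `bounding_box`, and the millimetre range below are all assumptions made for illustration.

```python
from typing import List, Optional, Tuple

Depth = List[List[int]]  # depth image in millimetres; 0 means "no reading"


def depth_cut(depth: Depth, near_mm: int, far_mm: int) -> Depth:
    """Keep pixels whose depth lies in [near_mm, far_mm]; zero out the rest.

    Assumption: the hands are the closest objects to the camera, so a
    fixed depth window isolates them from the background.
    """
    return [[d if near_mm <= d <= far_mm else 0 for d in row] for row in depth]


def bounding_box(mask: Depth) -> Optional[Tuple[int, int, int, int]]:
    """Return (top, left, bottom, right) of the non-zero region, or None."""
    rows = [r for r, row in enumerate(mask) if any(row)]
    cols = [c for row in mask for c, d in enumerate(row) if d]
    if not rows:
        return None
    return (min(rows), min(cols), max(rows), max(cols))


# Toy 3x4 frame: background at 1.5 m, a "hand" at roughly 0.6 m.
frame = [
    [1500, 1500, 1500, 1500],
    [1500, 620, 640, 1500],
    [1500, 610, 0, 1500],
]
hand = depth_cut(frame, near_mm=400, far_mm=800)
print(hand)
print(bounding_box(hand))
```

In a full system the cropped region would then be passed to the left/right classifier and the joint regressor; those networks are not sketched here.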

A Study of Convergence Technology in Robotic Process Automation for Task Automation (업무 자동화를 위한 RPA 융합 기술 고찰)

Kim, Ki-Bong
- Journal of Convergence for Information Technology / v.9 no.7 / pp.8-13 / 2019
In line with the recent trend of the fourth industrial revolution, many companies and institutions are increasingly applying automation technologies based on artificial intelligence to various tasks. In particular, under the government's 52-hour workweek system, companies are increasingly struggling with manpower management. They are therefore interested in RPA (Robotic Process Automation) to automate the office environment and manage manpower efficiently. RPA is being introduced in the back-office operations of credit card companies, banks, and insurers. These RPA solutions require AI-based recognition technology, scripting technology, and business software API-related technologies, and various solutions such as Automate One, Automation Anywhere, UiPath, and Blue Prism are available. This paper analyzes and describes the technology of RPA solutions, the market trend, and the efficiency of RPA adoption.
https://doi.org/10.22156/CS4SMB.2019.9.7.008

Analysis of Understanding Using Deep Learning Facial Expression Recognition for Real Time Online Lectures (딥러닝 표정 인식을 활용한 실시간 온라인 강의 이해도 분석)

Lee, Jaayeon;Jeong, Sohyun;Shin, You Won;Lee, Eunhye;Ha, Yubin;Choi, Jang-Hwan
- Journal of Korea Multimedia Society / v.23 no.12 / pp.1464-1475 / 2020
Due to the spread of COVID-19, online lectures have become more prevalent. However, many students and professors report a lack of communication. This study is therefore designed to improve interactive communication between professors and students in real-time online lectures. To do so, we explore deep learning approaches for automatically recognizing students' facial expressions and classifying their understanding into 3 classes (Understand / Neutral / Not Understand). We use the 'BlazeFace' model for face detection and the 'ResNet-GRU' model for facial expression recognition (FER). We name this entire process the 'Degree of Understanding (DoU)' algorithm. The DoU algorithm can analyze many students collectively and present the result as visualized statistics. To our knowledge, this is the first study to offer statistics of understanding in lectures using FER. The algorithm achieved a speed of 0.098 sec/frame with an accuracy of 94.3% in a CPU environment, demonstrating its potential for real-time online lectures. The DoU algorithm can be extended to various fields where facial expressions play an important role in communication, such as interactions with hearing-impaired people.
https://doi.org/10.9717/kmms.2020.23.12.1464
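The collective statistics described in the abstract above can be illustrated as a simple aggregation over per-student class labels. This is only a sketch: the face detection and FER models are not reproduced, and the function name `understanding_stats` and the percentage output format are assumptions, not the authors' implementation.

```python
from collections import Counter
from typing import Dict, List

# The three classes named in the abstract.
CLASSES = ("Understand", "Neutral", "Not Understand")


def understanding_stats(predictions: List[str]) -> Dict[str, float]:
    """Return the percentage of students in each class for one frame.

    `predictions` holds one label per detected face; how those labels
    are produced (BlazeFace + ResNet-GRU in the paper) is out of scope.
    """
    total = len(predictions)
    if total == 0:
        return {c: 0.0 for c in CLASSES}
    counts = Counter(predictions)
    return {c: 100.0 * counts[c] / total for c in CLASSES}


# Hypothetical labels for four students in a single video frame.
frame_predictions = ["Understand", "Understand", "Neutral", "Not Understand"]
stats = understanding_stats(frame_predictions)
print(stats)
```

At the reported 0.098 sec/frame, such statistics could be refreshed roughly ten times per second.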

Development and Usability Evaluation of Hand Rehabilitation Training System Using Multi-Channel EMG-Based Deep Learning Hand Posture Recognition (다채널 근전도 기반 딥러닝 동작 인식을 활용한 손 재활 훈련시스템 개발 및 사용성 평가)

Ahn, Sung Moo;Lee, Gun Hee;Kim, Se Jin;Bae, So Jeong;Lee, Hyun Ju;Oh, Do Chang;Tae, Ki Sik
- Journal of Biomedical Engineering Research / v.43 no.5 / pp.361-368 / 2022
The purpose of this study was to develop a hand rehabilitation training system for hemiplegic patients. We also sought to recognize five hand postures (WF: Wrist Flexion, WE: Wrist Extension, BG: Ball Grip, HG: Hook Grip, RE: Rest) in real time using multi-channel EMG-based deep learning. Data from five test subjects (1,500 data sets in total), recorded with an 8-channel armband, were pre-processed into Spider Chart images and classified by hand movement with a Convolutional Neural Network (CNN). The recognition accuracy was 92% for WF, 94% for WE, 76% for BG, 82% for HG, and 88% for RE. In addition, ten physical therapists participated in the usability evaluation. The questionnaire consisted of 7 items on acceptance, interest, and satisfaction, each rated on a 5-point scale, and the mean and standard deviation were calculated. High scores were obtained for immersion and interest in the game (4.6±0.43), convenience of the device (4.9±0.30), and satisfaction after treatment (4.1±0.48). On the other hand, conformity of intention for treatment (3.90±0.49) was relatively low. This is thought to be because the game may be difficult to play depending on the degree of spasticity of the hemiplegic patient, and compensation may occur in patients with weakened target muscles. Therefore, it is necessary to develop a rehabilitation program suited to the patient's degree of disability.
https://doi.org/10.9718/JBER.2022.43.5.361
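The Spider Chart pre-processing mentioned in the abstract above can be pictured as placing each EMG channel on its own radial axis. The abstract gives no details of the conversion, so the sketch below is an assumption throughout: the function name `spider_vertices`, the normalization by `max_amp`, and the axis ordering are hypothetical.

```python
import math
from typing import List, Tuple


def spider_vertices(amplitudes: List[float], max_amp: float) -> List[Tuple[float, float]]:
    """Map per-channel amplitudes to polygon vertices on a unit spider chart.

    Channel k is drawn on the axis at angle 2*pi*k/n; the radius is the
    amplitude normalized by max_amp and clipped to [0, 1]. Rasterizing the
    polygon into an image for the CNN is not shown.
    """
    n = len(amplitudes)
    vertices = []
    for k, amp in enumerate(amplitudes):
        radius = min(max(amp / max_amp, 0.0), 1.0)
        theta = 2 * math.pi * k / n
        vertices.append((radius * math.cos(theta), radius * math.sin(theta)))
    return vertices


# Hypothetical amplitudes for an 8-channel armband (arbitrary units).
sample = [0.2, 0.8, 0.5, 0.1, 0.0, 0.3, 1.2, 0.6]
polygon = spider_vertices(sample, max_amp=1.0)
print(polygon)
```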

Object Detection Algorithm for Explaining Products to the Visually Impaired (시각장애인에게 상품을 안내하기 위한 객체 식별 알고리즘)

Park, Dong-Yeon;Lim, Soon-Bum
- The Journal of the Korea Contents Association / v.22 no.10 / pp.1-10 / 2022
Visually impaired people have great difficulty using retail stores because products lack braille information and no other support system exists. In this paper, we propose a basic algorithm for a system that recognizes products in retail stores and describes them by voice. First, the deep learning model detects hand objects and product objects in the input image. Then, it finds the product object that overlaps most with the hand object by comparing the coordinate information of each detected object. We determine that this is the product selected by the user, and the system reads out the product's nutritional information via Text-To-Speech. The evaluation confirmed the high performance of the learning model. The proposed algorithm can be used to build a system that supports the use of retail stores by the visually impaired.
https://doi.org/10.5392/JKCA.2022.22.10.001
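The selection step in the abstract above (finding the product that overlaps most with the hand) can be sketched with axis-aligned bounding boxes. The abstract does not state which overlap measure is used; intersection area is assumed here, and the names `intersection_area` and `select_product` are hypothetical.

```python
from typing import List, Optional, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), with x1 < x2 and y1 < y2


def intersection_area(a: Box, b: Box) -> float:
    """Area shared by two axis-aligned boxes (0.0 if they do not overlap)."""
    width = min(a[2], b[2]) - max(a[0], b[0])
    height = min(a[3], b[3]) - max(a[1], b[1])
    return max(width, 0.0) * max(height, 0.0)


def select_product(hand: Box, products: List[Box]) -> Optional[int]:
    """Return the index of the product box overlapping the hand the most.

    Returns None when no product overlaps the hand at all.
    """
    best_index, best_area = None, 0.0
    for index, product in enumerate(products):
        area = intersection_area(hand, product)
        if area > best_area:
            best_index, best_area = index, area
    return best_index


# Hypothetical detections in pixel coordinates.
hand_box = (10, 10, 50, 50)
product_boxes = [(0, 0, 20, 20), (30, 30, 80, 80), (100, 100, 120, 120)]
print(select_product(hand_box, product_boxes))
```

Intersection over union would be a reasonable alternative measure if large product boxes should not be favoured.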

A Black Ice Detection Method Using Infrared Camera and YOLO (적외선 카메라와 YOLO를 사용한 블랙아이스 탐지 방법)

Kim, Hyung Gyun;Jang, Min Seok;Lee, Yon Sik
- Journal of the Korea Institute of Information and Communication Engineering / v.25 no.12 / pp.1874-1881 / 2021
Black ice, which occurs mainly on roads, vehicle bridges, and tunnel entrances when sub-zero temperatures follow heavy snow and make the road slippery, is difficult for drivers to recognize because it looks like ordinary asphalt. Vehicles lose braking power on it, which causes serious loss of life and property. In this paper, we propose a method that identifies black ice with an infrared camera and determines the road condition with deep learning, in order to compensate for the disadvantages of existing black ice detection methods (satellite imaging, checking slip patterns by ultrasonic reception, measuring the road surface temperature, and checking the difference in tire friction force while driving) and to reduce the size of the black ice detection sensor.
https://doi.org/10.6109/jkiice.2021.25.12.1874

Image Enhancement based on Piece-wise Linear Enhancement Curves for Improved Visibility under Sunlight (햇빛 아래에서 향상된 시인성을 위한 Piece-wise Linear Enhancement Curves 기반 영상 개선)

Lee, Junmin;Song, Byung Cheol
- Journal of Broadcast Engineering / v.27 no.5 / pp.812-815 / 2022
Images displayed on digital devices under sunlight are generally perceived as darker than the original images, which reduces visibility. For better visibility, global luminance compensation or tone mapping adaptive to ambient lighting is required. However, the existing methods have limitations in chrominance compensation and are difficult to use in the real world because of their heavy computational cost. To solve these problems, this paper proposes an image enhancement method based on piece-wise linear enhancement curves (PLECs) that improves both luminance and chrominance. The PLECs are regressed through deep learning and implemented as a lookup table for real-time operation. Experimental results show that the proposed method achieves better visibility than the original image at low computational cost.
https://doi.org/10.5909/JBE.2022.27.5.812
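The lookup-table idea in the abstract above can be sketched as follows: a piece-wise linear curve defined by a few knots is evaluated once for all 256 input levels, after which enhancing a pixel is a single table lookup. The knot values below are hand-picked for illustration (the paper regresses its curves with deep learning), and the names `build_lut` and `apply_lut` are hypothetical.

```python
from typing import List, Tuple


def build_lut(knots: List[Tuple[int, int]]) -> List[int]:
    """Evaluate a piece-wise linear curve at every 8-bit input level.

    `knots` are (input, output) pairs sorted by strictly increasing
    input, starting at input 0 and ending at input 255.
    """
    lut = []
    for x in range(256):
        for (x0, y0), (x1, y1) in zip(knots, knots[1:]):
            if x0 <= x <= x1:
                # Linear interpolation within the segment [x0, x1].
                y = y0 + (y1 - y0) * (x - x0) / (x1 - x0)
                lut.append(round(y))
                break
    return lut


def apply_lut(pixels: List[int], lut: List[int]) -> List[int]:
    """Enhance one channel of 8-bit pixel values via table lookup."""
    return [lut[p] for p in pixels]


# Illustrative curve that brightens dark tones (not taken from the paper).
knots = [(0, 0), (64, 128), (255, 255)]
lut = build_lut(knots)
print(apply_lut([0, 32, 64, 255], lut))
```

Because the table is built once per curve, the per-pixel cost is independent of the number of curve segments.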

Analysis of Korea's Artificial Intelligence Competitiveness Based on Patent Data: Focusing on Patent Index and Topic Modeling (특허데이터 기반 한국의 인공지능 경쟁력 분석 : 특허지표 및 토픽모델링을 중심으로)

Lee, Hyun-Sang;Qiao, Xin;Shin, Sun-Young;Kim, Gyu-Ri;Oh, Se-Hwan
- Informatization Policy / v.29 no.4 / pp.43-66 / 2022
With the development of artificial intelligence technology, competition for artificial intelligence technology patents around the world is intensifying. From 2000 to 2021, artificial intelligence technology patent applications at the US Patent and Trademark Office increased steadily, and growth has been steeper since the 2010s. An analysis of Korea's artificial intelligence technology competitiveness through patent indices shows that patent activity, impact, and marketability are superior in areas such as auditory intelligence and visual intelligence. However, compared to other countries, Korea's artificial intelligence technology patents overall are good in terms of activity and marketability but somewhat inferior in technological impact. While noise canceling and voice recognition have recently declined as topics in artificial intelligence, growth is expected in areas such as model learning optimization, smart sensors, and autonomous driving. In Korea's case, further effort is required because patent applications are somewhat lacking in areas such as fraud detection/security and medical vision learning.
https://doi.org/10.22693/NIAIP.2022.29.4.043

AIoT-based High-risk Industrial Safety Management System of Artificial Intelligence (AIoT 기반 고위험 산업안전관리시스템 인공지능 연구)

Yeo, Seong-koo;Park, Dea-Woo
- Journal of the Korea Institute of Information and Communication Engineering / v.26 no.9 / pp.1272-1278 / 2022
The government enacted and promulgated the 'Severe Accident Punishment Act' in January 2021 and is implementing this law. However, the number of occupational accidents in 2021 increased by 10.7% compared to the same period of the previous year. Therefore, safety measures are urgently needed in the industrial field. In this study, BLE Mesh networking technology is applied to the safety management of high-risk industrial sites with poor communication environments. The complex-sensor AIoT device collects gas sensing values, voice, and motion values in real time, analyzes them with LSTM and CNN artificial intelligence algorithms, recognizes dangerous situations, and transmits them to the server. The server monitors the transmitted risk information in real time so that immediate relief measures can be taken. Applying the AIoT device and safety management system proposed in this study to high-risk industrial sites is expected to minimize industrial accidents and contribute to the expansion of the social safety net.
https://doi.org/10.6109/jkiice.2022.26.9.1272

Metal Surface Defect Detection and Classification using EfficientNetV2 and YOLOv5 (EfficientNetV2 및 YOLOv5를 사용한 금속 표면 결함 검출 및 분류)

Alibek, Esanov;Kim, Kang-Chul
- The Journal of the Korea institute of electronic communication sciences / v.17 no.4 / pp.577-586 / 2022
Detection and classification of steel surface defects are critical for product quality control in the steel industry. However, due to its low accuracy and slow speed, the traditional approach cannot be used effectively in a production line. The current, widely used deep-learning-based algorithm has an accuracy problem, and there is still room for improvement. This paper proposes a method of steel surface defect detection that combines EfficientNetV2 for image classification and YOLOv5 as an object detector. Shorter training time and high accuracy are advantages of this model. First, the EfficientNetV2 model classifies the input image into defect classes and predicts the probability that it contains a defect. If this probability is less than 0.25, the algorithm directly concludes that the sample has no defects. Otherwise, the sample is passed to YOLOv5 to carry out defect detection on the metal surface. Experiments show that the proposed model performs well on the NEU dataset, with an accuracy of 98.3%. In addition, the average training time is shorter than that of other models.
https://doi.org/10.13067/JKIECS.2022.17.4.577
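The two-stage decision rule in the abstract above can be sketched as a small cascade. Only the 0.25 threshold comes from the abstract; the function name `inspect_sample`, the callable interfaces, and the stub models are assumptions standing in for EfficientNetV2 and YOLOv5, which are not reproduced here.

```python
from typing import Callable, List, Tuple

Detection = Tuple[str, float]  # (defect class, confidence); format is assumed

DEFECT_THRESHOLD = 0.25  # threshold stated in the abstract


def inspect_sample(
    image: object,
    classify: Callable[[object], float],
    detect: Callable[[object], List[Detection]],
) -> List[Detection]:
    """Run the classifier first and call the detector only when needed.

    `classify` returns the probability that the image contains a defect;
    `detect` returns the localized defects. Both are placeholders.
    """
    if classify(image) < DEFECT_THRESHOLD:
        return []  # treated as defect-free; the detector is skipped
    return detect(image)


# Stub models for illustration only.
def stub_detector(image: object) -> List[Detection]:
    return [("scratch", 0.9)]


clean = inspect_sample("sample-a", lambda img: 0.10, stub_detector)
flagged = inspect_sample("sample-b", lambda img: 0.70, stub_detector)
print(clean, flagged)
```

Skipping the detector for low-probability samples is what lets the cheaper classifier act as a filter in front of the more expensive detection stage.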
