Search | Korea Science

An Object Tracking Technique on OpenCV-based CMT Algorithm Using Raspberry Pi (라즈베리 파이를 이용한 OpenCV 기반 CMT 알고리즘의 객체 추적 기법)

Song, Gi-Beom;Yang, Yong-Jun;Lee, Sang-Gu
- Proceedings of the Korea Information Processing Society Conference
- /
- 2017.11a
- /
- pp.895-898
- /
- 2017
동영상 플랫폼 확산에 따라 영상 콘텐츠 수요와 공급이 폭발적으로 성장하여 영상 콘텐츠 확산에 큰 영향을 미치고 있다. 이와 같은 콘텐츠 확산으로 인해 전문가의 영역에 해당하는 영상 제작 기술이 일반적인 기술의 범주로 인식될 만큼 영상 처리 및 제작에 대한 기술의 발전이 나날이 급증하고 있으며 이러한 기술의 발전에 따라 사람의 수작업을 통해서만 영상을 조절하던 과정 또한 객체 추적 기술을 활용함에 따라 자동화 과정이 가능하게 되었다. 본 연구에서는 객체 추적 알고리즘 중 특징 점을 키포인트로 나눠 객체를 추적하는 OpenCV 기반 CMT 알고리즘으로 라즈베리파이를 활용하여 객체를 추적하는 기법을 소개한다. 본 시스템은 방송용 카메라의 객체 추적에 잘 활용될 수 있다.
https://doi.org/10.3745/PKIPS.y2017m11a.895 인용 PDF

Contextual Object Detection using Deep Learning (딥러닝 기반의 객체 맥락정보 탐지)

Kim, Geonuk;Sin, Jaeyong;Hwang, Gisu;Huh, Yoojin;Oh, Seoung-Jun
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2018.06a
- /
- pp.120-122
- /
- 2018
이미지에서 단순히 객체탐지를 하는 것이 아닌, 맥락정보를 탐지하는 해내는 것은 이미지 분석 분야에서 활발히 진행해온 연구분야 중 하나이다. 본 논문은 검출된 객체와 사람 간의 맥락 정보를 실시간으로 검출하기 위해 관심있는 객체와 인체의 키포인트를 탐지한 후, 그 두 영역 사이의 거리정보를 이용하여 맥락정보를 추출하는 알고리즘을 제안한다. 이는 CNN으로 이루어진 단일 구조 방식이기에 낮은 시스템 복잡도를 갖는다. 이 방법을 통하여 사람과 연관된 객체 사이의 맥락 정보와 그 위치정보를 출력함으로써 CCTV내 무장한 테러범의 위치나 축구 경기 내 공을 소유한 선수를 찾는 경우 등의 실질적인 이미지 분석에 활용할 수 있다.
PDF

Construction of Dynamic Image Animation Network for Style Transformation Using GAN, Keypoint and Local Affine (GAN 및 키포인트와 로컬 아핀 변환을 이용한 스타일 변환 동적인 이미지 애니메이션 네트워크 구축)

Jang, Jun-Bo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2022.05a
- /
- pp.497-500
- /
- 2022
High-quality images and videos are being generated as technologies for deep learning-based image style translation and conversion of static images into dynamic images have developed. However, it takes a lot of time and resources to manually transform images, as well as professional knowledge due to the difficulty of natural image transformation. Therefore, in this paper, we study natural style mixing through a style conversion network using GAN and natural dynamic image generation using the First Order Motion Model network (FOMM).
https://doi.org/10.3745/PKIPS.y2022m05a.497 인용 PDF

Virtual Nail Art Using Nail Detection (손톱 검출을 이용한 가상 네일아트)

Mun, Sae-byeol;Heo, Hoon;Oh, Jeong-su
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2021.10a
- /
- pp.413-415
- /
- 2021
This paper proposes a nail detection algorithm using OpenPose and implements virtual nail art using it. Based on the key points detected by OpenPose, the finger area is detected using skin color characteristics for each finger. The nail region is detected from the edge image of the detected finger region. Then, a virtual nail art is implemented by synthesizing nail tips in the nail area. In a somewhat controlled shooting environment, simulation results show that the proposed algorithm detects nail areas well and implements virtual nail art well.
PDF

A Study on Steganography to Hide Secret Messages in Skeleton Datasets for Action Recognition (행동 인식을 위한 스켈레톤 데이터셋에 비밀 메시지를 은닉하기 위한 스테가노그라피 연구)

Sung, Rakbin;Lee, Daewon
- Proceedings of the Korea Information Processing Society Conference
- /
- 2022.05a
- /
- pp.157-160
- /
- 2022
딥러닝이 각광받기 시작하면서 인간의 자세와 행동을 인식하고 분류하기 위한 인공지능 기술 또한 급속도로 발전하게 되었다. 영상에서 인간의 자세를 디지털 데이터로 표현할 때 인체의 주요 관절점의 위치와 연결관계를 나타내는 스켈레톤 표현 방식을 주로 사용한다. 본 논문에서는 스켈레톤 데이터에 비밀 메시지를 은닉할 수 있는 스테가노그라피 알고리즘에 대해 소개하고, 스켈레톤을 구성하는 주요 관절점 키포인트를 조작했을 때 행동 인식 인공지능 모델이 어떻게 반응하는지 살펴봄으로써 스켈레톤 데이터에 대한 스테가노그라피 알고리즘의 특성과 보안성에 대해 논의한다.
https://doi.org/10.3745/PKIPS.y2022m05a.157 인용 PDF

Improving Detection Range for Short Baseline Stereo Cameras Using Convolutional Neural Networks and Keypoint Matching (컨볼루션 뉴럴 네트워크와 키포인트 매칭을 이용한 짧은 베이스라인 스테레오 카메라의 거리 센싱 능력 향상)

Byungjae Park
- Journal of Sensor Science and Technology
- /
- v.33 no.2
- /
- pp.98-104
- /
- 2024
This study proposes a method to overcome the limited detection range of short-baseline stereo cameras (SBSCs). The proposed method includes two steps: (1) predicting an unscaled initial depth using monocular depth estimation (MDE) and (2) adjusting the unscaled initial depth by a scale factor. The scale factor is computed by triangulating the sparse visual keypoints extracted from the left and right images of the SBSC. The proposed method allows the use of any pre-trained MDE model without the need for additional training or data collection, making it efficient even when considering the computational constraints of small platforms. Using an open dataset, the performance of the proposed method was demonstrated by comparing it with other conventional stereo-based depth estimation methods.
https://doi.org/10.46670/JSST.2024.33.2.98 인용 PDF

Pose Estimation Techniques for Humanoid Characters in FPS Gaming Environments (인간 캐릭터 포즈 식별: FPS 게임에서의 포즈 추정 기법)

Youjung Han;Minseop Lee;Minsu Cha;Jiyoung Woo
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2024.01a
- /
- pp.29-30
- /
- 2024
본 논문은 Krafton의 PUBG: BATTLEGROUNDS 게임에서 플레이어 분류를 목표로 하며, 포즈 추정기술을 사용하여 일반 플레이어와 봇을 구분한다. 이는 게임에서 직접 수집한 비디오 데이터를 기반으로 하며, 다음과 같은 두 가지 접근 방식을 제안한다. 첫 번째 방법은 동작 시퀀스 분석을 통해, 사용자의 특정동작 패턴을 식별하고 로지스틱 회귀 모델을 활용해 사용자 유형을 분류한다. 두 번째 방법은 YOLO-pose 모델을 사용하여 비디오 데이터에서 키포인트를 추출하고, 이를 LSTM 모델에 적용하여 프레임별로 사용자의 유형을 분류한다. 이러한 이중 접근 방식은 게임의 공정성과 사용자 경험을 향상시키는 새로운 도구를 제공하며, 보다 안전한 게임 환경에 기여할 수 있다. 이 연구는 게임 산업뿐만 아니라 보안 및 모니터링 분야에서도 동작 분석에 대한 혁신적인 접근 방식으로 활용될 잠재력을 가지고 있다.
PDF

A Study on the Estimation of Multi-Object Social Distancing Using Stereo Vision and AlphaPose (Stereo Vision과 AlphaPose를 이용한 다중 객체 거리 추정 방법에 관한 연구)

Lee, Ju-Min;Bae, Hyeon-Jae;Jang, Gyu-Jin;Kim, Jin-Pyeong
- KIPS Transactions on Software and Data Engineering
- /
- v.10 no.7
- /
- pp.279-286
- /
- 2021
Recently, We are carrying out a policy of physical distancing of at least 1m from each other to prevent the spreading of COVID-19 disease in public places. In this paper, we propose a method for measuring distances between people in real time and an automation system that recognizes objects that are within 1 meter of each other from stereo images acquired by drones or CCTVs according to the estimated distance. A problem with existing methods used to estimate distances between multiple objects is that they do not obtain three-dimensional information of objects using only one CCTV. his is because three-dimensional information is necessary to measure distances between people when they are right next to each other or overlap in two dimensional image. Furthermore, they use only the Bounding Box information to obtain the exact coordinates of human existence. Therefore, in this paper, to obtain the exact two-dimensional coordinate value in which a person exists, we extract a person's key point to detect the location, convert it to a three-dimensional coordinate value using Stereo Vision and Camera Calibration, and estimate the Euclidean distance between people. As a result of performing an experiment for estimating the accuracy of 3D coordinates and the distance between objects (persons), the average error within 0.098m was shown in the estimation of the distance between multiple people within 1m.
https://doi.org/10.3745/KTSDE.2021.10.7.279 인용 PDF KSCI

A Study on How to Effectively apply BCMS for small-scale manufacturing under 50 employees (50인 이하 소규모 제조업에 대한 효과적인 BCMS 적용)

Lee, Dae-Hyeong;Cheung, Chong-Soo
- Proceedings of the Korean Society of Disaster Information Conference
- /
- 2017.11a
- /
- pp.255-256
- /
- 2017
본 논문에서는 50인 이하 소규모 제조업에 대해 어떤 방법으로 접근하는 것이 BCMS를 적용 하는데 효과적인지 연구하였다. BCMS의 적용범위가 상당히 광범위하고 현재 BCMS를 적용하고 있는 기업의 규모는 대부분 크며, 기업인들의 인식도 큰 규모만이 BCMS가 적용될 수 있다고 믿고 있다. 이에 실질적으로 한국의 기업군을 대부분 차지하고 있는 50인 이하의 제조업이 어떻게 BCMS를 적용하는 것이 효과적인지를 개발하여 소규모 기업도 어떠한 상황 하에서도 사업이 지속가능할 수 있도록 BCMS를 적용 하고자 한다. 일단 50인 이하 소규모 업체는 조직의 상황이 대기업이나 중견기업에 비해 간단하다. 또한 이해관계자도 대부분 많지 않다. 그렇기 때문에 BCMS를 수립하기 위해 파악하여야 할 핵심 업무나 관련 리스크도 많지 않을 것이다. 여기에 착안을 하여 기존에 대부분 갖추고 있는 품질, 환경, 안전보건시스템과 연계하여 사업연속성관리 시스템을 구축하면 50인 이하 기업도 별도의 추가시스템이 아닌 통합시스템의 일환으로 손쉽게 BCMS를 적용하여 어떠한 리스크에도 사업의 연속성을 갖출 수 있는 능력을 가질 것이다. BCMS에서 요구하는 사항에 대해 개별적인 접근방식보다 기존에 갖추고 있는 시스템을 적용하여 통합시스템 접근방식이 키포인트이다.
PDF

Sign2Gloss2Text-based Sign Language Translation with Enhanced Spatial-temporal Information Centered on Sign Language Movement Keypoints (수어 동작 키포인트 중심의 시공간적 정보를 강화한 Sign2Gloss2Text 기반의 수어 번역)

Kim, Minchae;Kim, Jungeun;Kim, Ha Young
- Journal of Korea Multimedia Society
- /
- v.25 no.10
- /
- pp.1535-1545
- /
- 2022
Sign language has completely different meaning depending on the direction of the hand or the change of facial expression even with the same gesture. In this respect, it is crucial to capture the spatial-temporal structure information of each movement. However, sign language translation studies based on Sign2Gloss2Text only convey comprehensive spatial-temporal information about the entire sign language movement. Consequently, detailed information (facial expression, gestures, and etc.) of each movement that is important for sign language translation is not emphasized. Accordingly, in this paper, we propose Spatial-temporal Keypoints Centered Sign2Gloss2Text Translation, named STKC-Sign2 Gloss2Text, to supplement the sequential and semantic information of keypoints which are the core of recognizing and translating sign language. STKC-Sign2Gloss2Text consists of two steps, Spatial Keypoints Embedding, which extracts 121 major keypoints from each image, and Temporal Keypoints Embedding, which emphasizes sequential information using Bi-GRU for extracted keypoints of sign language. The proposed model outperformed all Bilingual Evaluation Understudy(BLEU) scores in Development(DEV) and Testing(TEST) than Sign2Gloss2Text as the baseline, and in particular, it proved the effectiveness of the proposed methodology by achieving 23.19, an improvement of 1.87 based on TEST BLEU-4.
https://doi.org/10.9717/kmms.2022.25.10.1535 인용 PDF KSCI

Search Result 41, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)