통합 검색 | Korea Science

딥러닝 기술을 이용한 3차원 객체 추적 기술 리뷰 (A Review of 3D Object Tracking Methods Using Deep Learning)

박한훈
- 융합신호처리학회논문지
- /
- 제22권1호
- /
- pp.30-37
- /
- 2021
카메라 영상을 이용한 3차원 객체 추적 기술은 증강현실 응용 분야를 위한 핵심 기술이다. 영상 분류, 객체 검출, 영상 분할과 같은 컴퓨터 비전 작업에서 CNN(Convolutional Neural Network)의 인상적인 성공에 자극 받아, 3D 객체 추적을 위한 최근의 연구는 딥러닝(deep learning)을 활용하는 데 초점을 맞추고 있다. 본 논문은 이러한 딥러닝을 활용한 3차원 객체 추적 방법들을 살펴본다. 딥러닝을 활용한 3차원 객체 추적을 위한 주요 방법들을 설명하고, 향후 연구 방향에 대해 논의한다.
PDF KSCI

PERSONAL SPACE-BASED MODELING OF RELATIONSHIPS BETWEEN PEOPLE FOR NEW HUMAN-COMPUTER INTERACTION

Amaoka, Toshitaka;Laga, Hamid;Saito, Suguru;Nakajima, Masayuki
- 한국방송∙미디어공학회:학술대회논문집
- /
- 한국방송공학회 2009년도 IWAIT
- /
- pp.746-750
- /
- 2009
In this paper we focus on the Personal Space (PS) as a nonverbal communication concept to build a new Human Computer Interaction. The analysis of people positions with respect to their PS gives an idea on the nature of their relationship. We propose to analyze and model the PS using Computer Vision (CV), and visualize it using Computer Graphics. For this purpose, we define the PS based on four parameters: distance between people, their face orientations, age, and gender. We automatically estimate the first two parameters from image sequences using CV technology, while the two other parameters are set manually. Finally, we calculate the two-dimensional relationship of multiple persons and visualize it as 3D contours in real-time. Our method can sense and visualize invisible and unconscious PS distributions and convey the spatial relationship of users by an intuitive visual representation. The results of this paper can be used to Human Computer Interaction in public spaces.
PDF

이미지-텍스트 자질을 이용한 행동 포착 비디오 기반 대화시스템 (Audio-Visual Scene Aware Dialogue System Utilizing Action From Vision and Language Features)

임정우;장윤나;손준영;이승윤;박기남;임희석
- 한국정보과학회 언어공학연구회:학술대회논문집(한글 및 한국어 정보처리)
- /
- 한국정보과학회언어공학연구회 2023년도 제35회 한글 및 한국어 정보처리 학술대회
- /
- pp.253-257
- /
- 2023
최근 다양한 대화 시스템이 스마트폰 어시스턴트, 자동 차 내비게이션, 음성 제어 스피커, 인간 중심 로봇 등의 실세계 인간-기계 인터페이스에 적용되고 있다. 하지만 대부분의 대화 시스템은 텍스트 기반으로 작동해 다중 모달리티 입력을 처리할 수 없다. 이 문제를 해결하기 위해서는 비디오와 같은 다중 모달리티 장면 인식을 통합한 대화 시스템이 필요하다. 기존의 비디오 기반 대화 시스템은 주로 시각, 이미지, 오디오 등의 다양한 자질을 합성하거나 사전 학습을 통해 이미지와 텍스트를 잘 정렬하는 데에만 집중하여 중요한 행동 단서와 소리 단서를 놓치고 있다는 한계가 존재한다. 본 논문은 이미지-텍스트 정렬의 사전학습 임베딩과 행동 단서, 소리 단서를 활용해 비디오 기반 대화 시스템을 개선한다. 제안한 모델은 텍스트와 이미지, 그리고 오디오 임베딩을 인코딩하고, 이를 바탕으로 관련 프레임과 행동 단서를 추출하여 발화를 생성하는 과정을 거친다. AVSD 데이터셋에서의 실험 결과, 제안한 모델이 기존의 모델보다 높은 성능을 보였으며, 대표적인 이미지-텍스트 자질들을 비디오 기반 대화시스템에서 비교 분석하였다.
PDF

Bitcoin and the Monetary System Revolution Changes

Alotaibi, Leena;Alsalmi, Azhar;Alsuwat, Hatim;Alsuwat, Emad
- International Journal of Computer Science & Network Security
- /
- 제21권6호
- /
- pp.156-160
- /
- 2021
Every day brings a new challenge to the humanities. Life nowadays needs accuracy, privacy, integrity, authenticity, and security to run life systems especially the monetary system. Things now differ from previous centuries. Multiple varieties in digital banking have opened the new and most advanced innovations for human beings. The monetary system is going to developed day by day to facilitate the public. Electronic money has amazed the world and gave a challenge to central banking. For this purpose, there will be a need for strict security, information, and confidence. Blockchain technology has opened new gateways. Bitcoin has become the most famous digital currency, which has created a thunderstorm in digital marketing. Blockchain, as a new Financial Technology, has satisfied all the security issues and satisfied doing business in secure ways that encourage investors to invest and keep the world business wheel. Assessment of the sustainability of implementing Bitcoin in financial institutions will be discussed. Every new system has its pros and cons in which a clear vision of what we are about to use can be sought. Through this research paper, a demonstration of the monetary system evolution, the new ways of doing business, some evidence in a form of academic cases will be demonstrated through comparison a table, a suggested method to transfer to the new system in safe mode will be proposed, and a conclusion will be concluded.
https://doi.org/10.22937/IJCSNS.2021.21.6.21 인용 PDF KSCI

컴퓨터 비전 기술 기반 건설장비 객체 추출 모델 적용 분석 연구 (A Study on the Construction Equipment Object Extraction Model Based on Computer Vision Technology)

강성원;유위성;신윤석
- 한국재난정보학회 논문집
- /
- 제19권4호
- /
- pp.916-923
- /
- 2023
연구목적: 2022년 산업재해 현황 부가통계에서 건설업 사망사고자 현황을 보면 건설업 전체 사망사고자의 27.8%가 건설장비로 인해 발생하고 있다. 현장 대형화, 고층화 등으로 발생하는 순회 및 점검의 한계를 극복하기 위해 컴퓨터 비전 기술을 활용해 건설장비를 추출할 수 있는 모델을 구축하고 해당 모델의 정확도 및 현장 적용성에 대해 분석하고자 한다. 연구방법:본 연구에서는 건설장비 중 굴착기, 덤프트럭, 이동식 크레인의 이미지 데이터를 딥러닝 학습시킨 뒤 학습 결과를 평가 및 분석하고 건설현장에 적용하여 분석한다. 연구결과: 'A' 현장에서는 굴착기 및 덤프트럭의 객체를 추출하였으며, 평균 추출 정확도는 굴착기 81.42%, 덤프트럭 78.23%를 나타냈다. 'B' 현장의 이동식 크레인은 78.14%의 평균 정확도를 보여줬다. 결론: 현장 안전관리의 효율성이 증가할 수 있고, 재해발생 위험요인을 최소화 할 수 있을것이라 본다. 또한, 본 연구를 기반으로 건설현장에 스마트 건설기술 도입에 관한 기초적인 자료로 활용이 가능하다.
https://doi.org/10.15683/kosdi.2023.12.31.916 인용 PDF HTML

Design of Smart Device Assistive Emergency WayFinder Using Vision Based Emergency Exit Sign Detection

이민우;비나야감 마리아판;비투무키자 조셉;이정훈;조주필;차재상
- 한국위성정보통신학회논문지
- /
- 제12권1호
- /
- pp.101-106
- /
- 2017
In this paper, we present Emergency exit signs are installed to provide escape routes or ways in buildings like shopping malls, hospitals, industry, and government complex, etc. and various other places for safety purpose to aid people to escape easily during emergency situations. In case of an emergency situation like smoke, fire, bad lightings and crowded stamped condition at emergency situations, it's difficult for people to recognize the emergency exit signs and emergency doors to exit from the emergency building areas. This paper propose an automatic emergency exit sing recognition to find exit direction using a smart device. The proposed approach aims to develop an computer vision based smart phone application to detect emergency exit signs using the smart device camera and guide the direction to escape in the visible and audible output format. In this research, a CAMShift object tracking approach is used to detect the emergency exit sign and the direction information extracted using template matching method. The direction information of the exit sign is stored in a text format and then using text-to-speech the text synthesized to audible acoustic signal. The synthesized acoustic signal render on smart device speaker as an escape guide information to the user. This research result is analyzed and concluded from the views of visual elements selecting, EXIT appearance design and EXIT's placement in the building, which is very valuable and can be commonly referred in wayfinder system.
PDF KSCI

지능형 엣지 컴퓨팅 기기를 위한 온디바이스 AI 비전 모델의 경량화 방식 분석 (Analysis on Lightweight Methods of On-Device AI Vision Model for Intelligent Edge Computing Devices)

주혜현;강남희
- 한국인터넷방송통신학회논문지
- /
- 제24권1호
- /
- pp.1-8
- /
- 2024
실시간 처리 및 프라이버시 강화를 위해 인공지능 모델을 엣지에서 동작시킬 수 있는 온디바이스 AI 기술이 각광받고 있다. 지능형 사물인터넷 기술이 다양한 산업에 적용되면서 온디바이스 AI 기술을 활용한 서비스가 크게 증가하고 있다. 그러나 일반적인 딥러닝 모델은 추론 및 학습을 위해 많은 연산 자원을 요구하고 있다. 따라서 엣지에 적용되는 경량 기기에서 딥러닝 모델을 동작시키기 위해 양자화나 가지치기와 같은 다양한 경량화 기법들이 적용되어야 한다. 본 논문에서는 다양한 경량화 기법 중 가지치기 기술을 중심으로 엣지 컴퓨팅 기기에서 딥러닝 모델을 경량화하여 적용할 수 있는 방안을 분석한다. 특히, 동적 및 정적 가지치기 기법을 적용하여 경량화된 비전 모델의 추론 속도, 정확도 그리고 메모리 사용량을 시험한다. 논문에서 분석된 내용은 실시간 특성이 중요한 지능형 영상 관제 시스템이나 자율 이동체의 영상 보안 시스템에 적용될 수 있다. 또한 사물인터넷 기술이 적용되는 다양한 서비스와 산업에 더욱 효과적으로 활용될 수 있을 것으로 기대된다.
https://doi.org/10.7236/JIIBC.2024.24.1.1 인용 PDF HTML

Background Subtraction in Dynamic Environment based on Modified Adaptive GMM with TTD for Moving Object Detection

Niranjil, Kumar A.;Sureshkumar, C.
- Journal of Electrical Engineering and Technology
- /
- 제10권1호
- /
- pp.372-378
- /
- 2015
Background subtraction is the first processing stage in video surveillance. It is a general term for a process which aims to separate foreground objects from a background. The goal is to construct and maintain a statistical representation of the scene that the camera sees. The output of background subtraction will be an input to a higher-level process. Background subtraction under dynamic environment in the video sequences is one such complex task. It is an important research topic in image analysis and computer vision domains. This work deals background modeling based on modified adaptive Gaussian mixture model (GMM) with three temporal differencing (TTD) method in dynamic environment. The results of background subtraction on several sequences in various testing environments show that the proposed method is efficient and robust for the dynamic environment and achieves good accuracy.
https://doi.org/10.5370/JEET.2015.10.1.372 인용 PDF KSCI KPUBS HTML

Performance of Human Skin Detection in Images According to Color Spaces

Kim, Jun-Yup;Do, Yong-Tae
- 한국정보기술응용학회:학술대회논문집
- /
- 한국정보기술응용학회 2005년도 6th 2005 International Conference on Computers, Communications and System
- /
- pp.153-156
- /
- 2005
Skin region detection in images is an important process in many computer vision applications targeting humans such as hand gesture recognition and face identification. It usually starts at a pixel-level, and involves a pre-process of color spae transformation followed by a classification process. A color space transformation is assumed to increase separability between skin classes and other classes, to increase similarity among different skin tones, and to bring a robust performance under varying imaging conditions, without any complicated analysis. In this paper, we examine if the color space transformation actually brings those benefits to the problem of skin region detection on a set of human hand images with different postures, backgrounds, people, and illuminations. Our experimental results indicate that color space transfomation affects the skin detection performance. Although the performance depends on camera and surround conditions, normalized [R, G, B] color space may be a good choice in general.
PDF

교통 데이터 수집을 위한 객체 인식 통합 프레임워크 개발 (Development of an Integrated Traffic Object Detection Framework for Traffic Data Collection)

양인철;전우훈;이조영;박지현
- 한국ITS학회 논문지
- /
- 제18권6호
- /
- pp.191-201
- /
- 2019
본 연구에서는 다양한 외부 조건 하에서 촬영된 영상을 대상으로 신속하고 정확하게 교통 객체를 검출하는 교통 객체 검출 통합 프레임워크를 개발하였다. 제안된 프레임워크는 딥러닝 기술 기반의 직접 객체 인식 기술과 다중 객체 추적 기술, 그리고 동영상 전처리 기술로 구성되며, 영상의 안정성, 기상, 촬영 각도 등의 다양한 외부 조건에서 촬영된 영상을 대상으로 승용차, 버스, 트럭, 및 미니밴과 같은 교통 객체를 인식하고, 이를 실시간으로 추적하여 교통량 데이터를 계수한다. 제안된 방법의 성능 검증을 위해 다양한 외부 조건에서 촬영된 영상 8개를 대상으로 제안된 방법의 성능 검증을 수행한 결과, 우천 및 강설을 제외한 모든 조건에서 98% 이상의 높은 정확도를 보이는 것으로 나타났다.
https://doi.org/10.12815/kits.2019.18.6.191 인용 PDF KSCI

검색결과 666건 처리시간 0.026초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)