Integrated Search | Korea Science

Extensible Hierarchical Method of Detecting Interactive Actions for Video Understanding

Moon, Jinyoung;Jin, Junho;Kwon, Yongjin;Kang, Kyuchang;Park, Jongyoul;Park, Kyoung
- ETRI Journal / Vol. 39, No. 4 / pp. 502-513 / 2017
For video understanding, namely analyzing who did what in a video, actions along with objects are primary elements. Most studies on actions have handled recognition problems for a well-trimmed video and focused on enhancing their classification performance. However, action detection, including localization as well as recognition, is required because, in general, actions intersect in time and space. In addition, most studies have not considered extensibility for a newly added action that has been previously trained. Therefore, proposed in this paper is an extensible hierarchical method for detecting generic actions, which combine object movements and spatial relations between two objects, and inherited actions, which are determined by the related objects through an ontology and rule based methodology. The hierarchical design of the method enables it to detect any interactive actions based on the spatial relations between two objects. The method using object information achieves an F-measure of 90.27%. Moreover, this paper describes the extensibility of the method for a new action contained in a video from a video domain that is different from the dataset used.
https://doi.org/10.4218/etrij.17.0116.0054
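
The F-measure quoted in the abstract above is conventionally the harmonic mean of precision and recall. The following is a minimal sketch of that calculation, assuming counts of true positives, false positives, and false negatives are available. The function name `f_measure` and the example counts are hypothetical and are not taken from the paper.

```python
def f_measure(tp: int, fp: int, fn: int, beta: float = 1.0) -> float:
    """F-beta score from detection counts (beta=1 gives the usual F1).

    tp, fp, fn are hypothetical counts of true positives, false positives,
    and false negatives; returns 0.0 when the score is undefined.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Illustrative counts only: 9 correct detections, 1 false alarm, 1 miss.
score = f_measure(tp=9, fp=1, fn=1)
```

With these illustrative counts, precision and recall are both 0.9, so the F-measure is also 0.9.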

A Study on the Recognition of Cooks about the Kitchen Ventilation - Focusing on the Seoul Area -

반주원;허준
- 한국조리학회지 / Vol. 11, No. 4 / pp. 228-244 / 2005
This study examines the health of cooks and their impressions of the kitchen environment, and assesses how well cooks recognize the elements of kitchen ventilation and kitchen ventilation equipment. We surveyed 385 cooks working in the kitchens of special-grade hotels and family restaurants using a 5-point scale and analyzed the responses. The results are as follows: (1) The most frequently reported health complaint among cooks was pain in the shoulder and neck. (2) Regarding the kitchen environment, respondents most often cited high temperature and insufficient ventilation. (3) Cooks rated the importance of kitchen ventilation very highly. (4) A very high percentage recognized combustion gas as the element in the kitchen that most affects the human body and should be removed first. (5) Most cooks recognized that the ventilation equipment needs improvement. (6) The purpose of ventilation equipment was seen as maintaining a comfortable kitchen environment. (7) For optimum operation, the kitchen ventilation equipment should be run by an automatic system and kept at its optimum. This research builds on preceding studies and investigates special-grade hotels and family restaurants in Seoul. Kitchen ventilation planning should aim to improve the health and productivity of cooks.

An intelligent sensor controller of mobile robot for object recognition in an indoor known environment

정태철;박종석;현웅근
- 한국정보통신학회논문지 / Vol. 9, No. 7 / pp. 1479-1484 / 2005
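This paper describes an intelligent sensor control system for localization and object recognition for a mobile robot. The developed sensor system uses a low-cost optical PSD (position sensitive detector). PSD sensors have the advantages of being inexpensive and lightweight, but they are very noisy. To remove this noise effectively, this paper proposes a hardware filter and a software filter. It also proposes an improved Hough transform algorithm for line-segment-based map building and a navigation algorithm for a mobile robot in indoor environments. The developed system was verified through experiments.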
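
The abstract above mentions a Hough transform for line-based map building but does not give its details. For orientation only, here is a minimal sketch of the standard (unimproved) Hough transform applied to 2D range points. This is not the authors' improved algorithm, and the function name `hough_best_line`, the parameters `n_theta` and `rho_step`, and the sample points are all hypothetical.

```python
import math
from collections import Counter


def hough_best_line(points, n_theta=180, rho_step=1.0):
    """Return (theta, rho, votes) for the strongest line through 2D points.

    Standard Hough transform: every point votes for each discretized
    (theta, rho) pair satisfying rho = x*cos(theta) + y*sin(theta).
    theta is in radians in [0, pi); rho is quantized to rho_step.
    """
    votes = Counter()
    for x, y in points:
        for i in range(n_theta):
            theta = math.pi * i / n_theta
            rho = x * math.cos(theta) + y * math.sin(theta)
            votes[(i, round(rho / rho_step))] += 1
    (i, r), count = votes.most_common(1)[0]
    return math.pi * i / n_theta, r * rho_step, count


# Hypothetical sensor readings: ten points on the wall x = 5 plus one outlier.
wall = [(5.0, float(y)) for y in range(10)] + [(0.0, 0.0)]
theta, rho, count = hough_best_line(wall)
```

Because the accumulator is coarse, several neighboring angle bins can tie for the maximum, so a practical implementation would typically refine the peak or merge nearby cells.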

Development of Python-based Annotation Tool Program for Constructing Object Recognition Deep-Learning Model

임송원;박구만
- 방송공학회논문지 / Vol. 25, No. 3 / pp. 386-398 / 2020
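In this paper, we developed an annotation tool that provides, in a single program, the data labeling process needed to build an object recognition deep-learning model. The program's interface uses Python's standard GUI library, and a crawler function enables real-time data collection. Using Retinanet, an existing object recognition deep-learning model, we implemented a function that provides annotation information automatically. In addition, so that training can follow the labeling formats of various object recognition networks, the tool saves labels in different training-data labeling formats such as Pascal-VOC, YOLO, and Retinanet. Using the proposed approach, we built an image dataset of Korean domestic vehicles, trained existing object recognition deep-learning networks such as Retinanet and YOLO on it, and measured the accuracy. Distinguishing the vehicle model in real time in video of entering vehicles reached an accuracy of about 94%.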
https://doi.org/10.5909/JBE.2020.25.3.386
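
The abstract above says the tool saves labels in several formats, including Pascal-VOC and YOLO. As an illustration of what such a conversion involves, here is a minimal sketch assuming the common conventions: Pascal VOC boxes as absolute pixel corners (xmin, ymin, xmax, ymax) and YOLO boxes as center and size normalized by image dimensions. The function names and sample values are hypothetical and are not taken from the paper's tool.

```python
def voc_to_yolo(xmin, ymin, xmax, ymax, img_w, img_h):
    """Convert pixel-corner box to normalized (x_center, y_center, w, h)."""
    x_c = (xmin + xmax) / 2 / img_w
    y_c = (ymin + ymax) / 2 / img_h
    w = (xmax - xmin) / img_w
    h = (ymax - ymin) / img_h
    return x_c, y_c, w, h


def yolo_to_voc(x_c, y_c, w, h, img_w, img_h):
    """Inverse conversion back to pixel corners (xmin, ymin, xmax, ymax)."""
    xmin = (x_c - w / 2) * img_w
    ymin = (y_c - h / 2) * img_h
    xmax = (x_c + w / 2) * img_w
    ymax = (y_c + h / 2) * img_h
    return xmin, ymin, xmax, ymax


def yolo_label_line(class_id, box):
    """Format one line of a YOLO-style label file: class id plus four floats."""
    return f"{class_id} " + " ".join(f"{v:.6f}" for v in box)


# Hypothetical box in a 400x500 image.
box = voc_to_yolo(100, 50, 300, 250, img_w=400, img_h=500)
line = yolo_label_line(0, box)
```

Keeping the two conversions as inverses of each other makes it straightforward to check that a label survives a round trip between formats.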

A Study on Designing Metadata Standard for Building AI Training Dataset of Landmark Images

김진묵
- 한국문헌정보학회지 / Vol. 54, No. 2 / pp. 419-434 / 2020
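The purpose of this study is to propose a metadata standard design for building AI training data of landmark images. To this end, we comprehensively surveyed and analyzed the state of the art in the types of image retrieval systems and their respective indexing methods, and surveyed the public training datasets essential to landmark recognition using AI machine learning as well as machine learning tools for image object recognition. Based on this, we selected metadata elements optimized for landmark image AI training data and defined the input data for each element. The conclusion and suggestions discuss ways to develop application services, including a recommendation system that uses landmark recognition.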
https://doi.org/10.4275/KSLIS.2020.54.2.419

An Optical Character Recognition Method using a Smartphone Gyro Sensor for Visually Impaired Persons

권순각;김흥준
- 한국산업정보학회논문지 / Vol. 21, No. 4 / pp. 13-20 / 2016
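In modern society, a smartphone can implement an optical character recognition (OCR) system using its built-in high-resolution camera. Characters recognized by the OCR system can also be delivered to visually impaired persons as a voice service through TTS. However, even photographing an object that contains text with a smartphone camera is somewhat difficult for visually impaired persons, because they cannot see the captured image of the subject. To solve this problem, this paper proposes a method that uses the smartphone's gyro sensor to guide visually impaired persons toward taking photographs correctly. In simulation experiments with the implemented program, the proposed method was confirmed to recognize more characters from the same object.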
https://doi.org/10.9723/jksiis.2016.21.4.013

An intelligent sensor controller of mobile robot for object recognition in an indoor known environment

정태철;박종석;현웅근
- 한국정보통신학회: Conference Proceedings / 한국해양정보통신학회 2005 Fall General Conference / pp. 191-194 / 2005
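This paper describes an intelligent sensor control system for localization and object recognition for a mobile robot. The developed sensor system uses a low-cost optical PSD (position sensitive detector). PSD sensors have the advantages of being inexpensive and lightweight, but they are very noisy. To remove this noise effectively, this paper proposes a hardware filter and a software filter. It also proposes an improved Hough transform algorithm for line-segment-based map building and a navigation algorithm for a mobile robot in indoor environments. The developed system was verified through experiments.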

Detecting Numeric and Character Areas of Low-quality License Plate Images using YOLOv4 Algorithm

이정환
- 디지털산업정보학회논문지 / Vol. 18, No. 4 / pp. 1-11 / 2022
Recently, research on license plate recognition, a core technology of intelligent transportation systems (ITS), has been actively conducted. In this paper, we propose a method to extract numbers and characters from low-quality license plate images by applying the YOLOv4 algorithm. YOLOv4 is a one-stage object detection method that uses a convolutional neural network comprising BACKBONE, NECK, and HEAD parts. It detects objects in real time, unlike earlier two-stage object detection methods such as Faster R-CNN. We studied a method that directly extracts number and character regions from low-quality license plate images without additional edge detection or image segmentation steps. To evaluate the performance of the proposed method, we experimented with 500 license plate images, using 350 images for training and the remaining 150 for testing. Computer simulations show that the mean average precision of detecting number and character regions on vehicle license plates was about 93.8%.
https://doi.org/10.17662/ksdim.2022.18.4.001

Development of recognition and alert system for dangerous road object using deep learning algorithms

김중완;조현준;황보욱;정준호;최종건;윤태진
- 한국컴퓨터정보학회: Conference Proceedings / 한국컴퓨터정보학회 Proceedings of the 66th Summer Conference, 2022, Vol. 30, No. 2 / pp. 479-480 / 2022
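On roads where vehicles travel at high speed, stopped vehicles and fallen objects cause serious accidents, so countermeasures are required. A suddenly stopped vehicle is unpredictable, and although patrols are organized to collect fallen objects periodically, immediate response is difficult. To address this problem, this paper developed a system that applies deep-learning real-time object recognition to recognize stopped vehicles and fallen objects on the road and to provide information about them. It was implemented by applying YOLOX, a real-time object recognition algorithm, and deepSORT, a real-time object tracking algorithm, on a desktop PC. The developed system provides recognition results for stopped vehicles and fallen objects. Because the system can be applied to footage from already-installed CCTV, it can be expected to recognize hazardous road situations over a wide area at low cost.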

Web-based 3D Object Retrieval from User-drawn Sketch Query

송종헌;주재호;윤상민
- 정보과학회 논문지 / Vol. 41, No. 10 / pp. 838-846 / 2014
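With the development of touch-based smart devices, multimedia retrieval techniques based on sketches that users draw with a pen or finger have attracted great attention in computer vision, computer graphics, pattern recognition, and HCI. However, existing retrieval systems based on text information are limited in accurately retrieving the multimedia data a user wants. Research is therefore needed on content-based multimedia retrieval, which searches using the information contained in the multimedia itself. This paper proposes a web-based system, using the Hybrid Edge Descriptor (HED), that lets a user retrieve 3D models from a sketch. The system builds HED descriptors through global/local histogram analysis of suggestive contour images projected from 3D models in various directions and of user-drawn sketch images, which makes 3D model retrieval robust to rotation and translation.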
https://doi.org/10.5626/JOK.2014.41.10.838

714 search results (processing time: 0.035 seconds)