• 제목/요약/키워드: Learning Object

검색결과 1,565건 처리시간 0.024초

TOD: Trash Object Detection Dataset

  • Jo, Min-Seok;Han, Seong-Soo;Jeong, Chang-Sung
    • Journal of Information Processing Systems
    • /
    • 제18권4호
    • /
    • pp.524-534
    • /
    • 2022
  • In this paper, we produce Trash Object Detection (TOD) dataset to solve trash detection problems. A well-organized dataset of sufficient size is essential to train object detection models and apply them to specific tasks. However, existing trash datasets have only a few hundred images, which are not sufficient to train deep neural networks. Most datasets are classification datasets that simply classify categories without location information. In addition, existing datasets differ from the actual guidelines for separating and discharging recyclables because the category definition is primarily the shape of the object. To address these issues, we build and experiment with trash datasets larger than conventional trash datasets and have more than twice the resolution. It was intended for general household goods. And annotated based on guidelines for separating and discharging recyclables from the Ministry of Environment. Our dataset has 10 categories, and around 33K objects were annotated for around 5K images with 1280×720 resolution. The dataset, as well as the pre-trained models, have been released at https://github.com/jms0923/tod.

Joint frame rate adaptation and object recognition model selection for stabilized unmanned aerial vehicle surveillance

  • Gyu Seon Kim;Haemin Lee;Soohyun Park;Joongheon Kim
    • ETRI Journal
    • /
    • 제45권5호
    • /
    • pp.811-821
    • /
    • 2023
  • We propose an adaptive unmanned aerial vehicle (UAV)-assisted object recognition algorithm for urban surveillance scenarios. For UAV-assisted surveillance, UAVs are equipped with learning-based object recognition models and can collect surveillance image data. However, owing to the limitations of UAVs regarding power and computational resources, adaptive control must be performed accordingly. Therefore, we introduce a self-adaptive control strategy to maximize the time-averaged recognition performance subject to stability through a formulation based on Lyapunov optimization. Results from performance evaluations on real-world data demonstrate that the proposed algorithm achieves the desired performance improvements.

Autonomous pothole detection using deep region-based convolutional neural network with cloud computing

  • Luo, Longxi;Feng, Maria Q.;Wu, Jianping;Leung, Ryan Y.
    • Smart Structures and Systems
    • /
    • 제24권6호
    • /
    • pp.745-757
    • /
    • 2019
  • Road surface deteriorations such as potholes have caused motorists heavy monetary damages every year. However, effective road condition monitoring has been a continuing challenge to road owners. Depth cameras have a small field of view and can be easily affected by vehicle bouncing. Traditional image processing methods based on algorithms such as segmentation cannot adapt to varying environmental and camera scenarios. In recent years, novel object detection methods based on deep learning algorithms have produced good results in detecting typical objects, such as faces, vehicles, structures and more, even in scenarios with changing object distances, camera angles, lighting conditions, etc. Therefore, in this study, a Deep Learning Pothole Detector (DLPD) based on the deep region-based convolutional neural network is proposed for autonomous detection of potholes from images. About 900 images with potholes and road surface conditions are collected and divided into training and testing data. Parameters of the network in the DLPD are calibrated based on sensitivity tests. Then, the calibrated DLPD is trained by the training data and applied to the 215 testing images to evaluate its performance. It is demonstrated that potholes can be automatically detected with high average precision over 93%. Potholes can be differentiated from manholes by training and applying a manhole-pothole classifier which is constructed using the convolutional neural network layers in DLPD. Repeated detection of the same potholes can be prevented through feature matching of the newly detected pothole with previously detected potholes within a small region.

Precision Analysis of NARX-based Vehicle Positioning Algorithm in GNSS Disconnected Area

  • Lee, Yong;Kwon, Jay Hyoun
    • 한국측량학회지
    • /
    • 제39권5호
    • /
    • pp.289-295
    • /
    • 2021
  • Recently, owing to the development of autonomous vehicles, research on precisely determining the position of a moving object has been actively conducted. Previous research mainly used the fusion of GNSS/IMU (Global Positioning System / Inertial Navigation System) and sensors attached to the vehicle through a Kalman filter. However, in recent years, new technologies have been used to determine the location of a moving object owing to the improvement in computing power and the advent of deep learning. Various techniques using RNN (Recurrent Neural Network), LSTM (Long Short-Term Memory), and NARX (Nonlinear Auto-Regressive eXogenous model) exist for such learning-based positioning methods. The purpose of this study is to compare the precision of existing filter-based sensor fusion technology and the NARX-based method in case of GNSS signal blockages using simulation data. When the filter-based sensor integration technology was used, an average horizontal position error of 112.8 m occurred during 60 seconds of GNSS signal outages. The same experiment was performed 100 times using the NARX. Among them, an improvement in precision was confirmed in approximately 20% of the experimental results. The horizontal position accuracy was 22.65 m, which was confirmed to be better than that of the filter-based fusion technique.

YOLO 기반의 광학 음악 인식 기술 및 가상현실 콘텐츠 제작 방법 (YOLO based Optical Music Recognition and Virtual Reality Content Creation Method)

  • 오경민;홍요섭;백건영;전찬준
    • 스마트미디어저널
    • /
    • 제10권4호
    • /
    • pp.80-90
    • /
    • 2021
  • 딥러닝에 기반한 광학 음악 인식 기술(Optical Music Recognition, OMR)을 사용하여 도출된 결과를 가상현실 (Virtual Reality, VR) 게임에 적용시킨 것을 제안한다. 딥러닝 모델은 YOLO v5를 사용했으며 검출되지 않은 객체를 검출하기 위해 Hough transform 사용, 보표 크기 수정 등을 수행한다. 출력된 결과 파일을 사용하여 VR 게임에서 BPM, 최대 콤보 수, 음정과 박자를 분석하여 사용하고 리소스 관리를 위한 Object Pooling 기술을 통해 노트가 밀리는 현상을 방지한다. 광학 음악 인식 기술을 통해 나온 음악 요소로 VR 게임을 제작하여 VR 콘텐츠 제공과 함께 광학 음악 인식의 활용성을 넓히는 것을 확인하였다.

회전 영상 기반 다면 영상 데이터셋 구축 방법 (Multi-faceted Image Dataset Construction Method Based on Rotational Images.)

  • 김지성;허경용;장시웅
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2021년도 추계학술대회
    • /
    • pp.75-77
    • /
    • 2021
  • 딥러닝 기술을 통해 영상 내의 객체를 찾아내기 위해서는 학습을 위한 영상 데이터셋이 필요하다. 객체의 인식률을 높이기 위해서는 많은 양의 영상 학습 데이터가 필요하다. 많은 양의 데이터셋을 구축하는 데에는 많은 비용이 들기 때문에 개인이 구축하기에 어려움이 있다. 본 논문에서는 회전 영상을 촬영하여 객체의 여러 면을 포함하는 영상 데이터셋을 보다 손쉽게 구축하는 방법을 소개한다. 회전판 위에 객체를 올려둔 뒤 촬영하고 촬영된 영상을 필요에 맞게 분할, 합성하여 데이터셋을 구축하는 방법을 제안한다.

  • PDF

화장품 물체 인식을 위한 Two-Stage 딥러닝 기반 알고리즘 (Two-Stage Deep Learning Based Algorithm for Cosmetic Object Recognition)

  • 김종민;서대호
    • 산업경영시스템학회지
    • /
    • 제46권4호
    • /
    • pp.101-106
    • /
    • 2023
  • With the recent surge in YouTube usage, there has been a proliferation of user-generated videos where individuals evaluate cosmetics. Consequently, many companies are increasingly utilizing evaluation videos for their product marketing and market research. However, a notable drawback is the manual classification of these product review videos incurring significant costs and time. Therefore, this paper proposes a deep learning-based cosmetics search algorithm to automate this task. The algorithm consists of two networks: One for detecting candidates in images using shape features such as circles, rectangles, etc and Another for filtering and categorizing these candidates. The reason for choosing a Two-Stage architecture over One-Stage is that, in videos containing background scenes, it is more robust to first detect cosmetic candidates before classifying them as specific objects. Although Two-Stage structures are generally known to outperform One-Stage structures in terms of model architecture, this study opts for Two-Stage to address issues related to the acquisition of training and validation data that arise when using One-Stage. Acquiring data for the algorithm that detects cosmetic candidates based on shape and the algorithm that classifies candidates into specific objects is cost-effective, ensuring the overall robustness of the algorithm.

안내 로봇을 향한 관람객의 행위 인식 기반 관심도 추정 (Estimating Interest Levels based on Visitor Behavior Recognition Towards a Guide Robot)

  • 이예준;김주현;정의정;김민규
    • 로봇학회논문지
    • /
    • 제18권4호
    • /
    • pp.463-471
    • /
    • 2023
  • This paper proposes a method to estimate the level of interest shown by visitors towards a specific target, a guide robot, in spaces where a large number of visitors, such as exhibition halls and museums, can show interest in a specific subject. To accomplish this, we apply deep learning-based behavior recognition and object tracking techniques for multiple visitors, and based on this, we derive the behavior analysis and interest level of visitors. To implement this research, a personalized dataset tailored to the characteristics of exhibition hall and museum environments was created, and a deep learning model was constructed based on this. Four scenarios that visitors can exhibit were classified, and through this, prediction and experimental values were obtained, thus completing the validation for the interest estimation method proposed in this paper.

차량 탑재형 상·하역 장비의 설계와 딥러닝 객체 인식을 이용한 자동제어 방법 (Design of Vehicle-mounted Loading and Unloading Equipment and Autonomous Control Method using Deep Learning Object Detection)

  • 이순교;김선목;우효원;이석;이기백
    • 로봇학회논문지
    • /
    • 제19권1호
    • /
    • pp.79-91
    • /
    • 2024
  • Large warehouses are building automation systems to increase efficiency. However, small warehouses, military bases, and local stores are unable to introduce automated logistics systems due to lack of space and budget, and are handling tasks manually, failing to improve efficiency. To solve this problem, this study designed small loading and unloading equipment that can be mounted on transportation vehicles. The equipment can be controlled remotely and is automatically controlled from the point where pallets loaded with cargo are visible using real-time video from an attached camera. Cargo recognition and control command generation for automatic control are achieved through a newly designed deep learning model. This model is designed to be optimized for loading and unloading equipment and mission environments based on the YOLOv3 structure. The trained model recognized 10 types of palettes with different shapes and colors with an average accuracy of 100% and estimated the state with an accuracy of 99.47%. In addition, control commands were created to insert forks into pallets without failure in 14 scenarios assuming actual loading and unloading situations.

반복 반점화: 지각적 모호성이 물체 재인에 미치는 영향 (Repetition Antipriming: The Effects of Perceptual Ambiguity on Object Recognition)

  • 김구태;이도준
    • 인지과학
    • /
    • 제21권4호
    • /
    • pp.603-625
    • /
    • 2010
  • 물체의 신경 표상은 시각 피질 전반에 걸쳐 분산되어 있고 다른 물체의 신경 표상과 중첩된 형태로 유지된다. 따라서 특정 물체를 반복적으로 접하면 그 물체의 재인은 촉진되지만 다른 물체의 재인은 상대적으로 손상될 수 있다. 이러한 현상을 각각 반복 점화(priming)와 반점화(antipriming)라고 한다. 본 연구는 반복 반점화 즉, 반복된 물체 자체가 반점화 될 가능성을 검증하였다. 학습 단계에서 참가자들은 다양한 수준으로 화질이 손상된 물체 사진들의 재인 수준을 평정하였고 검사 단계에서는 정상적인 화질의 물체 사진을 보고 범주 판단 과제를 실시하였다. 그 결과, 실험 1과 2에서 모두 학습 단계에서 쉽게 재인되었던 물체는 검사 단계에서 더 효율적으로 처리되었지만(반복 점화), 반대로 학습 단계에서 지각적으로 모호했던 물체는 검사 단계에서 비효율적으로 처리되었다(반복 반점화). 이러한 결과는 지각적으로 모호한 물체를 경험할 때 세부특징에 관한 감각 표상과 다수의 물체 표상들 간의 연결이 강화되어 후속 재인 과정을 방해하기 때문인 것으로 추정된다.

  • PDF