• 제목/요약/키워드: Human-object Interaction Detection

Search Result 26, Processing Time 0.027 seconds

Human-Object Interaction Detection Data Augmentation Using Image Concatenation (이미지 이어붙이기를 이용한 인간-객체 상호작용 탐지 데이터 증강)

  • Sang-Baek Lee;Kyu-Chul Lee
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.2
    • /
    • pp.91-98
    • /
    • 2023
  • Human-object interaction(HOI) detection requires both object detection and interaction recognition, and requires a large amount of data to learn a detection model. Current opened dataset is insufficient in scale for training model enough. In this paper, we propose an easy and effective data augmentation method called Simple Quattro Augmentation(SQA) and Random Quattro Augmentation(RQA) for human-object interaction detection. We show that our proposed method can be easily integrated into State-of-the-Art HOI detection models with HICO-DET dataset.

Three-stream network with context convolution module for human-object interaction detection

  • Siadari, Thomhert S.;Han, Mikyong;Yoon, Hyunjin
    • ETRI Journal
    • /
    • v.42 no.2
    • /
    • pp.230-238
    • /
    • 2020
  • Human-object interaction (HOI) detection is a popular computer vision task that detects interactions between humans and objects. This task can be useful in many applications that require a deeper understanding of semantic scenes. Current HOI detection networks typically consist of a feature extractor followed by detection layers comprising small filters (eg, 1 × 1 or 3 × 3). Although small filters can capture local spatial features with a few parameters, they fail to capture larger context information relevant for recognizing interactions between humans and distant objects owing to their small receptive regions. Hence, we herein propose a three-stream HOI detection network that employs a context convolution module (CCM) in each stream branch. The CCM can capture larger contexts from input feature maps by adopting combinations of large separable convolution layers and residual-based convolution layers without increasing the number of parameters by using fewer large separable filters. We evaluate our HOI detection method using two benchmark datasets, V-COCO and HICO-DET, and demonstrate its state-of-the-art performance.

Brain Dynamics and Interactions for Object Detection and Basic-level Categorization (물체 탐지와 범주화에서의 뇌의 동적 움직임 추적)

  • Kim, Ji-Hyun;Kwon, Hyuk-Chan;Lee, Yong-Ho
    • Proceedings of the Korean Society for Emotion and Sensibility Conference
    • /
    • 2009.05a
    • /
    • pp.219-222
    • /
    • 2009
  • Rapid object recognition is one of the main stream research themes focusing to reveal how human recognizes object and interacts with environment in natural world. This field of study is of consequence in that it is highly important in evolutionary perspective to quickly see the external objects and judge their characteristics to plan future reactions. In this study, we investigated how human detect natural scene objects and categorize them in a limited time frame. We applied Magnetoencepahlogram (MEG) while participants were performing detection (e.g. object vs. texture) or basic-level categorization (e.g. cars vs. dogs) tasks to track the dynamic interaction in human brain for rapid object recognition process. The results revealed that detection and categorization involves different temporal and functional connections that correlated for the successful recognition process as a whole. These results imply that dynamics in the brain are important for our interaction with environment. The implication from this study can be further extended to investigate the effect of subconscious emotional factors on the dynamics of brain interactions during the rapid recognition process.

  • PDF

Moving object detection for biped walking robot flatfrom (이족로봇 플랫폼을 위한 동체탐지)

  • Kang, Tae-Koo;Hwang, Sang-Hyun;Kim, Dong-Won;Park, Gui-Tae
    • Proceedings of the KIEE Conference
    • /
    • 2006.10c
    • /
    • pp.570-572
    • /
    • 2006
  • This paper discusses the method of moving object detection for biped robot walking. Most researches on vision based object detection have mostly focused on fixed camera based algorithm itself. However, developing vision systems for biped walking robot is an important and urgent issue since hired walking robots are ultimately developed not only for researches but to be utilized in real life. In the research, method for moving object detection has been developed for task assignment and execution of biped robot as well as for human robot interaction (HRI) system. But these methods are not suitable to biped walking robot. So, we suggest the advanced method which is suitable to biped walking robot platform. For carrying out certain tasks, an object detecting system using modified optical flow algorithm by wireless vision camera is implemented in a biped walking robot.

  • PDF

Estimating Interest Levels based on Visitor Behavior Recognition Towards a Guide Robot (안내 로봇을 향한 관람객의 행위 인식 기반 관심도 추정)

  • Ye Jun Lee;Juhyun Kim;Eui-Jung Jung;Min-Gyu Kim
    • The Journal of Korea Robotics Society
    • /
    • v.18 no.4
    • /
    • pp.463-471
    • /
    • 2023
  • This paper proposes a method to estimate the level of interest shown by visitors towards a specific target, a guide robot, in spaces where a large number of visitors, such as exhibition halls and museums, can show interest in a specific subject. To accomplish this, we apply deep learning-based behavior recognition and object tracking techniques for multiple visitors, and based on this, we derive the behavior analysis and interest level of visitors. To implement this research, a personalized dataset tailored to the characteristics of exhibition hall and museum environments was created, and a deep learning model was constructed based on this. Four scenarios that visitors can exhibit were classified, and through this, prediction and experimental values were obtained, thus completing the validation for the interest estimation method proposed in this paper.

Tangible AR Interaction based on Fingertip Touch Using Small-Sized Markers (소형 마커를 이용한 손가락 터치 기반 감각형 증강현실 상호작용 방안)

  • Jung, Ho-Kyun;Park, Hyungjun
    • Korean Journal of Computational Design and Engineering
    • /
    • v.18 no.5
    • /
    • pp.374-383
    • /
    • 2013
  • Various interaction techniques have been studied for providing the feeling of touch and improve immersion in augmented reality (AR) environments. Tangible AR interaction exploiting two types (product-type and pointer-type) of simple objects has earned great interest for cost-effective design evaluation of digital handheld products. When the sizes of markers attached to the objects are kept big to obtain better marker recognition, the pointer-type object frequently and significantly occludes the product-type object, which deteriorates natural visualization and level of immersion in an AR environment. In this paper, in order to overcome such problems, we propose tangible AR interaction using fingertip touch combined with small-sized markers. The proposed approach facilitates the use of convex polygons to recover the boundaries of AR markers which are partially occluded. It also properly enlarges the pattern area of each AR marker to reduce the sizes of AR markers without sacrificing the quality of marker detection. We empirically verified the quality of the proposed approach, and applied it in the process of design evaluation of digital products. From experimental results, we found that the approach is comparably accurate enough to be applied to the design evaluation process and tangible enough to provide a pseudo feeling of manipulating virtual products with human hands.

Automatic Detecting and Tracking Algorithm of Joint of Human Body using Human Ratio (인체 비율을 이용한 인체의 조인트 자동 검출 및 객체 추적 알고리즘)

  • Kwak, Nae-Joung;Song, Teuk-Seob
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.4
    • /
    • pp.215-224
    • /
    • 2011
  • There have been studying many researches to detect human body and to track one with increasing interest on human and computer interaction. In this paper, we propose the algorithm that automatically extracts joints, linked points of human body, using the ratio of human body under single camera and tracks object. The proposed method gets the difference images of the grayscale images and ones of the hue images between input image and background image. Then the proposed method composes the results, splits background and foreground, and extracts objects. Also we standardize the ratio of human body using face' length and the measurement of human body and automatically extract joints of the object using the ratio and the corner points of the silhouette of object. After then, we tract the joints' movement using block-matching algorithm. The proposed method is applied to test video to be acquired through a camera and the result shows that the proposed method automatically extracts joints and effectively tracks the detected joints.

Worker Accountability in Computer Vision for Construction Productivity Measurement: A Systematic Review

  • Mik Wanul KHOSIIN;Jacob J. LIN;Chuin-Shan CHEN
    • International conference on construction engineering and project management
    • /
    • 2024.07a
    • /
    • pp.775-782
    • /
    • 2024
  • This systematic review comprehensively analyzes the application of computer vision in construction productivity measurement and emphasizes the importance of worker accountability in construction sites. It identifies a significant gap in the connection level between input (resources) and output data (products or progress) of productivity monitoring, a factor not adequately addressed in prior research. The review highlights three fundamental groups: input, output, and connection groups. Object detection, tracking, pose, and activity recognition, as the input stage, are essential for identifying characteristics and worker movements. The output phase will mostly focus on progress monitoring, and understanding the interaction of workers with other entities will be discussed in the connection groups. This study offers four research future research directions for the worker accountability monitoring process, such as human-object interaction (HOI), generative AI, location-based management systems (LBMS), and robotic technologies. The successful accountability monitoring will secure the accuracy of productivity measurement and elevate the competitiveness of the construction industry.

Automatic Detecting of Joint of Human Body and Mapping of Human Body using Humanoid Modeling (인체 모델링을 이용한 인체의 조인트 자동 검출 및 인체 매핑)

  • Kwak, Nae-Joung;Song, Teuk-Seob
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.4
    • /
    • pp.851-859
    • /
    • 2011
  • In this paper, we propose the method that automatically extracts the silhouette and the joints of consecutive input image, and track joints to trace object for interaction between human and computer. Also the proposed method presents the action of human being to map human body using joints. To implement the algorithm, we model human body using 14 joints to refer to body size. The proposed method converts RGB color image acquired through a single camera to hue, saturation, value images and extracts body's silhouette using the difference between the background and input. Then we automatically extracts joints using the corner points of the extracted silhouette and the data of body's model. The motion of object is tracted by applying block-matching method to areas around joints among all image and the human's motion is mapped using positions of joints. The proposed method is applied to the test videos and the result shows that the proposed method automatically extracts joints and effectively maps human body by the detected joints. Also the human's action is aptly expressed to reflect locations of the joints

A Design of Marker-Based Augmented Reality System Structure using Object Removal Technique (객체 제거 기법을 활용한 마커기반 증강현실 시스템 구조 설계)

  • Kim, Dong-Hyun;Jung, Sung-Mo;Lim, Ji-Hoon;Cagalaban, Giovanni;Leem, Hyo-Young;Geun, So-Geol;Kim, Su-U;Kim, Seok-Soo
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2011.10a
    • /
    • pp.621-624
    • /
    • 2011
  • Recently, augmented reality is divided into broadly marker based and markerless based as part of HCI (Human Computer Interaction). Markerless based is augmented object using natural features in real-world environment. On the other hand, Marker based is use to calculate easily the coordinates and exactly augmented object using flat marker of rectangular. However, marker-based image is provided due to the presence of the marker in a markerless, giving users a more less realistic and immersive view. In this paper, We research about combined diminished reality and augmented reality for Marker-Based Augmented Reality System Structure using Object Removal Tchnique in order to increase realistic and immersive view.

  • PDF