• Title/Summary/Keyword: Video Image Detection Technique

Search Result 95, Processing Time 0.032 seconds

Efficient Traffic Lights Detection and Signal Recognition in Moving Image (동영상에서 교통 신호등 위치 검출 및 신호인식 기법)

  • Oh, Seong;Kim, Jin-soo
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2015.10a
    • /
    • pp.717-719
    • /
    • 2015
  • The research and development of the unmanned vehicle is being carried out actively in domestic and foreign countries. The research is being carried out to provide various services so that the weakness of system such as conventional 2D-based navigation systems can be supplemented and the driving can be safer. This paper suggests the method that enables real-time video processing in more efficient way by realizing the location detection and signal recognition technique of traffic signals in video. In order to overcome the limit of conventional methods that have a difficulty in analyzing the signal as it is sensitive to brightness change, the proposed method realizes the program that grasps the depth data in front of the vehicle using video processing, analyzes the signal by detecting traffic signal and estimates color components of traffic signal in front and the distance between traffic signal and the vehicle.

  • PDF

A Development of Video Monitoring System on Real Time (실시간 영상감시 시스템 개발)

  • Cho, Hyun-Seob
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.8 no.2
    • /
    • pp.240-244
    • /
    • 2007
  • Non-intrusive methods based on active remote IR illumination fur eye tracking is important for many applications of vision-based man-machine interaction. One problem that has plagued those methods is their sensitivity to lighting condition change. This tends to significantly limit their scope of application. In this paper, we present a new real-time eye detection and tracking methodology that works under variable and realistic lighting conditions. Based on combining the bright-pupil effect resulted from IR light and the conventional appearance-based object recognition technique, our method can robustly track eyes when the pupils are not very bright due to significant external illumination interferences. The appearance model is incorporated in both eyes detection and tracking via the use of support vector machine and the mean shift tracking. Additional improvement is achieved from modifying the image acquisition apparatus including the illuminator and the camera.

  • PDF

3D conversion of 2D video using depth layer partition (Depth layer partition을 이용한 2D 동영상의 3D 변환 기법)

  • Kim, Su-Dong;Yoo, Ji-Sang
    • Journal of Broadcast Engineering
    • /
    • v.16 no.1
    • /
    • pp.44-53
    • /
    • 2011
  • In this paper, we propose a 3D conversion algorithm of 2D video using depth layer partition method. In the proposed algorithm, we first set frame groups using cut detection algorithm. Each divided frame groups will reduce the possibility of error propagation in the process of motion estimation. Depth image generation is the core technique in 2D/3D conversion algorithm. Therefore, we use two depth map generation algorithms. In the first, segmentation and motion information are used, and in the other, edge directional histogram is used. After applying depth layer partition algorithm which separates objects(foreground) and the background from the original image, the extracted two depth maps are properly merged. Through experiments, we verify that the proposed algorithm generates reliable depth map and good conversion results.

Research on Local and Global Infrared Image Pre-Processing Methods for Deep Learning Based Guided Weapon Target Detection

  • Jae-Yong Baek;Dae-Hyeon Park;Hyuk-Jin Shin;Yong-Sang Yoo;Deok-Woong Kim;Du-Hwan Hur;SeungHwan Bae;Jun-Ho Cheon;Seung-Hwan Bae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.7
    • /
    • pp.41-51
    • /
    • 2024
  • In this paper, we explore the enhancement of target detection accuracy in the guided weapon using deep learning object detection on infrared (IR) images. Due to the characteristics of IR images being influenced by factors such as time and temperature, it's crucial to ensure a consistent representation of object features in various environments when training the model. A simple way to address this is by emphasizing the features of target objects and reducing noise within the infrared images through appropriate pre-processing techniques. However, in previous studies, there has not been sufficient discussion on pre-processing methods in learning deep learning models based on infrared images. In this paper, we aim to investigate the impact of image pre-processing techniques on infrared image-based training for object detection. To achieve this, we analyze the pre-processing results on infrared images that utilized global or local information from the video and the image. In addition, in order to confirm the impact of images converted by each pre-processing technique on object detector training, we learn the YOLOX target detector for images processed by various pre-processing methods and analyze them. In particular, the results of the experiments using the CLAHE (Contrast Limited Adaptive Histogram Equalization) shows the highest detection accuracy with a mean average precision (mAP) of 81.9%.

A Study on the Improvement of Image-Based Water Level Detection Algorithm Using the Region growing (Region growing 기법을 적용한 영상기반 수위감지 알고리즘 개선에 대한 연구)

  • Kim, Okju;Lee, Junwoo;Park, Jinyi;Cho, Myeongheum
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.5_4
    • /
    • pp.1245-1254
    • /
    • 2020
  • In this study, the limitations of the existing water level detection algorithm using CCTV images were recognized and the water level detection algorithm was improved by applying the Region growing technique. It applied three techniques (Horizontal projection profile, Texture analysis, and Optical flow) to estimate the water area, and the results were analyzed in a comprehensive analysis to select the initial water area. The water level was then continuously detected by the Region growing technique, referring to the initial water area. As a result, it was possible to confirm that the exact level of water was detected without being affected by environmental factors compared to the existing level detection algorithm, which had frequent mis-detection phenomena depending on the surrounding environmental factors. In addition, the water level was detected in the video showing flooded roads in urban areas, not in the video of the river. These results are believed to be able to supplement the difficulty of monitoring at all times with limited manpower by automatically detecting the level of water through numerous CCTV footage installed throughout the country, and to contribute to laying the foundation for preventing disasters caused by torrential rains and typhoons in advance.

Data Augmentation for Tomato Detection and Pose Estimation (토마토 위치 및 자세 추정을 위한 데이터 증대기법)

  • Jang, Minho;Hwang, Youngbae
    • Journal of Broadcast Engineering
    • /
    • v.27 no.1
    • /
    • pp.44-55
    • /
    • 2022
  • In order to automatically provide information on fruits in agricultural related broadcasting contents, instance image segmentation of target fruits is required. In addition, the information on the 3D pose of the corresponding fruit may be meaningfully used. This paper represents research that provides information about tomatoes in video content. A large amount of data is required to learn the instance segmentation, but it is difficult to obtain sufficient training data. Therefore, the training data is generated through a data augmentation technique based on a small amount of real images. Compared to the result using only the real images, it is shown that the detection performance is improved as a result of learning through the synthesized image created by separating the foreground and background. As a result of learning augmented images using images created using conventional image pre-processing techniques, it was shown that higher performance was obtained than synthetic images in which foreground and background were separated. To estimate the pose from the result of object detection, a point cloud was obtained using an RGB-D camera. Then, cylinder fitting based on least square minimization is performed, and the tomato pose is estimated through the axial direction of the cylinder. We show that the results of detection, instance image segmentation, and cylinder fitting of a target object effectively through various experiments.

Implementation of Motion Detection based on Extracting Reflected Light using 3-Successive Video Frames (3개의 연속된 프레임을 이용한 반사된 빛 영역추출 기반의 동작검출 알고리즘 구현)

  • Kim, Chang Min;Lee, Kyu Woong
    • KIISE Transactions on Computing Practices
    • /
    • v.22 no.3
    • /
    • pp.133-138
    • /
    • 2016
  • Motion detection algorithms based on difference image are classified into background subtraction and previous frame subtraction. 1) Background subtraction is a convenient and effective method for detecting foreground objects in a stationary background. However in real world scenarios, especially outdoors, this restriction, (i.e., stationary background) often turns out to be impractical since the background may not be stable. 2) Previous frame subtraction is a simple technique for detecting motion in an image. The difference between two frames depends upon the amount of motion that occurs from one frame to the next. Both these straightforward methods fail when the object moves very "slightly and slowly". In order to efficiently deal with the problem, in this paper we present an algorithm for motion detection that incorporates "reflected light area" and "difference image". This reflected light area is generated during the frame production process. It processes multiplex difference image and AND-arithmetic of bitwise. This process incorporates the accuracy of background subtraction and environmental adaptability of previous frame subtraction and reduces noise generation. Also, the performance of the proposed method is demonstrated by the performance assessment of each method using Gait database sample of CASIA.

Panorama Background Generation and Object Tracking using Pan-Tilt-Zoom Camera (Pan-Tilt-Zoom 카메라를 이용한 파노라마 배경 생성과 객체 추적)

  • Paek, In-Ho;Im, Jae-Hyun;Park, Kyoung-Ju;Paik, Jun-Ki
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.45 no.3
    • /
    • pp.55-63
    • /
    • 2008
  • This paper presents a panorama background generation and object tracking technique using a Pan-Tilt-Zoom camera. The proposed method estimates local motion vectors rapidly using phase correlation matching at the prespecified multiple local regions, and it makes minimized estimation error by vector quantization. We obtain the required image patches, by estimating the overlapped region using local motion vectors, we can then project the images to cylinder and realign the images to make the panoramic image. The object tracking is performed by extracting object's motion and by separating foreground from input image using background subtraction. The proposed PTZ-based object tracking method can efficiently generated a stable panorama background, which covers up to 360 degree FOV The proposed algorithm is designed for real-time implementation and it can be applied to many commercial applications such as object shape detection and face recognition in various surveillance video systems.

An image-based deep learning network technique for structural health monitoring

  • Lee, Dong-Han;Koh, Bong-Hwan
    • Smart Structures and Systems
    • /
    • v.28 no.6
    • /
    • pp.799-810
    • /
    • 2021
  • When monitoring the structural integrity of a bridge using data collected through accelerometers, identifying the profile of the load exerted on the bridge from the vehicles passing over it becomes a crucial task. In this study, the speed and location of vehicles on the deck of a bridge is reconfigured using real-time video to implicitly associate the load applied to the bridge with the response from the bridge sensors to develop an image-based deep learning network model. Instead of directly measuring the load that a moving vehicle exerts on the bridge, the intention in the proposed method is to replace the correlation between the movement of vehicles from CCTV images and the corresponding response by the bridge with a neural network model. Given the framework of an input-output-based system identification, CCTV images secured from the bridge and the acceleration measurements from a cantilevered beam are combined during the process of training the neural network model. Since in reality, structural damage cannot be induced in a bridge, the focus of the study is on identifying local changes in parameters by adding mass to a cantilevered beam in the laboratory. The study successfully identified the change in the material parameters in the beam by using the deep-learning neural network model. Also, the method correctly predicted the acceleration response of the beam. The proposed approach can be extended to the structural health monitoring of actual bridges, and its sensitivity to damage can also be improved through optimization of the network training.

Comparison Analysis of Four Face Swapping Models for Interactive Media Platform COX (인터랙티브 미디어 플랫폼 콕스에 제공될 4가지 얼굴 변형 기술의 비교분석)

  • Jeon, Ho-Beom;Ko, Hyun-kwan;Lee, Seon-Gyeong;Song, Bok-Deuk;Kim, Chae-Kyu;Kwon, Ki-Ryong
    • Journal of Korea Multimedia Society
    • /
    • v.22 no.5
    • /
    • pp.535-546
    • /
    • 2019
  • Recently, there have been a lot of researches on the whole face replacement system, but it is not easy to obtain stable results due to various attitudes, angles and facial diversity. To produce a natural synthesis result when replacing the face shown in the video image, technologies such as face area detection, feature extraction, face alignment, face area segmentation, 3D attitude adjustment and facial transposition should all operate at a precise level. And each technology must be able to be interdependently combined. The results of our analysis show that the difficulty of implementing the technology and contribution to the system in facial replacement technology has increased in facial feature point extraction and facial alignment technology. On the other hand, the difficulty of the facial transposition technique and the three-dimensional posture adjustment technique were low, but showed the need for development. In this paper, we propose four facial replacement models such as 2-D Faceswap, OpenPose, Deekfake, and Cycle GAN, which are suitable for the Cox platform. These models have the following features; i.e. these models include a suitable model for front face pose image conversion, face pose image with active body movement, and face movement with right and left side by 15 degrees, Generative Adversarial Network.