• Title/Summary/Keyword: Object detection and tracking

Speech Activity Detection using Lip Movement Image Signals (입술 움직임 영상 선호를 이용한 음성 구간 검출)

  • Kim, Eung-Kyeu
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.11 no.4
    • /
    • pp.289-297
    • /
    • 2010
  • This paper presents a method that prevents external acoustic noise from being misrecognized as speech during the speech activity detection stage of speech recognition. In addition to acoustic energy, the method checks lip movement image signals. First, successive images are captured with a PC camera, and the presence or absence of lip movement is determined. Next, the lip movement image signal data is stored in shared memory and shared with the speech recognition process. Meanwhile, in the speech activity detection process, which is the preprocessing phase of speech recognition, the data stored in the shared memory is checked to verify whether the acoustic energy comes from a speaker's utterance. In experiments linking the speech recognition processor and the image processor, the speech recognition result was output normally when the speaker faced the camera while speaking, and no result was output when the speaker spoke without facing the camera. In addition, the initial feature values obtained off-line are replaced with values obtained on-line. Similarly, the initial template image captured off-line is replaced with a template image captured on-line, which improves the discrimination of lip movement image tracking. An image processing test bed was implemented to confirm the lip movement image tracking process visually and to analyze the related parameters in real time. With the speech and image processing systems linked, the interworking rate was 99.3% under various illumination environments.
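
The gating idea in this abstract can be illustrated with a minimal sketch: a frame counts as speech only when its acoustic energy exceeds a threshold and lip movement was detected for the same frame. The function name, the per-frame lists, and the threshold are assumptions made for illustration; the paper itself exchanges the lip movement data between processes through shared memory, which this sketch does not model.

```python
def detect_speech_frames(acoustic_energy, lip_moving, energy_threshold):
    """Return one boolean per frame: True only if energy is high AND lips move.

    acoustic_energy: list of per-frame energy values (hypothetical units).
    lip_moving: list of per-frame booleans from the image process.
    energy_threshold: hypothetical energy level separating speech from silence.
    """
    if len(acoustic_energy) != len(lip_moving):
        raise ValueError("both inputs must cover the same frames")
    return [
        energy > energy_threshold and bool(moving)
        for energy, moving in zip(acoustic_energy, lip_moving)
    ]


# Frame 3 is loud but the lips are still, so it is treated as external noise.
frames = detect_speech_frames([0.1, 0.9, 0.8, 0.2],
                              [True, True, False, False],
                              energy_threshold=0.5)
```

In this example only the second frame is accepted as speech, which mirrors the reported behavior of suppressing output when the speaker is not facing the camera.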

Data Fusion and Pursuit-Evasion Simulations for Position Evaluation of Tactical Objects (전술객체 위치 모의를 위한 데이터 융합 및 추적 회피 시뮬레이션)

  • Jin, Seung-Ri;Kim, Seok-Kwon;Son, Jae-Won;Park, Dong-Jo
    • Journal of the Korea Society for Simulation
    • /
    • v.19 no.4
    • /
    • pp.209-218
    • /
    • 2010
  • This study of tactical object representation techniques in a synthetic environment aims to acquire fundamental techniques for detecting and tracking tactical objects and for evaluating the strategic situation in the virtual ground. Acquiring these techniques requires tracking and evaluating the positions of tactical objects, as well as a technique for sharing information between tactical models. In this paper, we study algorithms for sensor data fusion and coordinate conversion, proportional navigation guidance (PNG), and pursuit-evasion techniques for engineering-level and higher-level models. Additionally, we simulate the position evaluation of tactical objects using pursuit and evasion maneuvers between a submarine and a torpedo.
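
As a rough illustration of proportional navigation guidance, the sketch below simulates a planar pursuit in which the pursuer's heading change is the navigation gain times the change in line-of-sight angle. It is a textbook-style simplification, not the paper's model: the target runs straight without evading, and all speeds, positions, the gain, and the capture radius are hypothetical values.

```python
import math


def wrap_angle(angle):
    """Wrap an angle to the interval [-pi, pi]."""
    return math.atan2(math.sin(angle), math.cos(angle))


def simulate_png(pursuer, target, nav_gain=3.0, dt=0.01,
                 max_time=120.0, capture_radius=5.0):
    """Simulate planar proportional navigation against a straight-running target.

    pursuer and target are (x, y, speed, heading) tuples, heading in radians.
    Returns (captured, elapsed_time, min_range).
    """
    px, py, p_speed, p_heading = pursuer
    tx, ty, t_speed, t_heading = target
    los_prev = math.atan2(ty - py, tx - px)  # initial line-of-sight angle
    min_range = math.hypot(tx - px, ty - py)
    t = 0.0
    while t < max_time:
        # Advance both vehicles by one time step.
        px += p_speed * math.cos(p_heading) * dt
        py += p_speed * math.sin(p_heading) * dt
        tx += t_speed * math.cos(t_heading) * dt
        ty += t_speed * math.sin(t_heading) * dt
        t += dt
        rng = math.hypot(tx - px, ty - py)
        min_range = min(min_range, rng)
        if rng <= capture_radius:
            return True, t, min_range
        # PNG law: heading change = gain * change in line-of-sight angle.
        los = math.atan2(ty - py, tx - px)
        p_heading += nav_gain * wrap_angle(los - los_prev)
        los_prev = los
    return False, t, min_range


# Hypothetical scenario: a fast pursuer starts pointed at a slower crossing target.
initial_los = math.atan2(500.0, 1000.0)
captured, elapsed, closest = simulate_png(
    pursuer=(0.0, 0.0, 30.0, initial_los),
    target=(1000.0, 500.0, 10.0, math.pi / 2),
)
```

An evading target could be modeled by changing `t_heading` inside the loop; that extension is left out to keep the sketch short.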

Edge based Interactive Segmentation (경계선 기반의 대화형 영상분할 시스템)

  • Yun, Hyun Joo;Lee, Sang Wook
    • Journal of the Korea Computer Graphics Society
    • /
    • v.8 no.2
    • /
    • pp.15-22
    • /
    • 2002
  • Image segmentation methods partition an image into meaningful regions. For image composition and analysis, it is desirable for the partitioned regions to represent meaningful objects in terms of human perception and manipulation. Despite recent progress in image understanding, however, most segmentation methods mainly employ low-level image features, and it is still highly challenging to automatically segment an image based on high-level meaning suitable for human interpretation. The concept of HCI (Human Computer Interaction) can be applied to operator-assisted image segmentation, in which a human operator guides automatic image processing by interactively supplying critical information about object boundaries. Intelligent Scissors and Snakes have demonstrated the effectiveness of human-assisted segmentation [2] [1]. This paper presents a method for interactive image segmentation for more efficient and effective detection and tracking of object boundaries. The presented method is partly based on the concept of Intelligent Scissors, but employs the well-established Canny edge detector for stable edge detection. It also uses a "sewing method" to include weak edges in object boundaries, and a 5-direction search to link neighboring edges more efficiently and stably than previous methods.

Programming Learning Using Image Detection and Tracking Devices (영상 감지 및 추적장치를 이용한 프로그래밍 학습)

  • Song, Young-A;Kim, Young-Chul
    • Journal of Practical Engineering Education
    • /
    • v.7 no.1
    • /
    • pp.17-23
    • /
    • 2015
  • Interest in software is growing worldwide, and awareness of the need for software education is spreading. Under the strategy for realizing a software-driven society, software training will become mandatory, and this policy will be applied to middle school students starting in 2015. Students who are interested in computers are likely to respond positively, but students who find learning hard may react negatively. If the mandatory education is delivered only as simple practice, students may lose interest before they receive a systematic education, and the education could end up being abandoned entirely. What is needed, therefore, is systematic training from the basics that makes software easy and fun to approach. In this paper, we propose hardware and basic software that make software training easier and more efficient, because they can be modified and applied to programs in the production unit to learn programming tools and programming languages.

Video analysis using re-constructing of motion vectors on MPEG compressed domain (압축영역에서 움직임 벡터의 재추정을 이용한 비디오 해석 기법)

  • Kim, Nak-U;Kim, Tae-Yong;Gang, Eung-Gwan;Choe, Jong-Su
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.39 no.3
    • /
    • pp.78-87
    • /
    • 2002
  • A macroblock (MB) in the MPEG coded domain can have zero, one, or two motion vectors depending on its frame type and prediction direction (forward, backward, or bi-directional). In this paper, we propose a method that converts these motion vectors in the MPEG coded domain into a uniform set, independent of the frame type and the direction of prediction, and directly uses the re-analyzed motion vectors to understand video content. Using this frame-type-independent motion vector, we also propose novel methods for detecting and tracking moving objects in the compressed domain with frame-based detection accuracy. These algorithms operate directly on MPEG bitstreams after VLC decoding and consume little time. Experimental results show the validity and outstanding performance of our methods.
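
The abstract does not give the conversion rule, so the following is only one plausible sketch of what a "uniform set" of motion vectors could look like: each vector is scaled by the distance to its reference frame, backward vectors are sign-flipped, and bi-directional macroblocks average the two results. The function names, the sign convention, and the averaging step are all assumptions, not the authors' algorithm.

```python
def normalize_motion_vector(mv, direction, frame_distance):
    """Scale a motion vector to per-frame displacement in display order.

    mv: (dx, dy) as decoded from the bitstream.
    direction: "forward" or "backward" prediction.
    frame_distance: number of frames between the block and its reference.
    Assumption: flipping the sign of backward vectors aligns both directions.
    """
    if frame_distance <= 0:
        raise ValueError("frame_distance must be positive")
    if direction not in ("forward", "backward"):
        raise ValueError("direction must be 'forward' or 'backward'")
    sign = 1.0 if direction == "forward" else -1.0
    dx, dy = mv
    return (sign * dx / frame_distance, sign * dy / frame_distance)


def uniform_motion_vector(forward=None, backward=None):
    """Merge zero, one, or two vectors of a macroblock into one vector.

    forward / backward: ((dx, dy), frame_distance) or None.
    Returns None for macroblocks without motion vectors (e.g. intra-coded).
    """
    candidates = []
    if forward is not None:
        mv, distance = forward
        candidates.append(normalize_motion_vector(mv, "forward", distance))
    if backward is not None:
        mv, distance = backward
        candidates.append(normalize_motion_vector(mv, "backward", distance))
    if not candidates:
        return None
    count = len(candidates)
    return (sum(c[0] for c in candidates) / count,
            sum(c[1] for c in candidates) / count)
```

With this convention, a forward vector `(6, -3)` over 3 frames and a backward vector `(-4, 2)` over 2 frames both reduce to the same per-frame motion, so a tracker downstream can ignore the frame type.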

Development of Unmanned Video Recording System using Mobile (모바일을 이용한 무인 영상 녹화 시스템 개발)

  • Ahn, Byeongtae
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.6
    • /
    • pp.254-260
    • /
    • 2019
  • Recently, self-recorded video that is generated and distributed in large quantities has been increasing rapidly with the emergence of mobile social networking services (SNS) such as Facebook, Instagram, and Twitter. In particular, SNS access from mobile phones is increasing significantly in terms of usage, number of connections, and usage time. However, self-recording with a smartphone alone is extremely limited, both in the ways it can be used and in how often it is used. In addition, conventional unattended recording systems, which automatically track and record a subject using an infrared signal, are very expensive. This paper therefore develops a low-cost unmanned recording system using a mobile phone. The system consists of a commercial mobile camera, a servomotor that moves the camera from side to side, a microcontroller that controls the motor, and a commercial wireless Bluetooth earset for the video's audio input. It is an unmanned automated system based on a mobile phone, with which anyone can record video of themselves through image tracking.

Image Tracking Based Lane Departure Warning and Forward Collision Warning Methods for Commercial Automotive Vehicle (이미지 트래킹 기반 상용차용 차선 이탈 및 전방 추돌 경고 방법)

  • Kim, Kwang Soo;Lee, Ju Hyoung;Kim, Su Kwol;Bae, Myung Won;Lee, Deok Jin
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.39 no.2
    • /
    • pp.235-240
    • /
    • 2015
  • With the advancement of digital equipment, active safety systems are in demand in the market for medium and heavy duty commercial vehicles over 4.5 tons as well as in the passenger car market. Unlike in a passenger car, the camera in a medium or heavy duty commercial vehicle is mounted relatively high, which is a disadvantage for lane recognition. In this work, we present a lane recognition method that uses Sobel edge detection based on spatial-domain processing, the Hough transform, and color conversion correction. We also propose a low-error method for recognizing vehicles ahead, which reduces detection error by using Haar-like features, AdaBoost, SVM, template matching, and other object recognition methods with a front camera. Vehicle tests verified a lane recognition reliability of over 98%.
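
To make the Sobel edge step concrete, here is a small pure-Python sketch that computes the gradient magnitude of a grayscale image with the standard 3x3 Sobel kernels. It covers only this one stage; the Hough transform and color conversion correction from the abstract are not shown, and the nested-list image format is an assumption chosen to keep the example dependency-free.

```python
import math

# Standard 3x3 Sobel kernels for horizontal and vertical gradients.
SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]


def sobel_magnitude(image):
    """Return the Sobel gradient magnitude of a 2D grayscale image.

    image: list of equal-length rows of numbers. Border pixels are left at 0.0
    because the 3x3 window does not fit there.
    """
    height, width = len(image), len(image[0])
    result = [[0.0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = gy = 0.0
            for j in range(3):
                for i in range(3):
                    pixel = image[y + j - 1][x + i - 1]
                    gx += SOBEL_X[j][i] * pixel
                    gy += SOBEL_Y[j][i] * pixel
            result[y][x] = math.hypot(gx, gy)
    return result


# A vertical step edge, roughly what a lane marking boundary looks like.
step_edge = [[0, 0, 10, 10] for _ in range(4)]
edges = sobel_magnitude(step_edge)
```

For the step edge above, the interior pixels next to the boundary get a magnitude of 40.0, while a uniform image yields zeros everywhere; thresholding such a map gives the edge pixels that a Hough transform would then vote on.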

Research on Human Posture Recognition System Based on The Object Detection Dataset (객체 감지 데이터 셋 기반 인체 자세 인식시스템 연구)

  • Liu, Yan;Li, Lai-Cun;Lu, Jing-Xuan;Xu, Meng;Jeong, Yang-Kwon
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.17 no.1
    • /
    • pp.111-118
    • /
    • 2022
  • In computer vision research, two-dimensional human pose estimation is a broad research direction that is especially significant for pose tracking and behavior recognition. Acquiring human pose targets, which is essentially the problem of accurately identifying human targets in pictures, has been a hot research topic in recent years. Human pose recognition is used both in artificial intelligence and in daily life. The quality of pose recognition is mainly determined by the success rate and accuracy of the recognition process, which is why the recognition rate matters. In this work, the human body is labeled with 17 key points, and the key points are also segmented to ensure the accuracy of the labeling information. In the recognition design, the comprehensive MS COCO data set is used for deep learning: a neural network model is designed and trained on a large number of samples, progressing from simple step-by-step training to efficient training, so that a good accuracy rate can be obtained.
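
The 17 key points mentioned here match the person keypoint annotation used in MS COCO, where each annotation stores a flat list of `x, y, v` triples and `v` is a visibility flag (0 = not labeled, 1 = labeled but not visible, 2 = labeled and visible). The sketch below parses such a list; the helper names are hypothetical, and whether the paper uses exactly this layout is an assumption.

```python
# Keypoint order used by the MS COCO person keypoint annotations.
COCO_KEYPOINT_NAMES = [
    "nose", "left_eye", "right_eye", "left_ear", "right_ear",
    "left_shoulder", "right_shoulder", "left_elbow", "right_elbow",
    "left_wrist", "right_wrist", "left_hip", "right_hip",
    "left_knee", "right_knee", "left_ankle", "right_ankle",
]


def parse_coco_keypoints(flat):
    """Turn a flat [x1, y1, v1, x2, y2, v2, ...] list into a name -> (x, y, v) dict."""
    expected = 3 * len(COCO_KEYPOINT_NAMES)
    if len(flat) != expected:
        raise ValueError(f"expected {expected} values, got {len(flat)}")
    return {
        name: (flat[3 * i], flat[3 * i + 1], flat[3 * i + 2])
        for i, name in enumerate(COCO_KEYPOINT_NAMES)
    }


def visible_keypoints(parsed):
    """Return the names of keypoints that are labeled and visible (v == 2)."""
    return [name for name, (_, _, v) in parsed.items() if v == 2]


# Hypothetical annotation: only the nose is visible, the left shoulder is occluded.
annotation = [0] * 51
annotation[0:3] = [120, 80, 2]
annotation[15:18] = [100, 150, 1]
pose = parse_coco_keypoints(annotation)
```

Keeping the names in a fixed order makes it straightforward to map a network's 17 output channels back to body parts.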

Fault Tolerant Clock Management Scheme in Sensor Networks (센서 네트워크에서 고장 허용 시각 관리 기법)

  • Hwang, So-Young;Baek, Yun-Ju
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.9A
    • /
    • pp.868-877
    • /
    • 2006
  • Sensor network applications such as object tracking, consistent state updates, duplicate detection, and temporal order delivery require highly synchronized time. In addition, reliability and fault tolerance in sophisticated sensor networks have become a critical area of research. In this paper, we propose a fault-tolerant clock management scheme for sensor networks that considers two fault models: network faults and clock faults. The proposed scheme restricts the propagation of synchronization error when nodes have clock faults such as rapid fluctuation or severe changes in drift rate. It also handles topology changes. Simulation results show that, in the presence of faults, the proposed method performs about 1.5 to 2.0 times better than TPSN.
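
For context on the TPSN baseline, sender-receiver protocols of this kind estimate clock offset from a two-way message exchange with four timestamps. The sketch below shows that standard calculation only; it assumes symmetric propagation delay and does not implement the fault-tolerant scheme proposed in the paper. The function and parameter names are illustrative.

```python
def two_way_offset_and_delay(t1, t2, t3, t4):
    """Estimate clock offset and one-way delay from a two-way exchange.

    t1: request sent (sender clock)      t2: request received (receiver clock)
    t3: reply sent (receiver clock)      t4: reply received (sender clock)
    Returns (offset, delay), where offset is receiver clock minus sender clock.
    Assumes the propagation delay is the same in both directions.
    """
    offset = ((t2 - t1) - (t4 - t3)) / 2
    delay = ((t2 - t1) + (t4 - t3)) / 2
    return offset, delay


# Hypothetical exchange: receiver runs 5 units ahead, one-way delay is 2 units.
offset, delay = two_way_offset_and_delay(t1=100, t2=107, t3=110, t4=107)
```

Here the result is an offset of 5.0 and a delay of 2.0. A faulty clock shows up as an implausible jump in the estimated offset, which is the kind of error a fault-tolerant scheme would need to keep from propagating to other nodes.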

Global Time Synchronization for Wireless Sensor Networks (무선 센서 네트워크를 위한 전역 시각 동기 기법)

  • Hwang, So-Young;Yu, Don-Hui;Joo, Jae-Heum;Won, Sung-Hyun
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2010.05a
    • /
    • pp.84-86
    • /
    • 2010
  • Time information and time synchronization are fundamental building blocks in wireless sensor networks, since many sensor network applications need time information for object tracking, consistent state updates, duplicate detection, and temporal order delivery. Various time synchronization protocols have been proposed for sensor networks because of their characteristic constraints, namely limited computing power and resources. However, none of these protocols has been designed with a time representation scheme in mind. A global time format such as UTC TOD (Universal Time Coordinated, Time Of Day) is very useful in sensor network applications. In this paper, we propose a timekeeping and synchronization method for global time presentation in wireless sensor networks.
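
One way to picture a global time representation on a sensor node is to map the node's free-running tick counter to UTC time of day once its offset from UTC is known. The sketch below is an assumption-laden illustration, not the paper's method: the tick rate, the offset value, and the function name are hypothetical, and leap seconds are ignored.

```python
SECONDS_PER_DAY = 86400


def utc_time_of_day(local_ticks, tick_hz, utc_offset_s):
    """Convert a node's tick counter to UTC time of day.

    local_ticks: ticks counted by the node's oscillator since boot.
    tick_hz: oscillator frequency in ticks per second.
    utc_offset_s: UTC seconds corresponding to tick 0, as learned through
                  synchronization (hypothetical input).
    Returns (hours, minutes, seconds).
    """
    if tick_hz <= 0:
        raise ValueError("tick_hz must be positive")
    utc_seconds = local_ticks / tick_hz + utc_offset_s
    time_of_day = utc_seconds % SECONDS_PER_DAY
    hours, remainder = divmod(time_of_day, 3600)
    minutes, seconds = divmod(remainder, 60)
    return int(hours), int(minutes), seconds


# A 32.768 kHz counter that has run for 10 s, synchronized at 01:00:51 UTC.
tod = utc_time_of_day(local_ticks=327680, tick_hz=32768, utc_offset_s=3651.0)
```

The example yields 01:01:01 UTC. Storing only the offset and converting on demand keeps the per-node state small, which suits devices with limited resources.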
