• 제목/요약/키워드: motion descriptor

검색결과 38건 처리시간 0.024초

An Automatic Camera Tracking System for Video Surveillance

  • Lee, Sang-Hwa;Sharma, Siddharth;Lin, Sang-Lin;Park, Jong-Il
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2010년도 하계학술대회
    • /
    • pp.42-45
    • /
    • 2010
  • This paper proposes an intelligent video surveillance system for human object tracking. The proposed system integrates the object extraction, human object recognition, face detection, and camera control. First, the object in the video signals is extracted using the background subtraction. Then, the object region is examined whether it is human or not. For this recognition, the region-based shape descriptor, angular radial transform (ART) in MPEG-7, is used to learn and train the shapes of human bodies. When it is decided that the object is human or something to be investigated, the face region is detected. Finally, the face or object region is tracked in the video, and the pan/tilt/zoom (PTZ) controllable camera tracks the moving object with the motion information of the object. This paper performs the simulation with the real CCTV cameras and their communication protocol. According to the experiments, the proposed system is able to track the moving object(human) automatically not only in the image domain but also in the real 3-D space. The proposed system reduces the human supervisors and improves the surveillance efficiency with the computer vision techniques.

  • PDF

실시간 파노라마 영상 합성을 위한 고속 특징점 추적 기법 (Fast Keypoint Tracking for Panoramic Image Alignment)

  • 이현준;심학준;이상화;조남익;이상욱
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2010년도 하계학술대회
    • /
    • pp.40-41
    • /
    • 2010
  • 실시간 파노라마 영상 합성 기술을 모바일 콘텐츠에 적용하기 위해서는 모바일 디바이스에 적합한 영상 합성 방식을 정의해야 한다. 특징점을 추출하여 연속적인 영상들 사이의 유사성을 찾아내는 방식 [1]은 소모되는 연산비용을 고려할 때 모바일 디바이스에 응용하기 부적합하다. 따라서 기존의 접근방식에 비해 소요되는 연산비용을 줄이고, 성능은 유지하는 기법이 필요하다. 본 논문에서는 특징점의 표현자(descriptor)를 생성하지 않고 특징점(feature point) 주변의 움직임을 추정(motion estimation)하여 연속적인 영상의 위치를 찾아내는 기법을 제안한다.

  • PDF

방범용 CCTV를 위한 배회행위 탐지 솔루션 (Loitering Detection Solution for CCTV Security System)

  • 강주형;곽수영
    • 한국멀티미디어학회논문지
    • /
    • 제17권1호
    • /
    • pp.15-25
    • /
    • 2014
  • 본 논문에서는 지능형 감시 시스템을 위해 공간적 확률 분포와 방향 서술자를 이용하여 다양한 배회행위를 검출하는 방법을 제안한다. 적응적 배경 모델링 기법을 이용하여 움직이는 객체를 검출하고, 검출된 객체로부터 움직임의 정보를 추출한다. 추출된 객체의 움직임 정보는 이동 궤적과 방향에 대해 특징벡터를 생성한다. 생성된 특징벡터는 k-Nearest Neighbor를 통해 최종적으로 배회행위를 검출하게 된다. 제안한 방법을 실내외 다양한 환경에서 테스트하여 배회 행위를 검출하는 결과를 나타내었으며 이는 실시간으로 검출되는 것을 확인하였다.

Human Action Recognition Using Pyramid Histograms of Oriented Gradients and Collaborative Multi-task Learning

  • Gao, Zan;Zhang, Hua;Liu, An-An;Xue, Yan-Bing;Xu, Guang-Ping
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제8권2호
    • /
    • pp.483-503
    • /
    • 2014
  • In this paper, human action recognition using pyramid histograms of oriented gradients and collaborative multi-task learning is proposed. First, we accumulate global activities and construct motion history image (MHI) for both RGB and depth channels respectively to encode the dynamics of one action in different modalities, and then different action descriptors are extracted from depth and RGB MHI to represent global textual and structural characteristics of these actions. Specially, average value in hierarchical block, GIST and pyramid histograms of oriented gradients descriptors are employed to represent human motion. To demonstrate the superiority of the proposed method, we evaluate them by KNN, SVM with linear and RBF kernels, SRC and CRC models on DHA dataset, the well-known dataset for human action recognition. Large scale experimental results show our descriptors are robust, stable and efficient, and outperform the state-of-the-art methods. In addition, we investigate the performance of our descriptors further by combining these descriptors on DHA dataset, and observe that the performances of combined descriptors are much better than just using only sole descriptor. With multimodal features, we also propose a collaborative multi-task learning method for model learning and inference based on transfer learning theory. The main contributions lie in four aspects: 1) the proposed encoding the scheme can filter the stationary part of human body and reduce noise interference; 2) different kind of features and models are assessed, and the neighbor gradients information and pyramid layers are very helpful for representing these actions; 3) The proposed model can fuse the features from different modalities regardless of the sensor types, the ranges of the value, and the dimensions of different features; 4) The latent common knowledge among different modalities can be discovered by transfer learning to boost the performance.

Human Action Recognition Via Multi-modality Information

  • Gao, Zan;Song, Jian-Ming;Zhang, Hua;Liu, An-An;Xue, Yan-Bing;Xu, Guang-Ping
    • Journal of Electrical Engineering and Technology
    • /
    • 제9권2호
    • /
    • pp.739-748
    • /
    • 2014
  • In this paper, we propose pyramid appearance and global structure action descriptors on both RGB and depth motion history images and a model-free method for human action recognition. In proposed algorithm, we firstly construct motion history image for both RGB and depth channels, at the same time, depth information is employed to filter RGB information, after that, different action descriptors are extracted from depth and RGB MHIs to represent these actions, and then multimodality information collaborative representation and recognition model, in which multi-modality information are put into object function naturally, and information fusion and action recognition also be done together, is proposed to classify human actions. To demonstrate the superiority of the proposed method, we evaluate it on MSR Action3D and DHA datasets, the well-known dataset for human action recognition. Large scale experiment shows our descriptors are robust, stable and efficient, when comparing with the-state-of-the-art algorithms, the performances of our descriptors are better than that of them, further, the performance of combined descriptors is much better than just using sole descriptor. What is more, our proposed model outperforms the state-of-the-art methods on both MSR Action3D and DHA datasets.

Antiblurry Dejitter Image Stabilization Method of Fuzzy Video for Driving Recorders

  • Xiong, Jing-Ying;Dai, Ming;Zhao, Chun-Lei;Wang, Ruo-Qiu
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제11권6호
    • /
    • pp.3086-3103
    • /
    • 2017
  • Video images captured by vehicle cameras often contain blurry or dithering frames due to inadvertent motion from bumps in the road or by insufficient illumination during the morning or evening, which greatly reduces the perception of objects expression and recognition from the records. Therefore, a real-time electronic stabilization method to correct fuzzy video from driving recorders has been proposed. In the first stage of feature detection, a coarse-to-fine inspection policy and a scale nonlinear diffusion filter are proposed to provide more accurate keypoints. Second, a new antiblurry binary descriptor and a feature point selection strategy for unintentional estimation are proposed, which brought more discriminative power. In addition, a new evaluation criterion for affine region detectors is presented based on the percentage interval of repeatability. The experiments show that the proposed method exhibits improvement in detecting blurry corner points. Moreover, it improves the performance of the algorithm and guarantees high processing speed at the same time.

내용기반 비디오 검색을 위한 MPEG-7 비주얼 디스크립터 분석 (The Analysis of Visual Descriptors for Content-based Video Retrieval)

  • 김성희
    • 한국비블리아학회지
    • /
    • 제16권2호
    • /
    • pp.157-175
    • /
    • 2005
  • 본 논문의 목적은 멀티미디어 내용을 표현하기 위해 제정된 MPEG-7의 비주얼 디스크립터를 설명하고 분석하기 위한 것이다. MPEG-7의 비주얼 디스크립터는 컬러, 모양, 질감, 모션으로 구성되어 있으며 이들 각각의 요소들에 대해 예제 및 응용분야와 더불어서 자세히 설명하고 있다. 마지막으로 본 연구에서는 이들 비주얼 디스크립터에 대한 전반적인 응용분야를 기술하고 있다. 이러한 비주얼 디스크립터들은 다양한 모양의 멀티미디어 내용을 인간의 개입 없이 자동으로 풍부하고 상세하게 표현할 수 있으며 내용 표현의 일관성을 유지하게 함으로써 멀티미디어 검색의 성능을 향상시킬 뿐 아니라 시스템간의 상호운영성을 향상 시킬 수 있을 것으로 기대된다.

  • PDF

감시 영상에서 군중의 탈출 행동 검출 (Detection of Crowd Escape Behavior in Surveillance Video)

  • 박준욱;곽수영
    • 한국통신학회논문지
    • /
    • 제39C권8호
    • /
    • pp.731-737
    • /
    • 2014
  • 본 논문에서는 감시 카메라 환경에서 발생할 수 있는 군중의 비정상 행동 검출 방법을 제안한다. 군중들의 비정상 행동을 산발적으로 퍼지면서 뛰는 행동, 한쪽 방향으로 갑자기 뛰는 행동 두 가지로 정의하였다. 이를 검출하기 위하여 영상에서 움직임 벡터를 추출하여 군중의 비정상 행동 검출에 적합한 서술자 MHOF(Multi-scale Histogram of Optical Flow)와 DCHOF(Directional Change Histogram of Optical Flow)제안하였으며, 이를 이진 분류기인 SVM(Support Vector Machine)을 이용하여 검출하였다. 제안한 방법은 공개 데이터셋인 UMN 데이터와 PETS 2009 데이터를 이용하여 성능을 평가하였고 다른 방법론과의 비교를 통해 제안하는 알고리즘의 우수성을 입증하였다.