Efficient Representation and Matching of Object Movement using Shape Sequence Descriptor

Choi, Min-Seok;

doi:10.3745/KIPSTB.2008.15-B.5.391

The KIPS Transactions:PartB (정보처리학회논문지B)

Volume 15B Issue 5
/
Pages.391-396
/
2008
/
1598-284X(pISSN)

Korea Information Processing Society (한국정보처리학회)

DOI QR Code

Efficient Representation and Matching of Object Movement using Shape Sequence Descriptor

모양 시퀀스 기술자를 이용한 효과적인 동작 표현 및 검색 방법

Choi, Min-Seok

최민석 (삼육대학교 경영정보학과)

Published : 2008.10.31

https://doi.org/10.3745/KIPSTB.2008.15-B.5.391 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

Motion of object in a video clip often plays an important role in characterizing the content of the clip. A number of methods have been developed to analyze and retrieve video contents using motion information. However, most of these methods focused more on the analysis of direction or trajectory of motion but less on the analysis of the movement of an object itself. In this paper, we propose the shape sequence descriptor to describe and compare the movement based on the shape deformation caused by object motion along the time. A movement information is first represented a sequence of 2D shape of object extracted from input image sequence, and then 2D shape information is converted 1D shape feature using the shape descriptor. The shape sequence descriptor is obtained from the shape descriptor sequence by frequency transform along the time. Our experiment results show that the proposed method can be very simple and effective to describe the object movement and can be applicable to semantic applications such as content-based video retrieval and human movement recognition.

동영상에서 객체의 움직임은 동영상의 내용을 분석하는데 중요한 요소로 작용한다. 따라서 움직임 정보를 이용하여 동영상 내용을 분석하고 검색하기 위한 많은 방법들이 제안되었다. 그러나 대부분의 방법들은 객체 자체의 동작 보다는 움직임의 방향이나 경로를 분석하는 쪽으로 치중되었다. 본 논문에서는 객체의 움직임에 의한 모양 변화를 이용하여 객체의 동작을 표현하고 비교하기 위한 모양 시퀀스 기술자(descriptor)를 제안한다. 객체의 움직임 정보는 입력된 이미지 시퀀스에서 객체 영역을 추출하여 연속된 2차원 모양 정보로 표현되고, 각각의 2차원 모양 정보는 모양 기술자를 이용하여 1차원 모양 특징 값으로 변환된다. 순서에 따라 배열된 모양 기술자들을 시간 축으로 주파수 변환한 후 저주파영역의 계수를 취하여 모양 시퀀스 기술자를 얻게 된다. 실험을 통하여 제안된 방법이 객체의 동작 정보를 매우 효과적으로 표현 및 비교 가능하여 내용 기반 동영상 검색, 동작 인식 등의 인지적 관점의 움직임 분석 응용에 적용 가능함을 보였다.

Keywords

References

S. F. Chang et al, “A Fully Automated Content-Based Video Search Engine Supporting Multi-Objects Spatio-Temporal Queries,” IEEE Transaction on Circuit and Systems for Video Technology, Vol.8, No.5, pp.602-615, 1998 https://doi.org/10.1109/76.718507
Y. P Tan, S. R. Kulkarni and P. J Ramadge, “Rapid estimation of camera motion from compressed video with application to video annotation,” IEEE Transaction on Circuit and Systems for Video Technology, Vol.10, No.1, pp.133-146, 2000 https://doi.org/10.1109/76.825867
J. Aggarwal and Q. Cai, “Human Motion Analysis: A review,” Computer Vision and Image Understanding, Vol.73, No.3, pp.428-440, 1999 https://doi.org/10.1006/cviu.1998.0744
Aron F. Bobick and James W. Davis, “The Recognition of Human Movement Using Temporal Templates,” IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol.23, No.3, pp.257-267, 2001 https://doi.org/10.1109/34.910878
J. A. Webb and J. K. Aggarwal, Visually interpreting the motion of objects in space, IEEE Computer, Vol.14, No.8, pp.40-46, 1981 https://doi.org/10.1109/C-M.1981.220561
F. J. Perales and J. Torres, “A system for human motion matching between synthetic and real image based on a biomechanic graphical model,” Proceedings of IEEE Computer Society Workshop on Motion of Non-Rigid and Articulated Objects, pp.83-88, November 1994 https://doi.org/10.1109/MNRAO.1994.346263
J. Rehg and T. Kanade, “Model-Based Tracking of Self-Occluding Articulated Objects,” Proceedings of Int'l Conference on Computer Vision, pp.612-617, August 1995 https://doi.org/10.1109/ICCV.1995.466882
L. Goncalves, E. DiBernardo, E. Ursella and P. Perona, “Monocular Tracking of the Human Arm in 3D,” Proceedings of Int'l Conference on Computer Vision, pp.764-770, August 1995 https://doi.org/10.1109/ICCV.1995.466861
D. Hogg, “Model-Based Vision: A Paradigm to See a Walking Person,” Image and Vision Computing, Vol.1, No.1, pp.5-20, 1983 https://doi.org/10.1016/0262-8856(83)90003-3
K. Rohr, “Towards Model-Based Recognition of Human Movements in Image Sequences,” CVGIP, Image Understanding, Vol.59, No.1, pp.94-115, 1994 https://doi.org/10.1006/ciun.1994.1006
K. Akita, “Image Sequence Analysis of Real World Human Motion,” Pattern Recognition, Vol.17, No.1, pp.73-83, 1984 https://doi.org/10.1016/0031-3203(84)90036-0
T. Darrell and A. Pentland, “Space-Time Gestures,” Proceedings of Computer Vision and Pattern Recognition, pp.335-340, 1993 https://doi.org/10.1109/CVPR.1993.341109
Y. Cui, D. Swets and J. Weng, “Learning-Based Hand Sign Recognition Using Shoslif-m,” Proceedings of Int'l Conference on Computer Vision, pp.631-636, August 1995 https://doi.org/10.1109/ICCV.1995.466879
J. Yamato, J. Ohya and K. Ishii, “Recognizing Human Action in Time Sequential Images Using Hidden Markov Models,” Proceedings of Computer Vision and Pattern Recognition, pp.379-385, 1992 https://doi.org/10.1109/CVPR.1992.223161
A. Wilson and A. Bobick, “Learning Visual Behavior for Gesture Analysis,” Proceedings of IEEE Int'l. Symposium on Computer Vision, November 1995 https://doi.org/10.1109/ISCV.1995.477006
J. Little and J. Boyd, “Describing Motion for Recognition,” Proceedings of IEEE Int'l Symposium on Computer Vision, pp.235-240, November 1995 https://doi.org/10.1109/ISCV.1995.477007
R. Polana and R. Nelson, “Low Level Recognition of Human Motion,” Proceedings of IEEE Workshop Non- Rigid and Articulated Motion, pp.77-82, 1994 https://doi.org/10.1109/MNRAO.1994.346251
E. Shavit and A. Jepson, “Motion Understanding Using Phase Portraits,” Proceedings of IJCAI Workshop: Looking at People, pp.101-108, 1993
J. M. Siskind, “Grounding Language in Perception,” Artificial Intelligence Rev., Vol.8, pp.371-391, 1995 https://doi.org/10.1007/BF00849726
Y. Yacoob and L. Davis, “Recognizing Human Facial Expressions Form Long Image Sequences Using Optical Flow,” IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol.18, No.6, pp.636-642, 1996 https://doi.org/10.1109/34.506414
I. Essa and A. Pentland, “Coding, Analysis, Interpretation, and Recognition of Facial Expressions,” IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol.19, No.7, pp.757-763, 1997 https://doi.org/10.1109/34.598232
Ashok Veeraraghavan, Amit K. Roy-Chowdhury and Rama Chellappa, “Matching Shape Sequences in Video with Applications in Human Movement Analysis,” IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol.27, No.12, pp.1896-1909, 2005 https://doi.org/10.1109/TPAMI.2005.246
MPEG-7 Visual Group, “Text of ISO/IEC 15938-3/FDIS Information technology - Multimedia content description interface - Part 3 Visual”, ISO/IEC JTC1/ SC29/WG11 N4358, Sydney, July 2001
B. S. Manjunath, Philippe Salembier and Thomas Sikora, Introduction to MPEG-7: multimedia content description interface, West Sussex, England: John Wiley & Sons, 2002
Min-seok Choi and Whoi-yul Kim, “The description and retrieval of a sequence of moving objects using Shape Variation Map,” Patten Recognition Letters, Vol.25, issue 12, pp.1369-1375, 2004 https://doi.org/10.1016/j.patrec.2004.05.010
MPEG-7 Visual Group, “Descriptor of Core Experiments for MPEG-7 Color/Texture Descriptors,” ISO/IEC JTC1/ SC29/WG11 N2929, Melbourne, October 1999

The KIPS Transactions:PartB (정보처리학회논문지B)

Efficient Representation and Matching of Object Movement using Shape Sequence Descriptor

모양 시퀀스 기술자를 이용한 효과적인 동작 표현 및 검색 방법

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)