DOI QR코드

DOI QR Code

Silhouette-Edge-Based Descriptor for Human Action Representation and Recognition

  • Odoyo, Wilfred O. (Department of Computer Engineering, Chosun University) ;
  • Choi, Jae-Ho (Department of Computer Information, Songwon University) ;
  • Moon, In-Kyu (Department of Computer Engineering, Chosun University) ;
  • Cho, Beom-Joon (Department of Computer Engineering, Chosun University)
  • Received : 2012.11.02
  • Accepted : 2013.02.08
  • Published : 2013.06.30

Abstract

Extraction and representation of postures and/or gestures from human activities in videos have been a focus of research in this area of action recognition. With various applications cropping up from different fields, this paper seeks to improve the performance of these action recognition machines by proposing a shape-based silhouette-edge descriptor for the human body. Information entropy, a method to measure the randomness of a sequence of symbols, is used to aid the selection of vital key postures from video frames. Morphological operations are applied to extract and stack edges to uniquely represent different actions shape-wise. To classify an action from a new input video, a Hausdorff distance measure is applied between the gallery representations and the query images formed from the proposed procedure. The method is tested on known public databases for its validation. An effective method of human action annotation and description has been effectively achieved.

Keywords

References

  1. A. Agarwal and B. Triggs, "3D human pose from silhouettes by relevance vector regression," in Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, Washington: DC, pp. 882-888, 2004.
  2. Lv. Fengjun and R. Nevatia, "Single view human action recognition using key pose matching and Viterbi path searching," in Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis: MN, pp. 1-8, 2007.
  3. P. Dollar, V. Rabaud, G. Cottrell, and S. Belongie, "Behavior recognition via sparse spatio-temporal features," in Proceedings of the 2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, Beijing, China, pp. 65-72, 2005.
  4. Y. Ke and R. Sukthankar, "PCA-SIFT: a more distinctive representation for local image descriptors," in Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, Washington: DC, pp. 506-513, 2004.
  5. M. J. Black, "Explaining optical flow events with parameterized spatio-temporal models," in Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, Fort Collins: CO, pp. 326-332, 1999.
  6. N. Dala and B. Triggs, "Histograms of oriented gradients for human detection," in Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, San Diego: CA, pp. 886-893, 2005.
  7. C. E. Shannon, "A mathematical theory of communication," Bell System Technical Journal, vol. 27, pp. 379-423, 1948. https://doi.org/10.1002/j.1538-7305.1948.tb01338.x
  8. S. Belongie, J. Malik, and J. Puzicha, "Shape matching and object recognition using shape contexts," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 4, pp. 509-522, 2002. https://doi.org/10.1109/34.993558
  9. D. P. Hutternlocher, G. A. Klanderman, and W. J. Rucklidge, "Comparing images using the Hausdorff distance," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 9, pp. 850-863, 1993. https://doi.org/10.1109/34.232073
  10. X. W. Chen and T. Huang, "Facial expression recognition: a clustering-based approach," Pattern Recognition Letter, vol. 24, no. 9-10, pp. 1295-1302, 2003. https://doi.org/10.1016/S0167-8655(02)00371-9
  11. R. C. Gonzalez, R. E. Woods, and S. L. Eddins, Digital Image Processing Using MATLAB. India, Dorling Kindersley, 2004.
  12. M. P. Dubuisson and A. K. Jain, "A modified Hausdorff distance for object matching," in Proceedings of the 12th IAPR International Conference on Pattern Recognition, Jerusalem, Israel, pp. 566-568, 1994.
  13. D. G. Sim, O. K. Kwon, and R. H. Park, "Object matching algorithms using robust Hausdorff distance measures," in IEEE Transactions on Image Processing, vol. 8, no. 3, pp. 425-428, 1999. https://doi.org/10.1109/83.748897
  14. M. Blank, L. Gorelick, E. Shechtman, M. Irani, and R. Basri, "Actions as space-time shapes," in Proceeding of the 10th IEEE Conference on Computer Vision, Beijing, China, pp. 1395-1402, 2005.
  15. G. B. Lee, W. O. Odoyo, J. N. Yeom, and B. J. Cho, "Extraction of key postures using shape contexts," in Proceeding of the 11th IEEE Conference on Advanced Communication Technology, Phoenix Park, Ireland, pp. 1311-1314, 2009.
  16. N. Ikizler and P. Duygulu, "Histogram of oriented rectangles: a new pose descriptor for human action recognition," Image and Vision Computing, vol. 27, no. 10, pp. 1515-1526, 2009. https://doi.org/10.1016/j.imavis.2009.02.002
  17. W. O. Odoyo, "Silhouette edge-based log-polar descriptor for human action representation and recognition," Ph.D. dissertation, Chosun University, Gwangju, Korea, 2012.

Cited by

  1. Fall-Detection Algorithm Using 3-Axis Acceleration: Combination with Simple Threshold and Hidden Markov Model vol.2014, pp.1687-0042, 2014, https://doi.org/10.1155/2014/896030