Browse > Article
http://dx.doi.org/10.7472/jksii.2014.15.4.09

Bio-mimetic Recognition of Action Sequence using Unsupervised Learning  

Kim, Jin Ok (Faculty of Mobile Contents, Daegu Haany University)
Publication Information
Journal of Internet Computing and Services / v.15, no.4, 2014 , pp. 9-20 More about this Journal
Abstract
Making good predictions about the outcome of one's actions would seem to be essential in the context of social interaction and decision-making. This paper proposes a computational model for learning articulated motion patterns for action recognition, which mimics biological-inspired visual perception processing of human brain. Developed model of cortical architecture for the unsupervised learning of motion sequence, builds upon neurophysiological knowledge about the cortical sites such as IT, MT, STS and specific neuronal representation which contribute to articulated motion perception. Experiments show how the model automatically selects significant motion patterns as well as meaningful static snapshot categories from continuous video input. Such key poses correspond to articulated postures which are utilized in probing the trained network to impose implied motion perception from static views. We also present how sequence selective representations are learned in STS by fusing snapshot and motion input and how learned feedback connections enable making predictions about future input sequence. Network simulations demonstrate the computational capacity of the proposed model for motion recognition.
Keywords
Action sequence detection; Categorical action classification; Bio-inspired Action perception; Unsupervised learning;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 H. Jhuang, T. Serre, L. Wof, T. Poggio, "A Biologically inspired system for Action Recognition," ICCV(Inter. Conf. Com. Vision), 2007, pp. 1-8.
2 R. J. Peters, L. Itti, "Beyond bottom-up : Incorporating task-dependent influences into a computational model of spatial attention," IEEE Conference on Computer Vision and Pattern Recognition, 2007, pp. 1-8.
3 V. Navalpakkam, L, Itti, "An Integrated Model of Top-down and Bottom-up Attention for Optimal Object Detection, CVPR(Computer Vision and Pattern Recognition), 2006, pp. 2049-2056.
4 M. A. Giese, T. Poggio. "Neural mechanisms for the recognition of biological movements." Nat. Rev. Neurosci, vol. 4, pp. 179-192, 2003.   DOI
5 A. Casile, M. A. Giese. "Roles of motion and form in biological motion recognition." Lecture Notes in Computer Science, vol. 2714, pp. 854-862, 2003.
6 H. Jhuang, T. Serre, L. Wolf, T. Poggio. "A biologically inspired system for action recognition." IEEE Intl. Conf. on Computer Vision, 2007, pp. 14-20,
7 K. Schindler, L. Van Gool. "Action snippets: How many frames does human action recognition require?" IEEE Conf. on Computer Vision and Pattern Recognition, 2008, pp. 22-24.
8 M. J. Escobar, G. S. Masson, T. Vieville, P. Kornprobst. "Action recognition using a bio-inspired feedforward spiking network." Intl. Journal of Computer Vision, vol. 83, 2009, pp. 284-301.
9 C. M. Bishop. "Neural Networks for Pattern Recognition." Oxford University Press, 1995.
10 E. H. Adelson, J. Bergen. "Spatiotemporal energy models for the perception of motion," Journal of the Optical Society of America, vol. 2, no. 7, pp. 284-299, 1985.   DOI
11 T. Jellema, D.I. Perrett. "Cells in monkey STS responsive to articulated body motions and consequent static posture: a case of implied motion? "Neuropsychologia, Vol. 41, pp.1728-1737, 2003.   DOI   ScienceOn
12 Z. Kourtzi, N. Kanwisher. "Activation of human MT/MST by static images with implied motion." Journal of Cognitive Neuroscience, vol. 12, pp.48-53, 2000.   DOI
13 J. P. Van Santen, G. Sperling. "Temporal covariance model of human motion perception," Journal of the Optical Society of America, vol. 1, pp. 451, 1984.   DOI
14 R. A. Brooks. "A robot that walks; emergent behaviors from a carefully evolved network," Neural Computation, vol. 1, no. 2, pp. 253-262, 1989.   DOI
15 R. Sekuler, S. N. J. Watamaniuk, R. Blake. "Motion Perception," Steven's Handbook of Experimental Psychology, vol. 1, pp. 121-176, 1998.
16 W. A. Fellez, J. G. Talyor. "Establishing retinotopy by lateral-inhibition type homogeneous neural fields," Neurocomputing, vol. 48, pp. 313-322, 2002.   DOI
17 E. Mingolla. "Neural models of motion integration and segmentation," Neural Networks, vol. 16, pp. 939-945, 2003.   DOI
18 C. Pack, S. Grossberg, E. Mingolla, "A neural model of smooth pursuit control and motion perception by cortical area MST," Technical Report CAS/CNR-TR 99-023, Department of Cognitive and neural systems and Center for Adaptive Systems, MIT, 2000.
19 M. J. Escobar, G. S. Masson, T. Vieville, P. Kornprobst, "Action recognition using a bio-inspired feedforward speaking network," International Journal of Computer vision, vol. 82. no. 3, pp. 284-297, 2009.   DOI
20 D. H. Hubel, T. N. Weisel. "Receptive Fields, binocular iteration and functional architecture in the cats visual cortex, "Journal of Physiology, vol. 160, pp. 106-154, 1962.   DOI
21 A. M. Derrington, G. B. Henning. "Detecting and discriminating the direction of motion of luminance and colour grattings," Visual Research, vol. 33, pp. 799-811, 1993.
22 G. A. Carpenter, S. Grossberg. "Pattern recognition by self-organizing," Neural networks. MIT Press, 1991.
23 Jinok Kim, "Agent's Activities based Intention Recognition Computing", Journal of Korean Internet Society, vol. 13, no. 2, pp. 87-98, 2012.   과학기술학회마을   DOI
24 M. W. Oram, D. I. Perrett. "Integration of form and motion in the anterior superior temporal polysensory area (STPA) of the macaque monkey," Journal of Neurophysiology, vol. 76, no. 1, pp. 109-129, 1996.   DOI
25 J. M. Singer, D. L. Sheinberg, "Temporal cortex neurons encode articulated actions as slow sequences of integrated poses," Journal of Neuroscience, vol. 30, no. 8, pp. 3133-3145, 2008.
26 Jinok Kim, "A Study on Visual Perception based Emotion Recognition using Body-Activity Posture," The KIPS Transactions, Part B, vol. 18, no. 5, pp. 305-314, 2011.   과학기술학회마을   DOI
27 P. Foldiak. "Learning invariances from transformation sequences." Neural Computation, vol. 3, pp. 194-200, 1991.   DOI
28 Jinok Kim, "Effective Pose-based Approach with Pose Estimation for Emotional Action Recognition", The KIPS Transactions: Part B, vol. 2, no. 3, pp. 1-10, 2013.
29 B. McCane, K. Nobings, D. Grannitch, B. Galvin. "On benchmarking optical flow." Computer Vision and Image Understanding, 2003, pp. 126-143,
30 Jeff Hawkins, Sandra Baksesless, "On Intelligence-How a new understanding of the brain will lead to the creation of truly intelligent machines", Owl Books, NY, 2005.