비디오기반 행동인식 연구 동향

  • Published : 2017.08.25

Abstract

Keywords

References

  1. I. Lillo et al. "Sparse composition of body poses and atomic actions for human activity recognition in RGB-D videos," Image and Vision Computing, vol. 59, pp. 63-75, 2017. https://doi.org/10.1016/j.imavis.2016.11.004
  2. H.S. Min et al. "Sparse representation-based human action recognition using an action region-aware dictionary," EEEISM, 2013.
  3. I. Laptev et al. "On Space-Time Interest Points," Int. J. of Computer Vision, vol 64, pp.107-123, 2005. https://doi.org/10.1007/s11263-005-1838-7
  4. P. Dollar, et al. "Behavior recognition via sparse spatio-temporal features," VS-PETS, 2005.
  5. G. Willems, et al. "An efficient dense and scale invariant spatio-temporal interest point detector," ECCV, 2008.
  6. I. Laptev et al. "Learning realistic human actions from movies," CVPR, 2008.
  7. A. Klaeseret al. "A spatio-temporal descriptor based on 3D-gradients," BMVC 2008.
  8. H. Wang et al. "Evaluation of local spatio-temporal features for action recognition," BMVC, 2009.
  9. V. Delaitre et al. "Recognizing human action in still images: a study of bag-of-features and partial-based representations," BMVC, 2010.
  10. 홍준혁 외 "가중치 기반 Bag-of-Feature와 앙상블 결정트리를 이용한 정지 영상에서의 인간행동 인식," 한국통신학회논문지, 2013.
  11. L. Breiman "Random forests," Machine Learning, vol. 45, pp. 5-32, 2001. https://doi.org/10.1023/A:1010933404324
  12. K. Simonyan et al. "Two-stream convolutional networks for action recognition in videos," NIPS, 2014.
  13. S. Ji et al. "3d convolutional neural networks for human action recognition," IEEE Transaction PAMI, vol. 35, pp.221-231, 2013. https://doi.org/10.1109/TPAMI.2012.59
  14. D. Tran et al. "Learning spatiotemporal features with 3D convolutional networks," ICCV, 2015.
  15. A. Alahi, "Social LSTM: Human Trajectory Prediction in Crowded Spaces," CVPR, 2016.
  16. S. Hochreiter and J. Schmidhuber, "Long Short-Term Memory," Neural Computation, vol.9, pp. 1735-1780, 1997. https://doi.org/10.1162/neco.1997.9.8.1735
  17. J. Donahue et al, "Long-term Recurrent Convolutional Networks for Visual Recognition and Description," Berkeley Tech. Report, 2014.
  18. X. Wang et al. "Beyond Frame-level CNN: Saliency-aware 3D CNN with LSTM for Video Action Recognition," IEEE Sig. Processing Letters, 2016.
  19. M. S. Ibrahim et al. "A hierarchical deep temporal model for group activity recognition," CVPR, 2016.