DOI QR코드

DOI QR Code

Design of the 3D Object Recognition System with Hierarchical Feature Learning

계층적 특징 학습을 이용한 3차원 물체 인식 시스템의 설계

  • 김주희 (경기대학교 컴퓨터과학과) ;
  • 김동하 (경기대학교 컴퓨터과학과) ;
  • 김인철 (경기대학교 컴퓨터과학과)
  • Received : 2015.06.22
  • Accepted : 2015.11.14
  • Published : 2016.01.31

Abstract

In this paper, we propose an object recognition system that can effectively find out its category, its instance name, and several attributes from the color and depth images of an object with hierarchical feature learning. In the preprocessing stage, our system transforms the depth images of the object into the surface normal vectors, which can represent the shape information of the object more precisely. In the feature learning stage, it extracts a set of patch features and image features from a pair of the color image and the surface normal vector through two-layered learning. And then the system trains a set of independent classification models with a set of labeled feature vectors and the SVM learning algorithm. Through experiments with UW RGB-D Object Dataset, we verify the performance of the proposed object recognition system.

본 논문에서는 계층적 특징 학습을 이용하여 물체의 컬러 영상과 깊이 영상으로부터 해당 물체가 속한 범주와 개체, 그리고 다양한 속성들을 효과적으로 인식할 수 있는 시스템을 제안한다. 본 시스템의 전처리 단계에서는 물체의 깊이 영상을 물체의 모양 정보를 좀 더 효과적으로 표현할 수 있는 표면 법선 벡터 데이터로 변환하고, 특징 학습 단계에서는 물체의 컬러 영상과 표면 법선 벡터 데이터로부터 두 단계에 걸쳐 패치 단위 특징과 이미지 단위의 특징을 추출해낸다. 그리고 추출된 특징 벡터들과 SVM 학습 알고리즘을 이용하여 각기 독립적인 다수의 분류 모델들을 학습한다. 미국 워싱턴 대학의 RGB-D 물체 데이터 집합을 이용한 실험을 통해, 본 논문에서 제안하는 물체 인식 시스템의 높은 성능을 확인할 수 있었다.

Keywords

References

  1. D. Paulk, V. Metsis, C. McMurrough, and F. Makedon, "A supervised learning approach for fast object recognition from RGB-D data," Proc. of ACM Intl. Conf. on PErvasive Technologies Related to Assistive Environments, p.5, 2014.
  2. Y. Cheng, X. Zhao, K. Huang, and T. Tan, "Semi-supervised Learning For RGB-D Object Recognition," Proc. of IEEE Intl. Conf. on Pattern Recognition, pp.2377-2382, 2014.
  3. L. Bo, X. Ren, and D. Fox, "Learning hierarchical sparse features for RGB-(D) object recognition," International Journal of Robotics Research, Vol.33, No.4, pp.581-599, 2014. https://doi.org/10.1177/0278364913514283
  4. C. H. Lampert, H. Nickisch, and S. Harmeling, "Attribute-based classification for zero-shot visual object categorization," Proc. of IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol.36, pp.453-465. 2014. https://doi.org/10.1109/TPAMI.2013.140
  5. A. Farhadi, I. Endres, D, Hoiem, and D. Forsyth, "Describing objects by their attributes," Proc. of IEEE Intl. Conf. on Computer Vision and Pattern Recognition, pp.1778-1785, 2009.
  6. D. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision, Vol.60, No.2, pp.91-110, 2004. https://doi.org/10.1023/B:VISI.0000029664.99615.94
  7. N. Dalal, and B. Triggs, "Histograms of oriented gradients for human detection," Proc. of IEEE Conf. Computer Vision and Pattern Recognition, Vol.1, pp.886-893, 2005.
  8. R. B. Rusu, N. Blodow, and M. Beetz, "Fast point feature histograms (FPFH) for 3D registration," Proc. of IEEE Intl. Conf. on Robotics and Automation, pp.3212-3217, 2009.
  9. S. Lawrence, C. L. Giles, A. C. Tsoi, and A. D. Back, "Face recognition: A convolutional neural-network approach," IEEE Transactions on Neural Networks, Vol.8, No.1, pp.98-113, 1997. https://doi.org/10.1109/72.554195
  10. L. Bo, X. Ren, and D. Fox, "Hierarchical matching pursuit for image classification: architecture and fast algorithms," Advances in Neural Iinformation Processing Systems, pp. 2115-2123, 2011.
  11. L. Bo, X. Ren, and D. Fox, "Kernel descriptors for visual recognition," Proc. of Neural Information Processing Systems, pp.244-252, 2010.
  12. N. Silberman and R. Fergus, "Indoor scene segmentation using a structured light sensor," Proc. of IEEE Intl. Conf. on Computer Vision, pp.601-608, 2011.
  13. R. B. Rusu, A. Holzbach, M. Beetz, and G. Bradski, "Detecting and segmenting objects for mobile manipulation," Proc. of IEEE Intl. Conf. on Computer Vision, pp.47-54, 2009.
  14. R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," Proc. of IEEE Conf. Computer Vision and Pattern Recognition, pp.580-587, 2014.
  15. K. Lai, L. Bo, X. Ren, and D. Fox, "A large-scale hierarchical multi-view rgb-d object dataset," Proc. of IEEE Intl. Conf. on Robotics and Automation, pp.1817-1824, 2011.
  16. LIBSVM [Internet], http://www.csie.ntu.edu.tw/-cjlin/libsvm/.