Search | Korea Science

Ming, Yue;Wang, Guangchao;Hong, Xiaopeng
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.11 no.3
- /
- pp.1595-1613
- /
- 2017
The IR camera and laser-based IR projector provide an effective solution for real-time collection of moving targets in RGB-D videos. Different from the traditional RGB videos, the captured depth videos are not affected by the illumination variation. In this paper, we propose a novel feature extraction framework to describe human activities based on the above optical video capturing method, namely spatial-temporal texture features for 3D human activity recognition. Spatial-temporal texture feature with depth information is insensitive to illumination and occlusions, and efficient for fine-motion description. The framework of our proposed algorithm begins with video acquisition based on laser projection, video preprocessing with visual background extraction and obtains spatial-temporal key images. Then, the texture features encoded from key images are used to generate discriminative features for human activity information. The experimental results based on the different databases and practical scenarios demonstrate the effectiveness of our proposed algorithm for the large-scale data sets.
https://doi.org/10.3837/tiis.2017.03.019 인용 PDF KSCI

Kim, Joo-Hee;Kim, In-Cheol
- Journal of Institute of Control, Robotics and Systems
- /
- v.21 no.11
- /
- pp.996-1002
- /
- 2015
In this paper, we present an effective system for the 3D scene labeling of objects from RGB-D videos. Our system uses a Markov Random Field (MRF) over a voxel representation of the 3D scene. In order to estimate the correct label of each voxel, the probabilistic graphical model integrates both scores from sliding window-based object detectors and also from object location prior maps. Both the object detectors and the location prior maps are pre-trained from manually labeled RGB-D images. Additionally, the model integrates the scores from considering the geometric constraints between adjacent voxels in the label estimation. We show excellent experimental results for the RGB-D Scenes Dataset built by the University of Washington, in which each indoor scene contains tabletop objects.
https://doi.org/10.5302/J.ICROS.2015.15.0159 인용 PDF KSCI