DOI QR코드

DOI QR Code

Uncooperative Person Recognition Based on Stochastic Information Updates and Environment Estimators

  • Received : 2014.02.01
  • Accepted : 2015.02.03
  • Published : 2015.04.01

Abstract

We address the problem of uncooperative person recognition through continuous monitoring. Multiple modalities, such as face, height, clothes color, and voice, can be used when attempting to recognize a person. In general, not all modalities are available for a given frame; furthermore, only some modalities will be useful as some frames in a video sequence are of a quality that is too low to be able to recognize a person. We propose a method that makes use of stochastic information updates of temporal modalities and environment estimators to improve person recognition performance. The environment estimators provide information on whether a given modality is reliable enough to be used in a particular instance; such indicators mean that we can easily identify and eliminate meaningless data, thus increasing the overall efficiency of the method. Our proposed method was tested using movie clips acquired under an unconstrained environment that included a wide variation of scale and rotation; illumination changes; uncontrolled distances from a camera to users (varying from 0.5 m to 5 m); and natural views of the human body with various types of noise. In this real and challenging scenario, our proposed method resulted in an outstanding performance.

Keywords

References

  1. Y.-B. Lee and S. Lee, "Robust Face Detection Based on Knowledge-Directed Specification of Bottom-Up Saliency," ETRI J., vol. 33, no. 4, Aug. 2011, pp. 600-610. https://doi.org/10.4218/etrij.11.1510.0123
  2. N. Bellotto and H. Hu, "Multisensor Data Fusion for Joint People Tracking and Identification with a Service Robot," IEEE Int. Conf. Robot Biomimetics, Sanya, China, Dec. 15-18, 2007, pp. 1494-1499.
  3. D. Jo. et al., "Tracking and Interaction Based on Hybrid Sensing for Virtual Environments," ETRI J., vol. 35, no. 2, Apr. 2013, pp. 356-359. https://doi.org/10.4218/etrij.13.0212.0170
  4. S. Zhou and R. Chellappa, "Probabilistic Human Recognition from Video," European Conf. Comput. Vis., Copenhagen, Denmark, May 27-31, 2002, pp. 681-697.
  5. S. Zhou and R. Chellappa, "Simultaneous Tracking and Recognition of Human Faces from Video," IEEE Int. Conf. Acoustic, Speech Signal Process., vol. 3, Hong Kong, China, Apr. 6-10, 2003, pp. 225-228.
  6. N. Seo, "Simultaneous Multi-view Face Tracking and Recognition in Video Using Particle Filtering," M.S. thesis, Department of Electrical and Computer Engineering, University of Maryland, College Park of Maryland, MD, USA, 2009.
  7. M. Kim et al., "Face Tracking and Recognition with Visual Constraints in Real-World Videos," IEEE Conf. Comput. Vis. Pattern Recogn., Anchorage, AK, USA, June 23-28, 2008, pp. 1-8.
  8. M. Farenzena et al., "Person Re-identification by Symmetry-Driven Accumulation of Local Features," IEEE Conf. Comput. Vis. Pattern Recogn., San Francisco, CA, USA, June 13-18, 2010, pp. 2360-2367.
  9. E.J. Ploran et al., "Evidence Accumulation and the Moment of Recognition: Dissociating Perceptual Recognition Processes Using fMRI," J. Neurosci., vol. 27, no. 44, Oct. 31, 2007, pp. 11912-11924. https://doi.org/10.1523/JNEUROSCI.3522-07.2007
  10. W. Kim et al., "Human Action Recognition Using Ordinal Measure of Accumulated Motion," EURASIP J. Adv. Signal Process., Apr. 2010, pp. 1-11.
  11. M. Lucenal et al., "Human Action Recognition Using Optical Flow Accumulated Local Histograms," IbPRAI, Povoa de Varzim, Portugal, June 10-12, 2009, pp. 32-39.
  12. M.E. Nilsback and R. Caputo, "Cue Integration through Discriminative Accumulation," IEEE Conf. Comput. Vis. Pattern Recogn., vol. 2, Washington, DC, USA, June 27-July 2, 2004, pp. 578-585.
  13. M.-E. Nilsback, "A Cue-Integration Scheme for Object Recognition Using Discriminative Accumulation," M.S. thesis, Department of Numerical Analysis and Computer Science, Royal Institute of Technology, Stockholm, Sweden, 2004.
  14. A. Pronobis and B. Caputo, "Confidence-Based Cue Integration for Visual Place Recognition," IEEE/RSJ Int. Conf. Intell. Robots Syst., San Diego, CA, USA, Oct. 29-Nov. 2, 2007, pp. 2394-2401.
  15. J. Lee et al., "Integrating Evidences of Independently Developed Face and Speaker Recognition Systems by Using Discrete Probability Density Function," IEEE Int. Symp. Robot Human Interactive Commun., Jeju, Rep. of Korea, Aug. 26-29, 2007, pp. 667-672.
  16. O. Velek, S. Jaeger, and M. Nakagawa, "Accumulated-Recognition-Rate Normalization for Combining Multiple On/Off-Line Japanese Character Classifiers Tested on a Large Database," Multiple Classifier Systems, Guildford, UK: Springer Berlin Heidelberg, vol. 2709, 2003, pp. 196-205.
  17. J. Pelecanos, U. Chaudhari, and G. Ramaswamy, "Compensation of Utterance Length for Speaker Verification," Proc. ODYSSEY, Toledo, Spain, May 31-June 3, 2004, pp. 161-164.
  18. H.-J. Kim et al., "Multi-modal User Recognition Based on Environmental Parameters," Proc. Frontier Comput. Vis., Japan, Feb. 2010, pp. 342-345.
  19. M. Li et al., "Rapid and Robust Human Detection and Tracking Based on Omega-Shape Features," IEEE Int. Conf. Image Process., Cairo, Egypt, Nov. 7-10, 2009, pp. 2545-2548.
  20. S. Mukeriee and K. Das, "A Novel Equation Based Classifier for Detecting Human in Images," Int. J. Comput. Appl., vol. 72, no. 6, June 2013, pp. 9-16. https://doi.org/10.5120/12496-7272
  21. K.-D. Ban et al., "Tiny and Blurred Face Alignment for Long Distance Face Recognition," ETRI J., vol. 33, no. 2, Apr. 2011, pp. 251-258. https://doi.org/10.4218/etrij.11.1510.0022
  22. D.-H. Kim et al., "A Vision-Based User Authentification System in Robot Environments by Using Semi-Biometrics and Tracking," IEEE/RSJ Intell. Robots Syst., Alberta, Canada, Aug. 2-6, 2005, pp. 1812-1817.
  23. S. Kim, M. Ji, and H. Kim, "Noise-Robust Speaker Recognition Using Subband Likelihoods and Reliable-Feature Selection," ETRI J., vol. 30, no. 1, Feb. 2008, pp. 89-100. https://doi.org/10.4218/etrij.08.0107.0108
  24. A.S. Georghiades, P.N. Belhumeur, and D.J. Kriegman, "From Few to Many: Illumination Cone Models for Face Recognition Under Variable Lighting and Pose," IEEE Trans. Pattern Anal. Mach. Intell., vol. 23, no. 6, June 2001, pp. 643-660. https://doi.org/10.1109/34.927464
  25. K.-C. Lee, J. Ho, and D.J. Kriegman, "Acquiring Linear Subspaces for Face Recognition Under Variable Lighting," IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 5, May 2005, pp. 684-698. https://doi.org/10.1109/TPAMI.2005.92

Cited by

  1. A face recognition system based on convolution neural network using multiple distance face vol.21, pp.17, 2015, https://doi.org/10.1007/s00500-016-2095-0