Browse > Article
http://dx.doi.org/10.9717/kmms.2018.21.12.1467

Low Resolution Rate Face Recognition Based on Multi-scale CNN  

Wang, Ji-Yuan (Dept. of Information and Communication Engineering, Tongmyong University)
Lee, Eung-Joo (Dept. of Information and Communication Engineering, Tongmyong University)
Publication Information
Abstract
For the problem that the face image of surveillance video cannot be accurately identified due to the low resolution, this paper proposes a low resolution face recognition solution based on convolutional neural network model. Convolutional Neural Networks (CNN) model for multi-scale input The CNN model for multi-scale input is an improvement over the existing "two-step method" in which low-resolution images are up-sampled using a simple bi-cubic interpolation method. Then, the up sampled image and the high-resolution image are mixed as a model training sample. The CNN model learns the common feature space of the high- and low-resolution images, and then measures the feature similarity through the cosine distance. Finally, the recognition result is given. The experiments on the CMU PIE and Extended Yale B datasets show that the accuracy of the model is better than other comparison methods. Compared with the CMDA_BGE algorithm with the highest recognition rate, the accuracy rate is 2.5%~9.9%.
Keywords
Face Recognition; Intelligent Video Analysis Method; CNN; Multi-scale CNN;
Citations & Related Records
연도 인용수 순위
  • Reference
1 W.K. Xu and E.J. Lee, "Human-computer Catural User Interface Based on Hand Motion Detection and Tracking," The Journal of Multimedia Information System, Vol. 15, No. 4, pp. 501-507, 2012.
2 R.T. Collins, A.J. Lipton, and T. Kanade, A System for Video Surveillance and Monitoring, Vsam Final Report Carnegie Mellon University Technical Report, 2000.
3 Siebel, T. Nils, and S.J. Maybank, "The Advisor Visual Surveillance System," Proceeding of European Conference on Computer Vision 2004 Workshop Applications of Computer Vision, pp. 103-111, 2004.
4 C.F. Shu, A. Hampapur, M. Lu, L. Brown, J. Cannell, A. Senior, Y.L. Tian, et al., "IBM Smart Surveillance System (S3): a Open and Extensible Framework for Event based Surveillance," Proceeding of IEEE Conference on Advanced Video and Signal Based Surveillance, pp. 318-323, 2005.
5 K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," Computer Science, 2014.
6 E. Ahmed, M. Jones, and T.K. Marks, "An Improved Deep Learning Architecture for Person Re-identification," Computer Vision and Pattern Recognition, pp. 3908-3916, 2015.
7 K.M. He, X.Y. Zhang, S.Q. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," arXiv, arXiv:1512.03385, 2015.
8 K.T. Lim, H.W. Kang and J.K. lee,"Moving Shadow Detection using Deep Learning and Markov Random Field," journal of multimedia information system, Vol. 18, No. 12, pp. 1432- 1438, 2015
9 G.E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R.R. Salakhutdinov, "Improving Neural Networks by Preventing Co-adaptation of Feature Detectors," Computer Science, Vol. 3, No. 4, pp. 212-223, 2012.
10 L. Wan, M.D. Zeiler, S.X. Zhang, Y. Lecun, and R. Fergus, "Regularization of Neural Networks using DropConnect," Proceeding of International Conference on Machine Learning, pp. 1058-1066, 2013.
11 F. Rosenblatt, "The Perception: a Probabilistic Model for Information Storage and Organization in the Brain," American Psychological Association, Vol. 65, No. 6, pp. 386-408, 1958.
12 R. Gross, I. Matthews, J. Cohn, T. Kanade, and S. Baker, "Multi-PIE," Image and Vision Computing, Vol. 28, No. 5, pp. 807-813, 2010.   DOI
13 A.S. Georghiades, P.N. Belhumeur, and D.J. Kriegman, "From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 12, No. 6, pp. 643-660, 2002.