DOI QR코드

DOI QR Code

Small Sample Face Recognition Algorithm Based on Novel Siamese Network

  • Zhang, Jianming (Hunan Provincial Key Laboratory of Intelligent Processing of Big Data on Transportation and School of Computer and Communication Engineering, Changsha University of Science and Technology) ;
  • Jin, Xiaokang (Hunan Provincial Key Laboratory of Intelligent Processing of Big Data on Transportation and School of Computer and Communication Engineering, Changsha University of Science and Technology) ;
  • Liu, Yukai (Hunan Provincial Key Laboratory of Intelligent Processing of Big Data on Transportation and School of Computer and Communication Engineering, Changsha University of Science and Technology) ;
  • Sangaiah, Arun Kumar (School of Computer Science and Engineering, Vellore Institute of Technology (VIT)) ;
  • Wang, Jin (Hunan Provincial Key Laboratory of Intelligent Processing of Big Data on Transportation and School of Computer and Communication Engineering, Changsha University of Science and Technology)
  • Received : 2018.08.31
  • Accepted : 2018.10.22
  • Published : 2018.12.31

Abstract

In face recognition, sometimes the number of available training samples for single category is insufficient. Therefore, the performances of models trained by convolutional neural network are not ideal. The small sample face recognition algorithm based on novel Siamese network is proposed in this paper, which doesn't need rich samples for training. The algorithm designs and realizes a new Siamese network model, SiameseFacel, which uses pairs of face images as inputs and maps them to target space so that the $L_2$ norm distance in target space can represent the semantic distance in input space. The mapping is represented by the neural network in supervised learning. Moreover, a more lightweight Siamese network model, SiameseFace2, is designed to reduce the network parameters without losing accuracy. We also present a new method to generate training data and expand the number of training samples for single category in AR and labeled faces in the wild (LFW) datasets, which improves the recognition accuracy of the models. Four loss functions are adopted to carry out experiments on AR and LFW datasets. The results show that the contrastive loss function combined with new Siamese network model in this paper can effectively improve the accuracy of face recognition.

Keywords

E1JBB0_2018_v14n6_1464_f0001.png 이미지

Fig.1 Siamese network algorithm architecture.

E1JBB0_2018_v14n6_1464_f0002.png 이미지

Fig. 2. SiameseFacel network architecture.

E1JBB0_2018_v14n6_1464_f0003.png 이미지

Fig. 3. SiameseFace2 network architecture.

E1JBB0_2018_v14n6_1464_f0004.png 이미지

Fig. 4. AR dataset part of the face image.

E1JBB0_2018_v14n6_1464_f0005.png 이미지

Fig. 5. LFW dataset part of the face image.

E1JBB0_2018_v14n6_1464_f0006.png 이미지

Fig. 6. matched pairs and unmatched paris generation.

E1JBB0_2018_v14n6_1464_f0007.png 이미지

Fig. 7. Different model loss convergence performance comparison chart.

Table 1. SiameseFacel network architecture.

E1JBB0_2018_v14n6_1464_t0001.png 이미지

Table 2. SiameseFace2 network parameter

E1JBB0_2018_v14n6_1464_t0002.png 이미지

Table 3. SiameseFace2 network parameter

E1JBB0_2018_v14n6_1464_t0003.png 이미지

Table 4. Five different network model's configuration and recognition rate

E1JBB0_2018_v14n6_1464_t0004.png 이미지

Table 5. Experimental results on the AR dataset

E1JBB0_2018_v14n6_1464_t0005.png 이미지

Table 6. Experimental results on the LFW dataset

E1JBB0_2018_v14n6_1464_t0006.png 이미지

Table 7. Comparison of different loss functions on AR dataset

E1JBB0_2018_v14n6_1464_t0007.png 이미지

References

  1. I. D. Stephen, V. Hiew, V. Coetzee, B. P. Tiddeman, and D. I. Perrett, "Facial shape analysis identifies valid cues to aspects of physiological health in Caucasian, Asian, and African populations," Frontiers in Psychology, vol. 8, article no. 1883, 2017.
  2. R. Blanco-Gonzalo, N. Poh, R. Wong, and R. Sanchez-Reillo, "Time evolution of face recognition in accessible scenarios," Human-centric Computing and Information Sciences, vol. 5, article no. 24, 2015.
  3. B. Maze, J. Adams, J. A. Duncan, N. Kalka, T. Miller, C. Otto, et al., "IARPA Janus Benchmark-C: face dataset and protocol," in Proceedings of the 11th IAPR International Conference on Biometrics (ICB), Gold Coast, Australia, 2018.
  4. F. Liu, Y. Bi, Y. Cui, and Z. Tang, "Local similarity based linear discriminant analysis for face recognition with single sample per person," in Computer Vision-ACCV 2014 Workshop. Cham: Springer, 2014, pp. 85-95.
  5. F. Tsalakanidou, D. Tzovaras, and M. G. Strintzis, "Use of depth and colour eigenfaces for face recognition," Pattern Recognition Letters, vol. 24, no. 9-10, pp. 1427-1435, 2003. https://doi.org/10.1016/S0167-8655(02)00383-5
  6. X. He, S. Yan, Y. Hu, P. Niyogi, and H. J. Zhang, "Face recognition using laplacianfaces," IEEE Transactions on Pattern Analysis And Machine Intelligence, vol. 27, no. 3, pp. 328-340, 2005. https://doi.org/10.1109/TPAMI.2005.55
  7. Y. Tu, Y. Lin, J. Wang, and J. U. Kim, "Semi-supervised learning with generative adversarial networks on digital signal modulation classification," Computers Materials & Continua, vol. 55, no. 2, pp. 243-254, 2018.
  8. N. Yu, Z. Yu, F. Gu, T. Li, X. Tian, and Y. Pan, "Deep learning in genomic and medical image data analysis: challenges and approaches," Journal of Information Processing Systems, vol. 13, no. 2, pp. 204-214, 2017. https://doi.org/10.3745/JIPS.04.0029
  9. K. M. Koo and E. Y. Cha, "Image recognition performance enhancements using image normalization," Human-centric Computing and Information Sciences, vol. 7, article no. 33, 2017.
  10. T. N. Sainath, R. J. Weiss, K. W. Wilson, B. Li, A. Narayanan, E. Variani, et al., "Multichannel signal processing with deep neural networks for automatic speech recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 5, pp. 965-979, 2017. https://doi.org/10.1109/TASLP.2017.2672401
  11. S. G. Lee, Y. Sung, Y. G. Kim, and E. Y. Cha, "Variations of AlexNet and GoogLeNet to improve Korean character recognition performance," Journal of Information Processing Systems, vol. 14, no. 1, pp. 205-217, 2018. https://doi.org/10.3745/JIPS.04.0061
  12. S. Chopra, R. Hadsell, and Y. LeCun, "Learning a similarity metric discriminatively, with application to face verification," in Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Diego, CA, 2005, pp. 539-546.
  13. J. Wright, A. Y. Yang, A. Ganesh, S. S. Sastry, and Y. Ma, "Robust face recognition via sparse representation," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 2, pp. 210-227, 2009. https://doi.org/10.1109/TPAMI.2008.79
  14. M. Yang and L. Zhang, "Gabor feature based sparse representation for face recognition with Gabor occlusion dictionary," in Computer Vision-ECCV 2010. Heidelberg: Springer, 2010, pp. 448-461.
  15. L. Zhang, M. Yang, and X. Feng, "Sparse representation or collaborative representation: which helps face recognition?," in Proceedings of 2011 IEEE International Conference on Computer Vision (ICCV), Barcelona, Spain, 2011, pp. 471-478.
  16. D. M. Vo and S. W. Lee, "Robust face recognition via hierarchical collaborative representation," Information Sciences, vol. 432, pp. 332-346, 2018. https://doi.org/10.1016/j.ins.2017.12.014
  17. C. Li, S. Zhao, K. Xiao, and Y. Wang, "Face recognition based on the combination of enhanced local texture feature and DBN under complex illumination conditions," Journal of Information Processing Systems, vol. 14, no. 1, pp. 191-214, 2018. https://doi.org/10.3745/JIPS.04.0060
  18. C. Whitelam, E. Taborsky, A. Blanton, B. Maze, J. C. Adams, T. Miller, et al., "IARPA Janus Benchmark-B face dataset," in Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Honolulu, HI, 2017, pp. 90-98.
  19. L. Tran, X. Yin, and X. Liu, "Disentangled representation learning GAN for pose-invariant face recognition," in Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, 2017, pp. 1415-1424.
  20. D. Chen, X. Cao, F. Wen, and J. Sun, "Blessing of dimensionality: High-dimensional feature and its efficient compression for face verification," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, 2013, pp. 3025-3032).
  21. K. Simonyan, O. M. Parkhi, A. Vedaldi, and A. Zisserman, "Fisher vector faces in the Wild," in Proceedings of the British Machine Vision Conference (BMVC), Bristol, UK, 2013.
  22. Z. Zhu, P. Luo, X. Wang, and X. Tang, "Recover canonical-view faces in the wild with deep neural networks," 2014 [Online]. Available: https://arxiv.org/abs/1404.3543.
  23. Y. Taigman, M. Yang, M.A. Ranzato, and L. Wolf, "DeepFace: closing the gap to human-level performance in face verification," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, 2014, pp. 1701-1708.
  24. Y. Sun, X. Wang, and X. Tang, "Deep learning face representation from predicting 10,000 classes," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, 2014, pp. 1891-1898.
  25. Y. Sun, Y. Chen, X. Wang, and X. Tang, "Deep learning face representation by joint identification-verification," Advances in Neural Information Processing Systems, vol. 27, pp. 1988-1996, 2014.
  26. M. Parchami, S. Bashbaghi, and E. Granger, "Video-based face recognition using ensemble of Haar-like deep convolutional neural networks," in Proceedings of 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, 2017, pp. 4625-4632.
  27. F. Schroff, D. Kalenichenko, and J. Philbin, "FaceNet: a unified embedding for face recognition and clustering," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, 2015, pp. 815-823.
  28. D. Chen, X. Cao, L. Wang, F. Wen, and J. Sun, "Bayesian face revisited: a joint formulation," in Computer Vision-ECCV 2012. Heidelberg: Springer, 2012, pp. 566-579.
  29. O. Barkan, J. Weill, L. Wolf, and H. Aronowitz, "Fast high dimensional vector multiplication face recognition," in Proceedings of the IEEE International Conference on Computer Vision, Sydney, Australia, 2013, pp. 1960-1967.
  30. S. Berlemont, G. Lefebvre, S. Duffner, and C. Garcia, "Class-balanced Siamese neural networks," Neurocomputing, vol. 273, pp. 47-56, 2018. https://doi.org/10.1016/j.neucom.2017.07.060
  31. U. Shaham and R. R. Lederman, "Learning by coincidence: Siamese networks and common variable learning," Pattern Recognition, vol. 74, pp. 52-63, 2018. https://doi.org/10.1016/j.patcog.2017.09.015