[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.6109/jicce.2021.19.2.79

Novel Image Classification Method Based on Few-Shot Learning in Monkey Species

Wang, Guangxing (Information Technology Center, Jiujiang University)
Lee, Kwang-Chan (School of Computer Information & Communication Engineering, Kunsan National University)
Shin, Seong-Yoon (School of Computer Information & Communication Engineering, Kunsan National University)

Publication Information

Journal of information and communication convergence engineering / v.19, no.2, 2021 , pp. 79-83 More about this Journal

Abstract

This paper proposes a novel image classification method based on few-shot learning, which is mainly used to solve model overfitting and non-convergence in image classification tasks of small datasets and improve the accuracy of classification. This method uses model structure optimization to extend the basic convolutional neural network (CNN) model and extracts more image features by adding convolutional layers, thereby improving the classification accuracy. We incorporated certain measures to improve the performance of the model. First, we used general methods such as setting a lower learning rate and shuffling to promote the rapid convergence of the model. Second, we used the data expansion technology to preprocess small datasets to increase the number of training data sets and suppress over-fitting. We applied the model to 10 monkey species and achieved outstanding performances. Experiments indicated that our proposed method achieved an accuracy of 87.92%, which is 26.1% higher than that of the traditional CNN method and 1.1% higher than that of the deep convolutional neural network ResNet50.

Keywords

Deep learning; Feature extraction; Few-shot learning; Image classification;

Citations & Related Records

Reference

1	A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," Communications on ACM, vol. 60, no. 6, pp. 84-90, 2017. DOI: 10.1145/3065386. DOI
2	C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, "Going deeper with convolutions," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, pp. 1-9, 2015.
3	K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proceedings of Computer Vision & Pattern Recognition 2016, Las Vegas, NV, USA, pp. 770-778, 2016.
4	W. Zhang, "Shift-invariant pattern recognition neural network and its optical architecture," in Proceedings of Annual Conference of The Japan Society of Applied Physics, Montreal, CA, 1988.
5	O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, and L. Fei-Fei, "ImageNet large scale visual recognition challenge," International Journal of Computer Vision, vol. 115, pp. 211-252, 2015. DOI: 10.1007/s11263-015-0816-y. DOI
6	N. Bendre, H. T. Marin, and P. Najafirad, "Learning from few samples: A survey," Computer Vision and Pattern Recognition, pp. 1-17, 2007.
7	Y. Wang, Q. Yao, J. T. Kwok, and L. M. Ni, "Generalizing from a few examples: A survey on few-shot learning," ACM Computing Survey (CSUR), vol. 53, no. 3, pp. 1-34, 2020.
8	Kaggle, 10 monkey species [Internet], Available: https://www.kaggle.com/slothkong/10-monkey-species.
9	G. X. Wang and S. Y. Shin, "An improved text classification method for sentiment classification," Journal of Information and Communication Convergence Engineering, vol. 17, no. 1, pp. 41-48, 2019. DOI: 10.6109/jicce.2019.17.1.41. DOI
10	D. P. Kingma and J. L. Ba, "Adam: A method for stochastic optimization," in The 3rd International Conference for Learning Representations, San Diego, CA, USA, pp. 1-15, 2015.
11	L. Jiao, F. Zhang, F. Liu, S. Yang, L. Li, Z. Feng, and R. Qu, "A survey of deep learning-based object detection," IEEE Access, vol. 7, pp. 128837-128868, 2019. DOI: 10.1109/ACCESS.2019.2939201. DOI
12	H. Laga, L. V. Jospin, F. Boussaid, and M. Bennamoun, "A survey on deep learning techniques for stereo-based depth estimation," IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1-27, 2020. DOI: 10.1109/TPAMI.2020.3032602. DOI
13	J. Lu, P. Gong, J. Ye, and C. Zhang, "Learning from very few samples: A survey," Computer Vision and Pattern Recognition, pp. 1-30, 2020.
14	Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," in Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, Nov. 1998. DOI: 10.1109/5.726791. DOI
15	K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," in Proceedings of International Conference on Learning Representations, San Diego, CA, USA, 2014. DOI: arxiv.org/abs/1409.1556.
16	J. Deng, W. Dong, R. Socher, L. J. Li, K. Li, and L. Fei-Fei, "ImageNet: A large-scale hierarchical image database," in 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami: FL, pp. 248-255, 2009. DOI: 10.1109/CVPR.2009.5206848. DOI
17	Kaggle, Dogs vs. cats [internet], Available: https://www.kaggle.com/salader/dogsvscats.