Browse > Article
http://dx.doi.org/10.7472/jksii.2020.21.5.149

Performance Analyzer for Embedded AI Processor  

Hwang, Dong Hyun (Dept. of Electronic Engineering, Seoul National University of Science and Technology)
Yoon, Young Hyun (Dept. of Electronic Engineering, Seoul National University of Science and Technology)
Han, Chang Yeop (Dept. of Electronic Engineering, Seoul National University of Science and Technology)
Lee, Seung Eun (Dept. of Electronic Engineering, Seoul National University of Science and Technology)
Publication Information
Journal of Internet Computing and Services / v.21, no.5, 2020 , pp. 149-157 More about this Journal
Abstract
Recently, as interest in artificial intelligence has increased, many studies have been conducted to implement AI processors. However, the AI processor requires functional verification as well as performance verification on whether the AI processor is suitable for the application. In this paper, We propose an AI processor performance analyzer that can verify the application performance and explore the limitations of the processor. By Using the performance analyzer, we explore the limitations of the AI processor and optimize the AI model to fit an AI processor in image recognition and speech recognition applications.
Keywords
AI processor; Simulator; Artificial intelligence; Embedded system; SoC;
Citations & Related Records
Times Cited By KSCI : 4  (Citation Analysis)
연도 인용수 순위
1 Yoon, Y.H, Hwang, D.H, Yang, J.H, Lee, S.E, "Intellino: Processor for Embedded Artificial Intelligence," Electronics 2020, 9, 1169. https://doi.org/10.3390/electronics9071169   DOI
2 H. Yoon, K. Kim, J. Chun, "GAN-based shadow removal using context information," Journal of Internet Computing and Services, vol. 20, no. 6, pp. 29-36, 2019. https://doi.org/10.7472/jksii.2019.20.6.29.   DOI
3 S. Yoo, "A study on counting number of passengers by moving object detection," Journal of Internet Computing and Services, vol. 21, no. 2, pp. 9-18, 2020. https://doi.org/10.7472/jksii.2020.21.2.9.   DOI
4 Geoffrey Hinton, Oriol Vinyals, Jeff Dean, "Distilling the Knowledge in a Neural Network", 2015. https://arxiv.org/abs/1503.02531.
5 Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, Hartwig Adam, "MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications," 2017. https://arxiv.org/abs/1704.04861
6 X. Zhang, X. Zhou, M. Lin, J. Sun, "ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices," 2017. https://arxiv.org/abs/1707.01083
7 M. Rastegari, V. Ordonez, J. Redmon, A. Farhadi, "XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks," 2016. https://arxiv.org/abs/1603.05279.
8 Xiangyu Zhang, Xinyu Zhou, Mengxiao Lin, Jian Sun, "ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design," 2018. https://arxiv.org/abs/1807.11164.
9 M. Courbariaux, Y. Bengio, J. David, "BinaryConnect: Training Deep Neural Networks with binary weights during propagations," 2015. https://arxiv.org/abs/1511.00363.
10 Matthieu Courbariaux, Yoshua Bengio, Jean-Pierre David, "Binarized Neural Networks: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1," 2016. https://arxiv.org/abs/1602.02830.
11 Mohammad Rastegari, Vicente Ordonez, Joseph Redmon, Ali Farhadi, "Regularized Binary Network Training," 2018. https://arxiv.org/abs/1812.11800.
12 Zhuang Liu, Mingjie Sun, Tinghui Zhou, Gao Huang, Trevor Darrell, "Rethinking the Value of Network Pruning," 2018. https://arxiv.org/abs/1810.05270.
13 Choi Woong, Kwanghyo Jeong, Kyungrak Choi, Kyeongho Lee, Jongsun Park, "Content Addressable Memory Based Binarized Neural Network Accelerator Using Time-Domain Signal Processing," In Proceedings of the 55th Annual Design Automation Conference. Association for Computing Machinery, 2018. https://doi.org/10.1145/3195970.3196014
14 Liu W., Chawla S., "Class Confidence Weighted kNN Algorithms for Imbalanced Data Sets," Advances in Knowledge Discovery and Data Mining. PAKDD. Lecture Notes in Computer Science, vol 6635. Springer, Berlin, Heidelberg, 2011. https://doi.org/10.1007/978-3-642-20847-8_29
15 M. Blott, T. B. Preußer, N. J. Fraser, G. Gambardella, K. O'brien, Y. Umuroglu, M. Leeser, K. Vissers, "Finn-r: An end-to-end deep-learning framework for fast exploration of quantized neural networks," ACM Transactions on Reconfigurable Technology and Systems (TRETS), 11 (3):16, 2018. https://doi.org/10.1145/3242897
16 Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, Liang-Chieh Chen, "MobileNetV2: Inverted Residuals and Linear Bottlenecks," 2018. https://arxiv.org/abs/1801.04381.
17 M. Zhu, T. Zhang, Z. Gu, Y. Xie, "Sparse tensor core: Algorithm and hardware co-design for vector-wise sparse neural networks on modern gpus," MICRO '52, pp. 359-371, 2019. https://doi.org/10.1145/3352460.3358269
18 W. Hua, Y. Zhou, C. Sa, Z. Zhang, G. Suh, "Boosting the performance of cnn accelerators with dynamic fine-grained channel gating," In Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 52, pp. 139150, 2019. https://doi.org/10.1145/3352460. 3358283.
19 S. Y. Jang, Y. H. Yoon, S. E. Lee, "Stochastic Computing based AI System for Mobile Devices," IEEE International Conference on Consumer Electronics (ICCE), Las Vegas, NV, USA, 2020, pp. 1-2, 2020. https://doi.org/10.1109/ICCE46568.2020.9042978.
20 Harpham, C., Dawson, C.W. & Brown, M.R, "A review of genetic algorithms applied to training radial basis function networks," Neural Computing & Applications 13, 193-201, 2004. https://doi.org/10.1007/s00521-004-0404-5   DOI
21 Pete Warden, "Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition", 2018. https://arxiv.org/abs/1804.03209
22 Y. LeCun, L. Bottou, Y. Bengio, P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, 86(11): 2278-2324, 1998. https://doi.org/10.1109/5.726791