Search | Korea Science

Kim, Ji-Min;Kim, In-Mo;Kim, Myung-Sun
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.26 no.6
- /
- pp.842-849
- /
- 2022
With the development of deep learning technology, there are many cases of using DNNs in embedded systems such as unmanned vehicles, drones, and robotics. Typically, in the case of an autonomous driving system, it is crucial to run several DNNs which have high accuracy results and large computation amount at the same time. However, running multiple DNNs simultaneously in an embedded system with relatively low performance increases the time required for the inference. This phenomenon may cause a problem of performing an abnormal function because the operation according to the inference result is not performed in time. To solve this problem, the solution proposed in this paper first reduces the computation by applying the Tucker decomposition to DNN models with big computation amount, and then, make DNN models run in parallel as much as possible in the unit of hidden layer inside the GPU. The experimental result shows that the DNN inference time decreases by up to 75.6% compared to the case before applying the proposed technique.
https://doi.org/10.6109/jkiice.2022.26.6.842 인용 PDF KSCI