• Title/Summary/Keyword: Approximated Inference

Search Result 13, Processing Time 0.016 seconds

Deep compression of convolutional neural networks with low-rank approximation

  • Astrid, Marcella;Lee, Seung-Ik
    • ETRI Journal
    • /
    • v.40 no.4
    • /
    • pp.421-434
    • /
    • 2018
  • The application of deep neural networks (DNNs) to connect the world with cyber physical systems (CPSs) has attracted much attention. However, DNNs require a large amount of memory and computational cost, which hinders their use in the relatively low-end smart devices that are widely used in CPSs. In this paper, we aim to determine whether DNNs can be efficiently deployed and operated in low-end smart devices. To do this, we develop a method to reduce the memory requirement of DNNs and increase the inference speed, while maintaining the performance (for example, accuracy) close to the original level. The parameters of DNNs are decomposed using a hybrid of canonical polyadic-singular value decomposition, approximated using a tensor power method, and fine-tuned by performing iterative one-shot hybrid fine-tuning to recover from a decreased accuracy. In this study, we evaluate our method on frequently used networks. We also present results from extensive experiments on the effects of several fine-tuning methods, the importance of iterative fine-tuning, and decomposition techniques. We demonstrate the effectiveness of the proposed method by deploying compressed networks in smartphones.

On the Generation of Line Balanced Assembly Sequences Based on the Evaluation of Assembly Work Time Using Neural Network (신경회로망기법에 의한 조립작업시간의 추정 및 라인밸런싱을 고려한 조립순서 추론)

  • 신철균;조형석
    • Transactions of the Korean Society of Mechanical Engineers
    • /
    • v.18 no.2
    • /
    • pp.339-350
    • /
    • 1994
  • This paper presents a method for automatic generation of line balanced assembly sequences based on disassemblability and proposes a method of evaluating an assembly work time using neural networks. Since a line balancing problem in flexible assembly system requires a sophisticated planning method, reasoning about line balanced assembly sequences is an important field of concern for planning assembly lay-out. For the efficient inference of line balanced assembly sequences, many works have been reported on how to evaluate an assembly work time at each work station. However, most of them have some limitations in that they use cumbersome user query or approximated assembly work time data without considering assembly conditions. To overcome such criticism, this paper proposes a new approach to mathematically verify assembly conditions based on disassemblability. Based upon the results, we present a method of evaluating assembly work time using neural networks. The proposed method provides an effective means of solving the line balancing problem and gives a design guidance of planning assembly lay-out in flexible assembly application. An example study is given to illustrate the concepts and procedure of the proposed scheme.

The Impact of Various Degrees of Composite Minimax ApproximatePolynomials on Convolutional Neural Networks over Fully HomomorphicEncryption (다양한 차수의 합성 미니맥스 근사 다항식이 완전 동형 암호 상에서의 컨볼루션 신경망 네트워크에 미치는 영향)

  • Junghyun Lee;Jong-Seon No
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.33 no.6
    • /
    • pp.861-868
    • /
    • 2023
  • One of the key technologies in providing data analysis in the deep learning while maintaining security is fully homomorphic encryption. Due to constraints in operations on fully homomorphically encrypted data, non-arithmetic functions used in deep learning must be approximated by polynomials. Until now, the degrees of approximation polynomials with composite minimax polynomials have been uniformly set across layers, which poses challenges for effective network designs on fully homomorphic encryption. This study theoretically proves that setting different degrees of approximation polynomials constructed by composite minimax polynomial in each layer does not pose any issues in the inference on convolutional neural networks.