• Title/Summary/Keyword: First-order gradient optimization

Search Result 22, Processing Time 0.025 seconds

Gradient Descent Training Method for Optimizing Data Prediction Models (데이터 예측 모델 최적화를 위한 경사하강법 교육 방법)

  • Hur, Kyeong
    • Journal of Practical Engineering Education
    • /
    • v.14 no.2
    • /
    • pp.305-312
    • /
    • 2022
  • In this paper, we focused on training to create and optimize a basic data prediction model. And we proposed a gradient descent training method of machine learning that is widely used to optimize data prediction models. It visually shows the entire operation process of gradient descent used in the process of optimizing parameter values required for data prediction models by applying the differential method and teaches the effective use of mathematical differentiation in machine learning. In order to visually explain the entire operation process of gradient descent, we implement gradient descent SW in a spreadsheet. In this paper, first, a two-variable gradient descent training method is presented, and the accuracy of the two-variable data prediction model is verified by comparison with the error least squares method. Second, a three-variable gradient descent training method is presented and the accuracy of a three-variable data prediction model is verified. Afterwards, the direction of the optimization practice for gradient descent was presented, and the educational effect of the proposed gradient descent method was analyzed through the results of satisfaction with education for non-majors.

Design Optimization of Thermal Radiation Shield Cooled by Cryocooler (냉동기에 의해 냉각되는 복사열차폐 최적설계)

  • Choi, Y.S.;Tang, Hongming;Kim, D.L.;Yang, H.S.;Lee, B.S.
    • Proceedings of the KSME Conference
    • /
    • 2008.11b
    • /
    • pp.2171-2174
    • /
    • 2008
  • The design of thermal radiation shield cooled by a cryocooler is presented. This study is motivated mainly by our recent development of prototype superconducting magnet system for the Cyclotron K120. The superconducting magnet system is composed of the magnet cryostat, transfer line and supply cryostat. In order to minimize thermal radiation load, the superconducting coil form in the magnet cryostat is enclosed by the thermal radiation shield which is thermally connected to the first-stage cold head of a two-stage cryocooler in the supply cryostat. Since the supply cryostat is located far from the magnet cryostat large temperature gradient along the thermal shield is unavoidable. In this paper, the thermal radiation shield is optimized to minimize temperature gradient with taking into account the cryogenic load, system structure and electrical load. The effect of heat source from thermal conduction through mechanical supports on the temperature distribution of thermal radiation shield is also discussed.

  • PDF

Medical Image Registration by Combining Gradient Vector Flow and Conditional Entropy Measure (기울기 벡터장과 조건부 엔트로피 결합에 의한 의료영상 정합)

  • Lee, Myung-Eun;Kim, Soo-Hyung;Kim, Sun-Worl;Lim, Jun-Sik
    • The KIPS Transactions:PartB
    • /
    • v.17B no.4
    • /
    • pp.303-308
    • /
    • 2010
  • In this paper, we propose a medical image registration technique combining the gradient vector flow and modified conditional entropy. The registration is conducted by the use of a measure based on the entropy of conditional probabilities. To achieve the registration, we first define a modified conditional entropy (MCE) computed from the joint histograms for the area intensities of two given images. In order to combine the spatial information into a traditional registration measure, we use the gradient vector flow field. Then the MCE is computed from the gradient vector flow intensity (GVFI) combining the gradient information and their intensity values of original images. To evaluate the performance of the proposed registration method, we conduct experiments with our method as well as existing method based on the mutual information (MI) criteria. We evaluate the precision of MI- and MCE-based measurements by comparing the registration obtained from MR images and transformed CT images. The experimental results show that the proposed method is faster and more accurate than other optimization methods.

A hybrid inverse method for small scale parameter estimation of FG nanobeams

  • Darabi, A.;Vosoughi, Ali R.
    • Steel and Composite Structures
    • /
    • v.20 no.5
    • /
    • pp.1119-1131
    • /
    • 2016
  • As a first attempt, an inverse hybrid numerical method for small scale parameter estimation of functionally graded (FG) nanobeams using measured frequencies is presented. The governing equations are obtained with the Eringen's nonlocal elasticity assumptions and the first-order shear deformation theory (FSDT). The equations are discretized by using the differential quadrature method (DQM). The discretized equations are transferred from temporal domain to frequency domain and frequencies of the nanobeam are obtained. By applying random error to these frequencies, measured frequencies are generated. The measured frequencies are considered as input data and inversely, the small scale parameter of the beam is obtained by minimizing a defined functional. The functional is defined as root mean square error between the measured frequencies and calculated frequencies by the DQM. Then, the conjugate gradient (CG) optimization method is employed to minimize the functional and the small scale parameter is obtained. Efficiency, convergence and accuracy of the presented hybrid method for small scale parameter estimation of the beams for different applied random error, boundary conditions, length-to-thickness ratio and volume fraction coefficients are demonstrated.

Optimal Die Profile Design in Tube Drawing Process for Prevention of Material Fracture (파단방지를 위한 튜브인발공정 최적 금형형상 설계에 관한 연구)

  • Lee, Sang-Kon;Kim, Sang-Woo;Lee, Young-Seon;Lee, Jung-Hwan;Kim, Byung-Min
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.23 no.11 s.188
    • /
    • pp.78-84
    • /
    • 2006
  • The objective of this study is to design the optimal die profile that can prevent material fracture in the tube drawing process for automobile steering input shaft. First, the CDV(Critical Damage Value) of material is obtained by the compression test and FE-analysis. The occurrence of fracture is estimated by the FE-analysis considering the CDV. In order to achieve the objective of this study, optimization technique and FE-analysis are applied. FPS(Flexible Polyhedron Search) method, which is one of the non-gradient optimization techniques often used in engineering, is used to search optimal die profile. The drawing die profile is represented by Bezier-curve to generate all the possible die profile. Using FPS method and FE-analysis the optimal drawing die profile is determined. To verify tile effectiveness of the redesigned optimal die, the tube drawing experiment is performed. In the experimental result, it is possible to produce sound product without material fracture using the redesigned optimal die.

Application of machine learning and deep neural network for wave propagation in lung cancer cell

  • Xing, Lumin;Liu, Wenjian;Li, Xin;Wang, Han;Jiang, Zhiming;Wang, Lingling
    • Advances in nano research
    • /
    • v.13 no.3
    • /
    • pp.297-312
    • /
    • 2022
  • Coughing and breath shortness are common symptoms of nano (small) cell lung cancer. Smoking is main factor in causing such cancers. The cancer cells form on the soft tissues of lung. Deformation behavior and wave vibration of lung affected when cancer cells exist. Therefore, in the current work, phase velocity behavior of the small cell lung cancer as a main part of the body via an exact size-dependent theory is presented. Regarding this problem, displacement fields of small cell lung cancer are obtained using first-order shear deformation theory with five parameters. Besides, the size-dependent small cell lung cancer is modeled via nonlocal stress/strain gradient theory (NSGT). An analytical method is applied for solving the governing equations of the small cell lung cancer structure. The novelty of the current study is the consideration of the five-parameter of displacement for curved panel, and porosity as well as NSGT are employed and solved using the analytical method. For more verification, the outcomes of this reports are compared with the predictions of deep neural network (DNN) with adaptive optimization method. A thorough parametric investigation is conducted on the effect of NSGT parameters, porosity and geometry on the phase velocity behavior of the small cell lung cancer structure.

Evaluation of Human Demonstration Augmented Deep Reinforcement Learning Policies via Object Manipulation with an Anthropomorphic Robot Hand (휴먼형 로봇 손의 사물 조작 수행을 이용한 사람 데모 결합 강화학습 정책 성능 평가)

  • Park, Na Hyeon;Oh, Ji Heon;Ryu, Ga Hyun;Lopez, Patricio Rivera;Anazco, Edwin Valarezo;Kim, Tae Seong
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.5
    • /
    • pp.179-186
    • /
    • 2021
  • Manipulation of complex objects with an anthropomorphic robot hand like a human hand is a challenge in the human-centric environment. In order to train the anthropomorphic robot hand which has a high degree of freedom (DoF), human demonstration augmented deep reinforcement learning policy optimization methods have been proposed. In this work, we first demonstrate augmentation of human demonstration in deep reinforcement learning (DRL) is effective for object manipulation by comparing the performance of the augmentation-free Natural Policy Gradient (NPG) and Demonstration Augmented NPG (DA-NPG). Then three DRL policy optimization methods, namely NPG, Trust Region Policy Optimization (TRPO), and Proximal Policy Optimization (PPO), have been evaluated with DA (i.e., DA-NPG, DA-TRPO, and DA-PPO) and without DA by manipulating six objects such as apple, banana, bottle, light bulb, camera, and hammer. The results show that DA-NPG achieved the average success rate of 99.33% whereas NPG only achieved 60%. In addition, DA-NPG succeeded grasping all six objects while DA-TRPO and DA-PPO failed to grasp some objects and showed unstable performances.

THE SENSITIVITY OF STRUCTURAL RESPONSE USING FINITE ELEMENTS IN TIME

  • Park, Sungho;Kim, Seung-Jo
    • Journal of Theoretical and Applied Mechanics
    • /
    • v.3 no.1
    • /
    • pp.66-80
    • /
    • 2002
  • The bilinear formulation proposed earlier by Peters and Izadpanah to develop finite elements in time to solve undamped linear systems, Is extended (and found to be readily amenable) to develop time finite elements to obtain transient responses of both linear and nonlinear, and damped and undamped systems. The formulation Is used in the h-, p- and hp-versions. The resulting linear and nonlinear algebraic equations are differentiated to obtain the first- and second-order sensitivities of the transient response with respect to various system parameters. The present developments were tested on a series of linear and nonlinear examples and were found to yield, when compared with results obtained using other methods, excellent results for both the transient response and Its sensitivity to system parameters. Mostly. the results were obtained using the Legendre polynomials as basis functions, though. in some cases other orthogonal polynomials namely. the Hermite. the Chebyshev, and integrated Legendre polynomials were also employed (but to no great advantage). A key advantage of the time finite element method, and the one often overlooked in its past applications, is the ease In which the sensitivity of the transient response with respect to various system parameters can be obtained. The results of sensitivity analysis can be used for approximate schemes for efficient solution of design optimization problems. Also. the results can be applied to gradient-based parameter identification schemes.

  • PDF

Verification of Reduced Order Modeling based Uncertainty/Sensitivity Estimator (ROMUSE)

  • Khuwaileh, Bassam;Williams, Brian;Turinsky, Paul;Hartanto, Donny
    • Nuclear Engineering and Technology
    • /
    • v.51 no.4
    • /
    • pp.968-976
    • /
    • 2019
  • This paper presents a number of verification case studies for a recently developed sensitivity/uncertainty code package. The code package, ROMUSE (Reduced Order Modeling based Uncertainty/Sensitivity Estimator) is an effort to provide an analysis tool to be used in conjunction with reactor core simulators, in particular the Virtual Environment for Reactor Applications (VERA) core simulator. ROMUSE has been written in C++ and is currently capable of performing various types of parameter perturbations and associated sensitivity analysis, uncertainty quantification, surrogate model construction and subspace analysis. The current version 2.0 has the capability to interface with the Design Analysis Kit for Optimization and Terascale Applications (DAKOTA) code, which gives ROMUSE access to the various algorithms implemented within DAKOTA, most importantly model calibration. The verification study is performed via two basic problems and two reactor physics models. The first problem is used to verify the ROMUSE single physics gradient-based range finding algorithm capability using an abstract quadratic model. The second problem is the Brusselator problem, which is a coupled problem representative of multi-physics problems. This problem is used to test the capability of constructing surrogates via ROMUSE-DAKOTA. Finally, light water reactor pin cell and sodium-cooled fast reactor fuel assembly problems are simulated via SCALE 6.1 to test ROMUSE capability for uncertainty quantification and sensitivity analysis purposes.

Delay-Margin based Traffic Engineering for MPLS-DiffServ Networks

  • Ashour, Mohamed;Le-Ngoc, Tho
    • Journal of Communications and Networks
    • /
    • v.10 no.3
    • /
    • pp.351-361
    • /
    • 2008
  • This paper presents a delay-margin based traffic engineering (TE) approach to provide end-to-end quality of service (QoS) in multi-protocol label switching (MPLS) networks using differentiated services (DiffServ) at the link level. The TE, including delay, class, and route assignments, is formulated as a nonlinear optimization problem reflecting the inter-class and inter-link dependency introduced by DiffServ and end-to-end QoS requirements. Three algorithms are used to provide a solution to the problem: The first two, centralized offline route configuration and link-class delay assignment, operate in the convex areas of the feasible region to consecutively reduce the objective function using a per-link per-class decomposition of the objective function gradient. The third one is a heuristic that promotes/demotes connections at different links in order to deal with concave areas that may be produced by a trunk route usage of more than one class on a given link. Approximations of the three algorithms suitable for on-line distributed TE operation are also derived. Simulation is used to show that proposed approach can increase the number of users while maintaining end-to-end QoS requirements.