• 제목/요약/키워드: Learning Control Algorithm

검색결과 944건 처리시간 0.035초

유전자 알고리즘과 학습제어를 이용한 이족보행 로봇의 지능 제어기 구현 (Implementation of an Intelligent Controller for Biped Walking Robot using Genetic Algorithm and Learning Control)

  • 고재원;임동철
    • 전기학회논문지P
    • /
    • 제55권2호
    • /
    • pp.83-88
    • /
    • 2006
  • This paper proposes a method that minimizes the consumed energy by searching the optimal locations of the mass centers of the biped robot's links using Genetic Algorithm. This paper presents a learning controller for repetitive gait control of the biped robot. The learning control scheme consists of a feedforward learning nile and linear feedback control input for stabilization of learning system. The feasibility of learning control to the biped robotic motion is shown via computer simulation and experimental results with 24 DOF biped walking robot.

Deep Q-Network를 이용한 준능동 제어알고리즘 개발 (Development of Semi-Active Control Algorithm Using Deep Q-Network)

  • 김현수;강주원
    • 한국공간구조학회논문집
    • /
    • 제21권1호
    • /
    • pp.79-86
    • /
    • 2021
  • Control performance of a smart tuned mass damper (TMD) mainly depends on control algorithms. A lot of control strategies have been proposed for semi-active control devices. Recently, machine learning begins to be applied to development of vibration control algorithm. In this study, a reinforcement learning among machine learning techniques was employed to develop a semi-active control algorithm for a smart TMD. The smart TMD was composed of magnetorheological damper in this study. For this purpose, an 11-story building structure with a smart TMD was selected to construct a reinforcement learning environment. A time history analysis of the example structure subject to earthquake excitation was conducted in the reinforcement learning procedure. Deep Q-network (DQN) among various reinforcement learning algorithms was used to make a learning agent. The command voltage sent to the MR damper is determined by the action produced by the DQN. Parametric studies on hyper-parameters of DQN were performed by numerical simulations. After appropriate training iteration of the DQN model with proper hyper-parameters, the DQN model for control of seismic responses of the example structure with smart TMD was developed. The developed DQN model can effectively control smart TMD to reduce seismic responses of the example structure.

Adaptive Fuzzy Neural Control of Unknown Nonlinear Systems Based on Rapid Learning Algorithm

  • Kim, Hye-Ryeong;Kim, Jae-Hun;Kim, Euntai;Park, Mignon
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 2003년도 추계 학술대회 학술발표 논문집
    • /
    • pp.95-98
    • /
    • 2003
  • In this paper, an adaptive fuzzy neural control of unknown nonlinear systems based on the rapid learning algorithm is proposed for optimal parameterization. We combine the advantages of fuzzy control and neural network techniques to develop an adaptive fuzzy control system for updating nonlinear parameters of controller. The Fuzzy Neural Network(FNN), which is constructed by an equivalent four-layer connectionist network, is able to learn to control a process by updating the membership functions. The free parameters of the AFN controller are adjusted on-line according to the control law and adaptive law for the purpose of controlling the plant track a given trajectory and it's initial values are off-line preprocessing, In order to improve the convergence of the learning process, we propose a rapid learning algorithm which combines the error back-propagation algorithm with Aitken's $\delta$$\^$2/ algorithm. The heart of this approach ls to reduce the computational burden during the FNN learning process and to improve convergence speed. The simulation results for nonlinear plant demonstrate the control effectiveness of the proposed system for optimal parameterization.

  • PDF

선삭에서 비원형 단면 가공을 위한 제어 연구 (A Learning Control Algorithm for Noncircular Cutting with Lathe)

  • 이재규;오창진;김옥현
    • 한국정밀공학회지
    • /
    • 제12권6호
    • /
    • pp.96-104
    • /
    • 1995
  • A study for a lathe to machine workpiece with noncircular cross-section is presented. The noncircular cutting is accomplished by controlling radial tool position synchronized with revolution angle of the spindle according to the desired cross-sectional shape. A learning control algorithm is suggested for the tool positioning. The learning law of the algorithm is based on pole-zero cancellation, which guarantees the control stability. The control performances are analyzed and simulated on a numerical computer that the effectiveness of the control algorithm is convinced. The algorithm is tested on a conventional NC-lathe which shows some successful results.

  • PDF

Fault-tolerant control system for once-through steam generator based on reinforcement learning algorithm

  • Li, Cheng;Yu, Ren;Yu, Wenmin;Wang, Tianshu
    • Nuclear Engineering and Technology
    • /
    • 제54권9호
    • /
    • pp.3283-3292
    • /
    • 2022
  • Based on the Deep Q-Network(DQN) algorithm of reinforcement learning, an active fault-tolerance method with incremental action is proposed for the control system with sensor faults of the once-through steam generator(OTSG). In this paper, we first establish the OTSG model as the interaction environment for the agent of reinforcement learning. The reinforcement learning agent chooses an action according to the system state obtained by the pressure sensor, the incremental action can gradually approach the optimal strategy for the current fault, and then the agent updates the network by different rewards obtained in the interaction process. In this way, we can transform the active fault tolerant control process of the OTSG to the reinforcement learning agent's decision-making process. The comparison experiments compared with the traditional reinforcement learning algorithm(RL) with fixed strategies show that the active fault-tolerant controller designed in this paper can accurately and rapidly control under sensor faults so that the pressure of the OTSG can be stabilized near the set-point value, and the OTSG can run normally and stably.

비선형 시스템에 적용가능한 피드백 사용형 2차 반복 학습제어 알고리즘 (A Second-Order Iterative Learning Algorithm with Feedback Applicable to Nonlinear Systems)

  • 허경무;우광준
    • 제어로봇시스템학회논문지
    • /
    • 제4권5호
    • /
    • pp.608-615
    • /
    • 1998
  • In this paper a second-order iterative learning control algorithm with feedback is proposed for the trajectory-tracking control of nonlinear dynamic systems with unidentified parameters. In contrast to other known methods, the proposed teaming control scheme utilize more than one past error history contained in the trajectories generated at prior iterations, and a feedback term is added in the learning control scheme for the enhancement of convergence speed and robustness to disturbances or system parameter variations. The convergence proof of the proposed algorithm is given in detail, and the sufficient condition for the convergence of the algorithm is provided. We also discuss the convergence performance of the algorithm when the initial condition at the beginning of each iteration differs from the previous value of the initial condition. The effectiveness of the proposed algorithm is shown by computer simulation result. It is shown that, by adding a feedback term in teaming control algorithm, convergence speed, robustness to disturbances and robustness to unmatched initial conditions can be improved.

  • PDF

Discrete-time learning control for robotic manipulators

  • Suzuki, Tatsuya;Yasue, Masanori;Okuma, Shigeru;Uchikawa, Yoshiki
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1989년도 한국자동제어학술회의논문집; Seoul, Korea; 27-28 Oct. 1989
    • /
    • pp.1069-1074
    • /
    • 1989
  • A discrete-time learning control for robotic manipulators is studied using its pulse transfer function. Firstly, discrete-time learning stability condition which is applicable to single-input two-outputs systems is derived. Secondly, stability of learning algorithm with position signal is studied. In this case, when sampling period is small, the algorithm is not stable because of an unstable zero of the system. Thirdly, stability of algorithm with position and velocity signals is studied. In this case, we can stabilize the learning control system which is unstable in learning with only position signal. Finally, simulation results on the trajectory control of robotic manipulators using the discrete-time learning control are shown. This simulation results agree well with the analytical ones.

  • PDF

최적의 퍼지제어규칙을 얻기위한 퍼지학습법 (A Learning Algorithm for Optimal Fuzzy Control Rules)

  • 정병묵
    • 대한기계학회논문집A
    • /
    • 제20권2호
    • /
    • pp.399-407
    • /
    • 1996
  • A fuzzy learning algorithm to get the optimal fuzzy rules is presented in this paper. The algorithm introduces a reference model to generate a desired output and a performance index funtion instead of the performance index table. The performance index funtion is a cost function based on the error and error-rate between the reference and plant output. The cost function is minimized by a gradient method and the control input is also updated. In this case, the control rules which generate the desired response can be obtained by changing the portion of the error-rate in the cost funtion. In SISO(Single-Input Single- Output)plant, only by the learning delay, it is possible to experss the plant model and to get the desired control rules. In the long run, this algorithm gives us the good control rules with a minimal amount of prior informaiton about the environment.

Optimal Learning of Neo-Fuzzy Structure Using Bacteria Foraging Optimization

  • Kim, Dong-Hwa
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2005년도 ICCAS
    • /
    • pp.1716-1722
    • /
    • 2005
  • Fuzzy logic, neural network, fuzzy-neural network play an important as the key technology of linguistic modeling for intelligent control and decision in complex systems. The fuzzy-neural network (FNN) learning represents one of the most effective algorithms to build such linguistic models. This paper proposes bacteria foraging algorithm based optimal learning fuzzy-neural network (BA-FNN). The proposed learning scheme is the fuzzy-neural network structure which can handle linguistic knowledge as tuning membership function of fuzzy logic by bacteria foraging algorithm. The learning algorithm of the BA-FNN is composed of two phases. The first phase is to find the initial membership functions of the fuzzy neural network model. In the second phase, bacteria foraging algorithm is used for tuning of membership functions of the proposed model.

  • PDF

지도학습과 강화학습을 이용한 준능동 중간층면진시스템의 최적설계 (Optimal Design of Semi-Active Mid-Story Isolation System using Supervised Learning and Reinforcement Learning)

  • 강주원;김현수
    • 한국공간구조학회논문집
    • /
    • 제21권4호
    • /
    • pp.73-80
    • /
    • 2021
  • A mid-story isolation system was proposed for seismic response reduction of high-rise buildings and presented good control performance. Control performance of a mid-story isolation system was enhanced by introducing semi-active control devices into isolation systems. Seismic response reduction capacity of a semi-active mid-story isolation system mainly depends on effect of control algorithm. AI(Artificial Intelligence)-based control algorithm was developed for control of a semi-active mid-story isolation system in this study. For this research, an practical structure of Shiodome Sumitomo building in Japan which has a mid-story isolation system was used as an example structure. An MR (magnetorheological) damper was used to make a semi-active mid-story isolation system in example model. In numerical simulation, seismic response prediction model was generated by one of supervised learning model, i.e. an RNN (Recurrent Neural Network). Deep Q-network (DQN) out of reinforcement learning algorithms was employed to develop control algorithm The numerical simulation results presented that the DQN algorithm can effectively control a semi-active mid-story isolation system resulting in successful reduction of seismic responses.