• Title/Summary/Keyword: Learning Control Algorithm

Search Result 947, Processing Time 0.033 seconds

Gain Tuning for SMCSPO of Robot Arm with Q-Learning (Q-Learning을 사용한 로봇팔의 SMCSPO 게인 튜닝)

  • Lee, JinHyeok;Kim, JaeHyung;Lee, MinCheol
    • The Journal of Korea Robotics Society
    • /
    • v.17 no.2
    • /
    • pp.221-229
    • /
    • 2022
  • Sliding mode control (SMC) is a robust control method to control a robot arm with nonlinear properties. A high switching gain of SMC causes chattering problems, although the SMC allows the adequate control performance by giving high switching gain, without the exact robot model containing nonlinear and uncertainty terms. In order to solve this problem, SMC with sliding perturbation observer (SMCSPO) has been researched, where the method can reduce the chattering by compensating the perturbation, which is estimated by the observer, and then choosing a lower switching control gain of SMC. However, optimal gain tuning is necessary to get a better tracking performance and reducing a chattering. This paper proposes a method that the Q-learning automatically tunes the control gains of SMCSPO with an iterative operation. In this tuning method, the rewards of reinforcement learning (RL) are set minus tracking errors of states, and the action of RL is a change of control gain to maximize rewards whenever the iteration number of movements increases. The simple motion test for a 7-DOF robot arm was simulated in MATLAB program to prove this RL tuning algorithm. The simulation showed that this method can automatically tune the control gains for SMCSPO.

Adaptive fuzzy learning control for a class of second order nonlinear dynamic systems

  • Park, B.H.;Lee, Jin S.
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1996.10a
    • /
    • pp.103-106
    • /
    • 1996
  • This paper presents an iterative fuzzy learning control scheme which is applicable to a broad class of nonlinear systems. The control scheme achieves system stability and boundedness by using the linear feedback plus adaptive fuzzy controller and achieves precise tracking by using the iterative learning rules. The switching mode control unit is added to the adaptive fuzzy controller in order to compensate for the error that has been inevitably introduced from the fuzzy approximation of the nonlinear part. It also obviates any supervisory control action in the adaptive fuzzy controller which normally requires high gain signal. The learning control algorithm obviates any output derivative terms which are vulnerable to noise.

  • PDF

Control for Multi-variable in Crane System using Fuzzy Learning Method (퍼지학습법을 이용한 크레인 시스템의 다변수 제어)

  • Lim, Yoon-Kyu;Chung, Byeong-Mook
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.16 no.7
    • /
    • pp.144-150
    • /
    • 1999
  • n active control for the swing of crane systems is very important for increasing the productivity. This article introduces the control for the position and the swing of a crane using the fuzzy learning method. Because the crane is a multi-variable system, learning is done to control both position and swing of the crane. Also the fuzzy control rules are separately acquired with the loading and unloading situation of the crane for more accurate control. The result of simulations shows that the crane is just controlled for a very large swing angle of 1 radian within nearly one cycle.

  • PDF

PID Type Iterative Learning Control with Optimal Gains

  • Madady, Ali
    • International Journal of Control, Automation, and Systems
    • /
    • v.6 no.2
    • /
    • pp.194-203
    • /
    • 2008
  • Iterative learning control (ILC) is a simple and effective method for the control of systems that perform the same task repetitively. ILC algorithm uses the repetitiveness of the task to track the desired trajectory. In this paper, we propose a PID (proportional plus integral and derivative) type ILC update law for control discrete-time single input single-output (SISO) linear time-invariant (LTI) systems, performing repetitive tasks. In this approach, the input of controlled system in current cycle is modified by applying the PID strategy on the error achieved between the system output and the desired trajectory in a last previous iteration. The convergence of the presented scheme is analyzed and its convergence condition is obtained in terms of the PID coefficients. An optimal design method is proposed to determine the PID coefficients. It is also shown that under some given conditions, this optimal iterative learning controller can guarantee the monotonic convergence. An illustrative example is given to demonstrate the effectiveness of the proposed technique.

Dynamic System Identification Using a Recurrent Compensatory Fuzzy Neural Network

  • Lee, Chi-Yung;Lin, Cheng-Jian;Chen, Cheng-Hung;Chang, Chun-Lung
    • International Journal of Control, Automation, and Systems
    • /
    • v.6 no.5
    • /
    • pp.755-766
    • /
    • 2008
  • This study presents a recurrent compensatory fuzzy neural network (RCFNN) for dynamic system identification. The proposed RCFNN uses a compensatory fuzzy reasoning method, and has feedback connections added to the rule layer of the RCFNN. The compensatory fuzzy reasoning method can make the fuzzy logic system more effective, and the additional feedback connections can solve temporal problems as well. Moreover, an online learning algorithm is demonstrated to automatically construct the RCFNN. The RCFNN initially contains no rules. The rules are created and adapted as online learning proceeds via simultaneous structure and parameter learning. Structure learning is based on the measure of degree and parameter learning is based on the gradient descent algorithm. The simulation results from identifying dynamic systems demonstrate that the convergence speed of the proposed method exceeds that of conventional methods. Moreover, the number of adjustable parameters of the proposed method is less than the other recurrent methods.

Dynamic GBFCM(Gradient Based FCM) Algorithm (동적 GBFCM(Gradient Based FCM) 알고리즘)

  • Kim, Myoung-Ho;Park, Dong-C.
    • Proceedings of the KIEE Conference
    • /
    • 1996.07b
    • /
    • pp.1371-1373
    • /
    • 1996
  • A clustering algorithms with dynamic adjustment of learning rate for GBFCM(Gradient Based FCM) is proposed in this paper. This algorithm combines two idea of dynamic K-means algorithms and GBFCM : learning rate variation with entropy concept and continuous membership grade. To evaluate dynamic GBFCM, we made comparisons with Kohonen's Self-Organizing Map over several tutorial examples and image compression. The results show that DGBFCM(Dynamic GBFCM) gives superior performance over Kohonen's algorithm in terms of signal-to-noise.

  • PDF

A Voice Controlled Service Robot Using Support Vector Machine

  • Kim, Seong-Rock;Park, Jae-Suk;Park, Ju-Hyun;Lee, Suk-Gyu
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2004.08a
    • /
    • pp.1413-1415
    • /
    • 2004
  • This paper proposes a SVM(Support Vector Machine) training algorithm to control a service robot with voice command. The service robot with a stereo vision system and dual manipulators of four degrees of freedom implements a User-Dependent Voice Control System. The training of SVM algorithm that is one of the statistical learning theories leads to a QP(quadratic programming) problem. In this paper, we present an efficient SVM speech recognition scheme especially based on less learning data comparing with conventional approaches. SVM discriminator decides rejection or acceptance of user's extracted voice features by the MFCC(Mel Frequency Cepstrum Coefficient). Among several SVM kernels, the exponential RBF function gives the best classification and the accurate user recognition. The numerical simulation and the experiment verified the usefulness of the proposed algorithm.

  • PDF

Machine learning approaches for wind speed forecasting using long-term monitoring data: a comparative study

  • Ye, X.W.;Ding, Y.;Wan, H.P.
    • Smart Structures and Systems
    • /
    • v.24 no.6
    • /
    • pp.733-744
    • /
    • 2019
  • Wind speed forecasting is critical for a variety of engineering tasks, such as wind energy harvesting, scheduling of a wind power system, and dynamic control of structures (e.g., wind turbine, bridge, and building). Wind speed, which has characteristics of random, nonlinear and uncertainty, is difficult to forecast. Nowadays, machine learning approaches (generalized regression neural network (GRNN), back propagation neural network (BPNN), and extreme learning machine (ELM)) are widely used for wind speed forecasting. In this study, two schemes are proposed to improve the forecasting performance of machine learning approaches. One is that optimization algorithms, i.e., cross validation (CV), genetic algorithm (GA), and particle swarm optimization (PSO), are used to automatically find the optimal model parameters. The other is that the combination of different machine learning methods is proposed by finite mixture (FM) method. Specifically, CV-GRNN, GA-BPNN, PSO-ELM belong to optimization algorithm-assisted machine learning approaches, and FM is a hybrid machine learning approach consisting of GRNN, BPNN, and ELM. The effectiveness of these machine learning methods in wind speed forecasting are fully investigated by one-year field monitoring data, and their performance is comprehensively compared.

Stable Tracking Control to a Non-linear Process Via Neural Network Model

  • Zhai, Yujia
    • Journal of the Korea Convergence Society
    • /
    • v.5 no.4
    • /
    • pp.163-169
    • /
    • 2014
  • A stable neural network control scheme for unknown non-linear systems is developed in this paper. While the control variable is optimised to minimize the performance index, convergence of the index is guaranteed asymptotically stable by a Lyapnov control law. The optimization is achieved using a gradient descent searching algorithm and is consequently slow. A fast convergence algorithm using an adaptive learning rate is employed to speed up the convergence. Application of the stable control to a single input single output (SISO) non-linear system is simulated. The satisfactory control performance is obtained.

Fuzzy Learning Control for Ball & Beam System (볼과 빔 시스템의 퍼지 학습 제어)

  • Joo, Hae-Ho;Jung, Byung-Mook;Lee, Jae-Won;Lee, Hwa-Jo;Lee, Young
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 1996.11a
    • /
    • pp.439-443
    • /
    • 1996
  • A fuzzy teaming controller is experimentally designed to control the ball k beam system in this paper. Although most fuzzy controllers have been built just to emulate human decision-making behavior, it is necessary to construct the rule bases by using a learning method with self-improvement when it is difficult or impossible to get them only by expert's experience. The algorithm introduces a reference model to generate a desired output and minimizes a performance index function based on the error and error-rate using the gradient-decent method. In our balancing experiment of the ball & beam system, this paper shows that the fuzzy control rules by learning are superior to the expert's experience.

  • PDF