• 제목/요약/키워드: Adaptive Learning Algorithm

검색결과 391건 처리시간 0.025초

운반차-막대 시스템을 위한 적응비평학습에 의한 CMAC 제어계 (CMAC Controller with Adaptive Critic Learning for Cart-Pole System)

  • 권성규
    • 한국지능시스템학회논문지
    • /
    • 제10권5호
    • /
    • pp.466-477
    • /
    • 2000
  • 이 논문에서는 운반차-막대 시스템을 제어하기 위한 CMAC을 이용한 적응 학습 제어계를 개발하기 위하여, 적응비평학습을 이용하는 신경망 제어계에 관한 여러 연구 문헌들을 조사하고, ASE 요소를 이용하는 적응비평학습 기법을 CMAC을 바탕으로 하는 제어계에 통합하였다. 적응비평학습 기법을 CMAC에 구현하는데 있어서의 변환 문제를 검토하고, CMAC 제어계와 ASE 제어계가 운반차-막대 문제를 학습하는 속도를 비교하여, CMAC 제어계의 학습 속도가 빠르기는 하지만, 입력 공간의 더 넓은 영역에 대해서는 학습효과를 발휘하지 못하는 문제의 관점에서 적응비평학습 방법이 CMAC의 특성과 어울리는지를 고찰하였다.

  • PDF

Adaptive Fuzzy Neural Control of Unknown Nonlinear Systems Based on Rapid Learning Algorithm

  • Kim, Hye-Ryeong;Kim, Jae-Hun;Kim, Euntai;Park, Mignon
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 2003년도 추계 학술대회 학술발표 논문집
    • /
    • pp.95-98
    • /
    • 2003
  • In this paper, an adaptive fuzzy neural control of unknown nonlinear systems based on the rapid learning algorithm is proposed for optimal parameterization. We combine the advantages of fuzzy control and neural network techniques to develop an adaptive fuzzy control system for updating nonlinear parameters of controller. The Fuzzy Neural Network(FNN), which is constructed by an equivalent four-layer connectionist network, is able to learn to control a process by updating the membership functions. The free parameters of the AFN controller are adjusted on-line according to the control law and adaptive law for the purpose of controlling the plant track a given trajectory and it's initial values are off-line preprocessing, In order to improve the convergence of the learning process, we propose a rapid learning algorithm which combines the error back-propagation algorithm with Aitken's $\delta$$\^$2/ algorithm. The heart of this approach ls to reduce the computational burden during the FNN learning process and to improve convergence speed. The simulation results for nonlinear plant demonstrate the control effectiveness of the proposed system for optimal parameterization.

  • PDF

A3C 기반의 강화학습을 사용한 DASH 시스템 (A DASH System Using the A3C-based Deep Reinforcement Learning)

  • 최민제;임경식
    • 대한임베디드공학회논문지
    • /
    • 제17권5호
    • /
    • pp.297-307
    • /
    • 2022
  • The simple procedural segment selection algorithm commonly used in Dynamic Adaptive Streaming over HTTP (DASH) reveals severe weakness to provide high-quality streaming services in the integrated mobile networks of various wired and wireless links. A major issue could be how to properly cope with dynamically changing underlying network conditions. The key to meet it should be to make the segment selection algorithm much more adaptive to fluctuation of network traffics. This paper presents a system architecture that replaces the existing procedural segment selection algorithm with a deep reinforcement learning algorithm based on the Asynchronous Advantage Actor-Critic (A3C). The distributed A3C-based deep learning server is designed and implemented to allow multiple clients in different network conditions to stream videos simultaneously, collect learning data quickly, and learn asynchronously, resulting in greatly improved learning speed as the number of video clients increases. The performance analysis shows that the proposed algorithm outperforms both the conventional DASH algorithm and the Deep Q-Network algorithm in terms of the user's quality of experience and the speed of deep learning.

Active Random Noise Control using Adaptive Learning Rate Neural Networks

  • Sasaki, Minoru;Kuribayashi, Takumi;Ito, Satoshi
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2005년도 ICCAS
    • /
    • pp.941-946
    • /
    • 2005
  • In this paper an active random noise control using adaptive learning rate neural networks is presented. The adaptive learning rate strategy increases the learning rate by a small constant if the current partial derivative of the objective function with respect to the weight and the exponential average of the previous derivatives have the same sign, otherwise the learning rate is decreased by a proportion of its value. The use of an adaptive learning rate attempts to keep the learning step size as large as possible without leading to oscillation. It is expected that a cost function minimize rapidly and training time is decreased. Numerical simulations and experiments of active random noise control with the transfer function of the error path will be performed, to validate the convergence properties of the adaptive learning rate Neural Networks. Control results show that adaptive learning rate Neural Networks control structure can outperform linear controllers and conventional neural network controller for the active random noise control.

  • PDF

Adaptive Learning Path Recommendation based on Graph Theory and an Improved Immune Algorithm

  • BIAN, Cun-Ling;WANG, De-Liang;LIU, Shi-Yu;LU, Wei-Gang;DONG, Jun-Yu
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권5호
    • /
    • pp.2277-2298
    • /
    • 2019
  • Adaptive learning in e-learning has garnered researchers' interest. In it, learning resources could be recommended automatically to achieve a personalized learning experience. There are various ways to realize it. One of the realistic ways is adaptive learning path recommendation, in which learning resources are provided according to learners' requirements. This paper summarizes existing works and proposes an innovative approach. Firstly, a learner-centred concept map is created using graph theory based on the features of the learners and concepts. Then, the approach generates a linear concept sequence from the concept map using the proposed traversal algorithm. Finally, Learning Objects (LOs), which are the smallest concrete units that make up a learning path, are organized based on the concept sequences. In order to realize this step, we model it as a multi-objective combinatorial optimization problem, and an improved immune algorithm (IIA) is proposed to solve it. In the experimental stage, a series of simulated experiments are conducted on nine datasets with different levels of complexity. The results show that the proposed algorithm increases the computational efficiency and effectiveness. Moreover, an empirical study is carried out to validate the proposed approach from a pedagogical view. Compared with a self-selection based approach and the other evolutionary algorithm based approaches, the proposed approach produces better outcomes in terms of learners' homework, final exam grades and satisfaction.

다중 학습 알고리듬을 이용한 평면형 병렬 매니퓰레이터의 Fuzzy 논리 제어 (Fuzzy logic control of a planar parallel manipulator using multi learning algorithm)

  • 송낙윤;조황
    • 제어로봇시스템학회논문지
    • /
    • 제5권8호
    • /
    • pp.914-922
    • /
    • 1999
  • A study on the improvement of tracking performance of a 3 DOF planar parallel manipulator is performed. A class of adaptive tracking control sheme is designed using self tuning adaptive fuzzy logic control theory. This control sheme is composed of three classical PD controller and a multi learning type self tuning adaptive fuzzy logic controller set. PD controller is tuned roughly by manual setting a priori and fuzzy logic controller is tuned precisely by the gradient descent method for a global solution during run-time, so the proposed control scheme is tuned more rapidly and precisely than the single learning type self tuning adaptive fuzzy logic control sheme for a local solution. The control performance of the proposed algorithm is verified through experiments.

  • PDF

적응 뉴럴 컴퓨팅 방법을 이용한 동적 시스템의 특성 모델링 (Characteristics Modeling of Dynamic Systems Using Adaptive Neural Computation)

  • 김병호
    • 제어로봇시스템학회논문지
    • /
    • 제13권4호
    • /
    • pp.309-314
    • /
    • 2007
  • This paper presents an adaptive neural computation algorithm for multi-layered neural networks which are applied to identify the characteristic function of dynamic systems. The main feature of the proposed algorithm is that the initial learning rate for the employed neural network is assigned systematically, and also the assigned learning rate can be adjusted empirically for effective neural leaning. By employing the approach, enhanced modeling of dynamic systems is possible. The effectiveness of this approach is veri tied by simulations.

유전 알고리즘과 시뮬레이티드 어닐링이 적용된 적응 랜덤 신호 기반 학습에 관한 연구 (A Study on Adaptive Random Signal-Based Learning Employing Genetic Algorithms and Simulated Annealing)

  • 한창욱;박정일
    • 제어로봇시스템학회논문지
    • /
    • 제7권10호
    • /
    • pp.819-826
    • /
    • 2001
  • Genetic algorithms are becoming more popular because of their relative simplicity and robustness. Genetic algorithms are global search techniques for nonlinear optimization. However, traditional genetic algorithms, though robust, are generally not the most successful optimization algorithm on any particular domain because they are poor at hill-climbing, whereas simulated annealing has the ability of probabilistic hill-climbing. Therefore, hybridizing a genetic algorithm with other algorithms can produce better performance than using the genetic algorithm or other algorithms independently. In this paper, we propose an efficient hybrid optimization algorithm named the adaptive random signal-based learning. Random signal-based learning is similar to the reinforcement learning of neural networks. This paper describes the application of genetic algorithms and simulated annealing to a random signal-based learning in order to generate the parameters and reinforcement signal of the random signal-based learning, respectively. The validity of the proposed algorithm is confirmed by applying it to two different examples.

  • PDF

대규모 광학적 구현을 위한 TAG 신경회로망 모델 (TAG neural network model for large-sized optical implementation)

  • 이혁재
    • 한국광학회:학술대회논문집
    • /
    • 한국광학회 1991년도 제6회 파동 및 레이저 학술발표회 Prodeedings of 6th Conference on Waves and Lasers
    • /
    • pp.35-40
    • /
    • 1991
  • In this paper, a new adaptive learning algorithm, Training by Adaptive Gain (TAG) for optical implementation of large-sized neural networks has been developed and its electro-optical implementation for 2-dimensional input and output neurons has been demostrated. The 4-dimensional global fixed interconnections and 2-dimensional adaptive gain-controls are implemented by multi-facet computer generated holograms and LCTV spatial light modulators, respectively. When the input signals pass through optical system to the output classifying layer, the TAG adaptive learning algorithm is implemented by a personal computer. The system classifies three 5$\times$5 input patterns correctly.

  • PDF

DOA 기반 학습률 조절을 이용한 다채널 음성개선 알고리즘 (Multi-Channel Speech Enhancement Algorithm Using DOA-based Learning Rate Control)

  • 김수환;이영재;김영일;정상배
    • 말소리와 음성과학
    • /
    • 제3권3호
    • /
    • pp.91-98
    • /
    • 2011
  • In this paper, a multi-channel speech enhancement method using the linearly constrained minimum variance (LCMV) algorithm and a variable learning rate control is proposed. To control the learning rate for adaptive filters of the LCMV algorithm, the direction of arrival (DOA) is measured for each short-time input signal and the likelihood function of the target speech presence is estimated to control the filter learning rate. Using the likelihood measure, the learning rate is increased during the pure noise interval and decreased during the target speech interval. To optimize the parameter of the mapping function between the likelihood value and the corresponding learning rate, an exhaustive search is performed using the Bark's scale distortion (BSD) as the performance index. Experimental results show that the proposed algorithm outperforms the conventional LCMV with fixed learning rate in the BSD by around 1.5 dB.

  • PDF