• 제목/요약/키워드: Decentralized Learning

검색결과 44건 처리시간 0.018초

A study on Indirect Adaptive Decentralized Learning Control of the Vertical Multiple Dynamic System

  • Lee, Soo-Cheol;Park, Seok-Sun;Lee, Jeh-Won
    • International Journal of Precision Engineering and Manufacturing
    • /
    • 제7권1호
    • /
    • pp.62-66
    • /
    • 2006
  • The learning control develops controllers that learn to improve their performance at executing a given task, based on experience performing this specific task. In a previous work, the authors presented an iterative precision of linear decentralized learning control based on p-integrated learning method for the vertical dynamic multiple systems. This paper develops an indirect decentralized learning control based on adaptive control method. The original motivation of the learning control field was learning in robots doing repetitive tasks such as an assembly line works. This paper starts with decentralized discrete time systems, and progresses to the robot application, modeling the robot as a time varying linear system in the neighborhood of the nominal trajectory, and using the usual robot controllers that are decentralized, treating each link as if it is independent of any coupling with other links. Some techniques will show up in the numerical simulation for vertical dynamic robot. The methods of learning system are shown for the iterative precision of each link.

Indirect Decentralized Repetitive Control for the Multiple Dynamic Subsystems

  • Lee, Soo-Cheol
    • 대한산업공학회지
    • /
    • 제23권1호
    • /
    • pp.1-22
    • /
    • 1997
  • Learning control refers to controllers that learn to improve their performance at executing a given task, based on experience performing this specific task. In a previous work, the authors presented a theory of indirect decentralized learning control based on use of indirect adaptive control concepts employing simultaneous identification and control. This paper extends these results to apply to the indirect repetitive control problem in which a periodic (i.e., repetitive) command is given to a control system. Decentralized indirect repetitive control algorithms are presented that have guaranteed convergence to zero tracking error under very general conditions. The original motivation of the repetitive control and learning control fields was learning in robots doing repetitive tasks such as on an assembly line. This paper starts with decentralized discrete time systems, and progresses to the robot application, modeling the robot as a time varying linear system in the neighborhood of the desired trajectory. Decentralized repetitive control is natural for this application because the feedback control for link rotations is normally implemented in a decentralized manner, treating each link as if it is independent of the other links.

  • PDF

수직다물체시스템의 간접적응형 분산학습제어에 관한 연구 (A Study on Indirect Adaptive Decentralized Learning Control of the Vertical Multiple Dynamic System)

  • 이수철;박석순;이재원
    • 한국정밀공학회지
    • /
    • 제22권4호
    • /
    • pp.92-98
    • /
    • 2005
  • The learning control develops controllers that learn to improve their performance at executing a given task, based on experience performing this specific task. In a previous work, the authors presented an iterative precision of linear decentralized learning control based on p-integrated learning method for the vertical dynamic multiple systems. This paper develops an indirect decentralized teaming control based on adaptive control method. The original motivation of the teaming control field was loaming in robots doing repetitive tasks such as on an assembly line. This paper starts with decentralized discrete time systems, and progresses to the robot application, modeling the robot as a time varying linear system in the neighborhood of the nominal trajectory, and using the usual robot controllers that are decentralized, treating each link as if it is independent of any coupling with other links. Some techniques will show up in the numerical simulation for vertical dynamic robot. The methods of learning system are shown up for the iterative precision of each link.

복합시스템을 위한 간접분산학습제어 (Indirect Decentralized Learning Control for the Multiple Systems)

  • Lee, Soo-Cheol
    • 한국정보시스템학회:학술대회논문집
    • /
    • 한국정보시스템학회 1996년도 추계학술발표회 발표논문집
    • /
    • pp.217-227
    • /
    • 1996
  • The new field of learning control develops controllers that learn to improve their performance at executing a given task, based on experience performin this specific task. In a previous work[6], the authors presented a theory of indirect learning control based on use of indirect adaptive control concepts employing simultaneous identification ad control. This paper develops improved indirect learning control algorithms, and studies the use of such controllers in decentralized systems. The original motivation of the learning control field was learning in robots doing repetitive tasks such as on an assembly line. This paper starts with decentralized discrete time systems, and progresses to the robot application, modeling the robot as a time varying linear system in the neighborhood of the nominal trajectory, and using the usual robot controllers that are decentralized, treating each link as if it is independent of any coupling with other links. The basic result of the paper is to show that stability of the indirect learning controllers for all subsystems when the coupling between subsystems is turned off, assures convergence to zero tracking error of the decentralized indirect learning control of the coupled system, provided that the sample time in the digital learning controller is sufficiently short.

  • PDF

Decentralized learning automata for control of unknown markov chains

  • Hara, Motoshi;Abe, Kenichi
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1990년도 한국자동제어학술회의논문집(국제학술편); KOEX, Seoul; 26-27 Oct. 1990
    • /
    • pp.1234-1239
    • /
    • 1990
  • In this paper, we propose a new type of decentralized learning automata for the control finite state Markov chains with unknown transition probabilities and rewards. In our scheme a .betha.-type learning automaton is associated with each state in which two or more actions(desisions) are available. In this decentralized learning automata system, each learning automaton operates, requiring only local information, to improve its performance under local environment. From simulation results, it is shown that the decentralized learning automata will converge to the optimal policy that produces the most highly total expected reward with discounting in all initiall states.

  • PDF

Linear decentralized learning control for the robot moving on the horizontal plane

  • Lee, Soo-Cheol
    • 한국경영과학회:학술대회논문집
    • /
    • 대한산업공학회/한국경영과학회 1995년도 춘계공동학술대회논문집; 전남대학교; 28-29 Apr. 1995
    • /
    • pp.869-879
    • /
    • 1995
  • The new field of learning control develops controllers that learn to improve their performance at executing a given task, based on experience performing this task. The simplest forms of learning control are based on the same concept as integral control, but operating in the domain of the repetitions of the task. In the previous paper, I had studied the use of such controllers in a decentralized system, such as a robot with the controller for each link acting independently. The basic result of the paper is to show that stability of the learning controllers for all subsystems when the coupling between subsystems is turned off, assures stability of the decentralized learning in the coupled system, provided that the sample time in the digital learning controller is sufficiently short. In this paper, we present two examples. The first illustrates the effect of coupling between subsystems in the system dynamics, and the second studies the application of decentralized learning control to robot problems. The latter example illustrates the application of decentralized learning control to nonlinear systems, and also studies the effect of the coupling between subsystems introduced in the input matrix by the discretization of the system equations. The conclusion is that for sufficiently small learning gain, and sufficiently small sample time, the simple learning control law based on integral control applied to each robot axis will produce zero tracking error in spite o the dynamic coupling in the robot equations. Of course, the results of this paper have much more general application than just to the robotics tracking problem. Convergence in decentralized systems is seen to depend only on the input and output matrices, provided the sample time is suffiently small.

  • PDF

연합학습의 인센티브 플랫폼으로써 이더리움 스마트 컨트랙트를 시행하는 경우의 실무적 고려사항 (Practical Concerns in Enforcing Ethereum Smart Contracts as a Rewarding Platform in Decentralized Learning)

  • ;;장설아;이경현
    • 정보처리학회논문지:컴퓨터 및 통신 시스템
    • /
    • 제9권12호
    • /
    • pp.321-332
    • /
    • 2020
  • 탈중앙화 접근법은 기존 시스템의 데이터 프라이버시 결함을 보완하기 위해 산·학계에서 폭넓게 연구되고 있다. 블록체인은 기록된 데이터는 위조할 수 없으며 합의를 기반으로 의사결정을 이루고 전반적인 거래의 비용은 저렴한 특징을 가지고 있다. 연합학습은 데이터 집합을 공개적으로 노출하지 않고 다수의 장치를 집합적으로 사용 함으로서 딥러닝 모델을 개선할 수 있게 한다. 모델 구축을 위해서는 자원을 사용하도록 참여자들의 동기 부여를 위한 적절하고 참여 비율에 합당한 인센티브 제도가 필수적이다. 그러나 중앙집중화된 인센티브 메커니즘은 중간 계층에 의존하고 여전히 병목현상을 유발하기 때문에 연합학습에 적용하기에는 어려움이 있다. 따라서, 우리는 이더리움 스마트컨트랙트를 활용하여 연합학습 어플리케이션을 위한 인센티브 모델을 제안한다. 구현 결과는 설계 목표를 충족하였고, 마지막 절에서 연합학습에서 프라이버시 및 데이터 유출과 관련된 민감 데이터에 대한 본 구현을 실행할 때 발생할 수 있는 사항들을 설명한다.

복합시스템을 위한 간접분산학습제어 (Indirect Decentralized Learning Control for the Multiple Systems)

  • Lee, Soo-Cheol
    • 한국산업정보학회:학술대회논문집
    • /
    • 한국산업정보학회 1996년도 추계 학술 발표회 발표논문집
    • /
    • pp.217-227
    • /
    • 1996
  • The new filed of learning control develops controllers that learn to improve their performance at executing a given task , based on experience performing this specific task. In a previous work[6], authors presented a theory of indirect learning control based on use of indirect adaptive control concepts employing simultaneous identification and control. This paper develops improved indirect learning control algorithms, and studies the use of such controller indecentralized systems. The original motivation of the learning control field was learning in robots doing repetitive tasks such as on an asssembly line. This paper starts with decentralized discrete time systems. and progresses to the robot application, modeling the robot as a time varying linear system in the neighborhood of the nominal trajectory, and using the usual robot controllers that are decentralized, treating each link as if it is independent of any coupling with other links. The resultof the paper is to show that stability of the indirect learning controllers for all subsystems when the coupling between subsystems is turned off, assures convergence to zero tracking error of the decentralized indirect learning control of the coupled system, provided that the sample tie in the digital learning controller is sufficiently short.

멀티에이전트 강화학습 기술 동향: 분산형 훈련-분산형 실행 프레임워크를 중심으로 (Survey on Recent Advances in Multiagent Reinforcement Learning Focusing on Decentralized Training with Decentralized Execution Framework)

  • 신영환;서승우;유병현;김현우;송화전;이성원
    • 전자통신동향분석
    • /
    • 제38권4호
    • /
    • pp.95-103
    • /
    • 2023
  • The importance of the decentralized training with decentralized execution (DTDE) framework is well-known in the study of multiagent reinforcement learning. In many real-world environments, agents cannot share information. Hence, they must be trained in a decentralized manner. However, the DTDE framework has been less studied than the centralized training with decentralized execution framework. One of the main reasons is that many problems arise when training agents in a decentralized manner. For example, DTDE algorithms are often computationally demanding or can encounter problems with non-stationarity. Another reason is the lack of simulation environments that can properly handle the DTDE framework. We discuss current research trends in the DTDE framework.

수직다물체시스템의 오차파형전달방식 간접적응형 분산학습제어 (Indirect Adaptive Decentralized Learning Control based Error Wave Propagation of the Vertical Multiple Dynamic Systems)

  • 이수철
    • 한국산업정보학회:학술대회논문집
    • /
    • 한국산업정보학회 2006년도 춘계 국제학술대회 논문집
    • /
    • pp.211-217
    • /
    • 2006
  • 반복학습제어는 특정목적 궤도의 반복작업을 수행하는 정밀도를 개선하는 제어기를 개발하는 기술이다. 기존 연구에서는 수직다물체의 반복정밀도를 개선하기 위하여 누적학습제어와 적응제어 기법을 한 반복영역에서 동시에 실시하는 기법을 개발하였다. 당초 이 기술은 생산조립라인의 산업용 로봇에서 발생하는 반복정밀도를 개선하기 위해 개발하였으며, 특히, 분산학습기법은 산업용 로봇에서 발생하는 실질적 제어 방식에 유효한 기법이다. 본 논문에서 개발한 제어기술은 한 반복영역의 모든 시간대의 입출력 정보를 동시에 학습하기 보다는 매 시간대의 입출력 정보를 각 시간대 마다 충분히 학습하고 다음 시간대의 정보를 학습하는 것이다. 본 논문에서 개발한 기술을 산업용 로봇과 의료기기에 적용하면 수직다물체의 정밀도 품질보증 확보에 큰 기여를 하게 된다.

  • PDF