통합 검색 | Korea Science

강화학습에 의해 학습된 기는 로봇의 성능 비교 (Performance Comparison of Crawling Robots Trained by Reinforcement Learning Methods)

박주영;정규백;문영준
- 한국지능시스템학회:학술대회논문집
- /
- 한국퍼지및지능시스템학회 2007년도 춘계학술대회 학술발표 논문집 제17권 제1호
- /
- pp.33-36
- /
- 2007
최근에 인공지능 분야에서는, 국내외적으로 강화학습(reinforcement learning)에 관한 관심이 크게 증폭되고 있다. 강화학습의 최근 경향을 살펴보면, 크게 가치함수를 직접 활용하는 방법(value function-based methods), 제어 전략에 대한 탐색을 활용하는 방법(policy search methods), 그리고 액터-크리틱 방법(actor-critic methods)의 세가지 방향으로 발전하고 있음을 알 수 있다. 본 논문에서는 이중 세 번째 부류인 액터-크리틱 방법 중 NAC(natural actor-critic) 기법의 한 종류인 RLS-NAC(recursive least-squares based natural actor-critic) 알고리즘을 다양한 트레이스 감쇠계수를 사용하여 연속제어입력(real-valued control inputs)으로 제어되는 Kimura의 기는 로봇에 대해 적용해보고, 그 성능을 기존의 SGA(stochastic gradient ascent) 알고리즘을 이용하여 학습한 경우와 비교해보도록 한다.
PDF

RPO 기반 강화학습 알고리즘을 이용한 로봇제어 (Robot Control via RPO-based Reinforcement Learning Algorithm)

김종호;강대성;박주영
- 한국지능시스템학회논문지
- /
- 제15권4호
- /
- pp.505-510
- /
- 2005
제어 입력 선택 문제에 있어서 확률적 전략을 활용하는 RPO(randomized policy optimizer) 기법은 최근에 개발된 강화학습 기법으로써, 많은 적용 사례를 통해서 그 가능성이 입증되고 있다 본 논문에서는, 수정된 RPO 알고리즘을 제안하는데, 이 수정된 알고리즘의 크리틱 네트워크 부분은 RLS(recursive least square) 기법을 통하여 갱신된다. 수정된 RPO 기법의 효율성을 확인하기 위해 Kimura에 의해서 연구된 로봇에 적용하여 매우 우수한 성능을 관찰하였다. 또한, 매트랩 애니메이션 프로그램의 개발을 통해서, 로봇의 이동이 시간에 따라 가속되는 학습 알고리즘의 효과를 시각적으로 확인 할 수 있었다.
https://doi.org/10.5391/JKIIS.2005.15.4.505 인용 PDF KSCI

SGA 기반 강화학습 알고리즘을 이용한 로봇 제어 (Robot Control via SGA-based Reinforcement Learning Algorithms)

박주영;김종호;신호근
- 한국지능시스템학회:학술대회논문집
- /
- 한국퍼지및지능시스템학회 2004년도 추계학술대회 학술발표 논문집 제14권 제2호
- /
- pp.63-66
- /
- 2004
The SGA(stochastic gradient ascent) algorithm is one of the most important tools in the area of reinforcement learning, and has been applied to a wide range of practical problems. In particular, this learning method was successfully applied by Kimura et a1. [1] to the control of a simple creeping robot which has finite number of control input choices. In this paper, we considered the application of the SGA algorithm to Kimura's robot control problem for the case that the control input is not confined to a finite set but can be chosen from a infinite subset of the real numbers. We also developed a MATLAB-based robot animation program, which showed the effectiveness of the training algorithms vividly.
PDF

강화학습 기법과 메타학습을 이용한 기는 로봇의 이동 (Locomotion of Crawling Robots Based on Reinforcement Learning and Meta-Learning)

문영준;정규백;박주영
- 한국지능시스템학회:학술대회논문집
- /
- 한국지능시스템학회 2007년도 추계학술대회 학술발표 논문집
- /
- pp.395-398
- /
- 2007
최근 인공지능 분야에서는 강화학습(Reinforcement Learning)에 대한 관심이 크게 증폭되고 있으며, 여러 관련 분야에 적용되고 있다. 본 논문에서는 강화학습 기법 중 액터-크리틱 계열에 속하는 RLS-NAC 알고리즘을 활용하여 Kimura의 기는 로봇의 이동을 다룰 때에 중요 파라미터의 결정을 위하여 meta-learning 기법을 활용하는 방안에 고려한다.
PDF

RPO 기반 강화학습 알고리즘을 이용한 로봇 제어 (Robot Control via RPO-based Reinforcement Learning Algorithm)

김종호;강대성;박주영
- 한국지능시스템학회:학술대회논문집
- /
- 한국퍼지및지능시스템학회 2005년도 춘계학술대회 학술발표 논문집 제15권 제1호
- /
- pp.217-220
- /
- 2005
The RPO algorithm is a recently developed tool in the area of reinforcement Loaming, And it has been shown In be very successful in several application problems. In this paper, we consider a robot-control problem utilizing a modified RPO algorithm in which its critic network is adapted via RLS(Recursive Least Square) algorithm. We also developed a MATLAB-based animation program, by which the effectiveness of the training algorithms were observed.
PDF

Fuzzy PI with Gain Scheduling Control for a Flexible Joint Robot

Hidenori, Kimura;Lee, Sang-Gu
- 제어로봇시스템학회:학술대회논문집
- /
- 제어로봇시스템학회 2001년도 ICCAS
- /
- pp.93.2-93
- /
- 2001
This paper presents the implementation of fuzzy PI gain scheduling controller (FPICGS) for controlling flexible joint robot arms with uncertainties from time-varying load. The term FPICGS is called based on a combination of fuzzy PI control scheme with a set of rule bases. Principle of design for a FPICGS is given along with the implementation of the designed computer aided control system. The experiment reveals an effectiveness of the proposed control scheme for flexible joint robot arms driven by a DC motorhooked with a spring which both parameters are completely unknown parameters ...
PDF

A variable-speed deburring robot using the repetitive control

Kimura, Yoichi;Mukai, Ryoji;Kobayashi, Fuminori
- 제어로봇시스템학회:학술대회논문집
- /
- 제어로봇시스템학회 1989년도 한국자동제어학술회의논문집; Seoul, Korea; 27-28 Oct. 1989
- /
- pp.663-668
- /
- 1989
Control methods to achieve efficient and accurate deburring robots are proposed. For efficiency, cutting speed is controlled adoptively with the cutting load. For accuracy, it adopts repetitive control. Since usual repetitive control cannot afford dynamical speed changes, the proposed method controls in an interpolating manner using several waveforms stored in the controller. Successful experimental results axe shown.
PDF

Conditions for manipulation of object with multiple contacts by intelligent Jig system

Yashima, Masahito;Kimura, Hiroshi
- 제어로봇시스템학회:학술대회논문집
- /
- 제어로봇시스템학회 1995년도 Proceedings of the Korea Automation Control Conference, 10th (KACC); Seoul, Korea; 23-25 Oct. 1995
- /
- pp.522-525
- /
- 1995
A manipulation of a multiple contacted object by a Rotational Base and Single-jointed Finger mechanism(RBSF mechanism) is discussed. The manipulation is characterized by multiple contacts on an object and large motions of the object with sliding contacts. The kinematics and dynamics allowing sliding at multiple contacts are explored. The conditions for manipulation of an object at multiple contacts by the RBSF mechanism, which cannot exert arbitrary contact forces because it has a fewer number of joints than is required for active control, is presented.
PDF

Evolution Strategies Based Particle Filters for Nonlinear State Estimation

Uosaki, Katsuji;Kimura, Yuuya;Hatanaka, Toshiharu
- 제어로봇시스템학회:학술대회논문집
- /
- 제어로봇시스템학회 2003년도 ICCAS
- /
- pp.559-564
- /
- 2003
Recently, particle filters have attracted attentions for nonlinear state estimation. They evaluate a posterior probability distribution of the state variable based on observations in simulation using so-called importance sampling. However, degeneracy phenomena in the importance weights deteriorate the filter performance. A new filter, Evolution Strategies Based Particle Filter, is proposed to circumvent this difficulty and to improve the performance. Numerical simulation results illustrate the applicability of the proposed idea.
PDF

A robust control system design by a parameter space approach based on sign difinite condition

Kimura, Tetsuya;Hara, Shinji
- 제어로봇시스템학회:학술대회논문집
- /
- 제어로봇시스템학회 1991년도 한국자동제어학술회의논문집(국제학술편); KOEX, Seoul; 22-24 Oct. 1991
- /
- pp.1533-1538
- /
- 1991
A parameter space approach for robust control system design is developed by reducing several design specifications to sign definite conditions. It is shown that the gain and phase margin constraints for the parametric perturbed plant hold if and only if the four Kharitonov systems satisfy the margins. On pole location, it is shown that D-stability of convex combinations (1-t)p(s)+tq(s) can be determined by the coefficients corresponding to p(s) and q(s) based on the sign definite condition. We show a method of PI-type robust control system design as a useful example.
PDF

검색결과 14건 처리시간 0.049초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)