Robot locomotion via IRPO based Actor-Critic Learning Method

Kim, Jong-Ho;Kang, Dae-Sung;Park, Joo-Young;

대한전기학회:학술대회논문집 (Proceedings of the KIEE Conference)

대한전기학회 (The Korean Institute of Electrical Engineers)

IRPO 기반 Actor-Critic 학습 기법을 이용한 로봇이동

Robot locomotion via IRPO based Actor-Critic Learning Method

김종호 (고려대학교 제어계측공학과) ;
강대성 (고려대학교 제어계측공학과) ;
박주영 (고려대학교 제어계측공학과)

Kim, Jong-Ho (Dept. of Control & Instrumentation Engineering, Korea University) ;
Kang, Dae-Sung (Dept. of Control & Instrumentation Engineering, Korea University) ;
Park, Joo-Young (Dept. of Control & Instrumentation Engineering, Korea University)

발행 : 2005.07.18

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

The IRPO(Intensive Randomized Policy Optimizer) algorithm is a recently developed tool in the area of reinforcement leaming. And it has been shown to be very successful in several application problems. To compare with a general RL method, IRPO has some difference in that policy utilizes the entire history of agent -environment interaction. The policy is derived from the history directly, not through any kind of a model of the environment. In this paper, we consider a robot-control problem utilizing a IRPO algorithm. We also developed a MATLAH-based animation program, by which the effectiveness of the training algorithms were observed.

대한전기학회:학술대회논문집 (Proceedings of the KIEE Conference)

IRPO 기반 Actor-Critic 학습 기법을 이용한 로봇이동

Robot locomotion via IRPO based Actor-Critic Learning Method

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)