Automatic Adaptive Space Segmentation for Reinforcement Learning

  • Komori, Yuki (Department of Computer Sciences and Intelligent Systems, Graduate School of Engineering, Osaka Prefecture University) ;
  • Notsu, Akira (Department of Computer Sciences and Intelligent Systems, Graduate School of Engineering, Osaka Prefecture University) ;
  • Honda, Katsuhiro (Department of Computer Sciences and Intelligent Systems, Graduate School of Engineering, Osaka Prefecture University) ;
  • Ichihashi, Hidetomo (Department of Computer Sciences and Intelligent Systems, Graduate School of Engineering, Osaka Prefecture University)
  • Received : 2012.02.28
  • Accepted : 2012.03.12
  • Published : 2012.03.25

Abstract

To develop a new adaptive, automatic method of situation-space segmentation, we ran single-pendulum simulations and observed how several types of situation-space segmentation influence the reinforcement learning process. Segmentation is performed by the Contraction Algorithm and the Cell Division Approach, and it is automated by an "entropy" measure defined on the distributions of action values. Simulation results demonstrate the influence of the segmentation types and the adaptability of the proposed method.
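
Although the abstract does not reproduce the paper's formulas, the idea of automating segmentation with an entropy defined on action-value distributions can be sketched in code. The following Python fragment is a minimal illustration under stated assumptions, not the authors' implementation: the softmax conversion of a cell's Q-values into a probability distribution, the temperature, and the thresholds divide_above and contract_below are all introduced here for the example.

    import numpy as np

    def action_value_entropy(q_values, temperature=1.0):
        """Shannon entropy of the softmax distribution over one cell's action values."""
        prefs = np.asarray(q_values, dtype=float) / temperature
        prefs -= prefs.max()              # stabilize the exponentials
        probs = np.exp(prefs)
        probs /= probs.sum()
        nz = probs > 0                    # treat 0 * log(0) as 0
        return float(-np.sum(probs[nz] * np.log(probs[nz])))

    def segmentation_decision(q_values, divide_above=1.2, contract_below=0.3):
        """Toy rule: divide ambiguous cells, merge decided ones, keep the rest.

        The thresholds are hypothetical values chosen for this example, not
        parameters reported in the paper.
        """
        h = action_value_entropy(q_values)
        if h > divide_above:
            return "divide"      # near-uniform action values: cell is too coarse
        if h < contract_below:
            return "contract"    # one action clearly dominates: cell can be merged
        return "keep"

    # Example: a cell with no clear best action vs. one with a dominant action.
    print(segmentation_decision([0.1, 0.1, 0.1, 0.1]))   # -> "divide"
    print(segmentation_decision([5.0, 0.0, 0.0, 0.0]))   # -> "contract"

Under this reading, a near-uniform action-value distribution (high entropy) marks a cell that is too coarse and is a candidate for the Cell Division Approach, while a sharply peaked one (low entropy) marks a region the Contraction Algorithm could merge.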

References

  1. R. S. Sutton, "Learning to Predict by the Methods of Temporal Differences," Machine Learning, vol. 3, no. 1, pp. 9-44, 1988.
  2. T. Jaakkola, M. I. Jordan, and S. P. Singh, "On the Convergence of Stochastic Iterative Dynamic Programming Algorithms," Neural Computation, vol. 6, no. 6, pp. 1185-1201, 1994.
  3. C. J. C. H. Watkins and P. Dayan, "Technical Note: Q-Learning," Machine Learning, vol. 8, pp. 279-292, 1992.
  4. Y. Kashimura, A. Ueno, and S. Tatsumi, "A Continuous Action Space Representation by Particle Filter for Reinforcement Learning," Proc. JSAI2008, pp. 118-121, 2008.
  5. A. Notsu, K. Honda, H. Ichihashi, and H. Wada, "Contraction Algorithm in State and Action Space for Q-learning," Proc. of SCIS&ISIS, pp. 93-96, 2009.
  6. A. Notsu, H. Wada, K. Honda, and H. Ichihashi, "Cell Division Approach for Search Space in Reinforcement Learning," International Journal of Computer Science and Network Security, vol. 8, no. 6, 2008.
  7. A. Ito and M. Kanabuchi, "Speeding up Multi-Agent Reinforcement Learning by Coarse-Graining of Perception: Hunter Game as an Example," IEICE Trans. D-I, vol. J84-D-I, no. 3, pp. 285-293, 2001.
  8. M. Nagayoshi, H. Murao, and H. Tamaki, "Switching Reinforcement Learning to Mimic an Infant's Motor Development: Application to Two-dimensional Continuous Action Space," Proc. SICE Annual Conference 2010 (SICE 2010), pp. 243-246, 2010.