References
- R. S. Sutton, "Learning to Predict by the Methods of Temporal Differences," Machine Learning, vol. 3, no. 1, pp. 9-44, 1988.
- T. Jaakkola, M. I. Jordan, and S. P. Singh, "On the Convergence of Stochastic Iterative Dynamic Programming Algorithms," Neural Computation, vol. 6, no. 6, pp. 1185-1201, 1994.
- C. J. C. H. Watkins and P. Dayan, "Technical Note: Q-Learning," Machine Learning, vol. 8, pp. 279-292, 1992.
- Y. Kashimura, A. Ueno, and S. Tatsumi, "A Continuous Action Space Representation by Particle Filter for Reinforcement Learning," Proc. of the 22nd Annual Conference of the Japanese Society for Artificial Intelligence (JSAI2008), pp. 118-121, 2008.
- A. Notsu, H. Honda, H. Ichihashi, and H. Wada, "Contraction Algorithm in State and Action Space for Q-learning," Proc. of SCIS & ISIS 2009, pp. 93-96, 2009.
- A. Notsu, H. Wada, H. Honda, and H. Ichihashi, "Cell Division Approach for Search Space in Reinforcement Learning," International Journal of Computer Science and Network Security, vol. 8, no. 6, 2008.
- A. Ito and M. Kanabuchi, "Speeding up Multi-Agent Reinforcement Learning by Coarse-Graining of Perception: Hunter Game as an Example," IEICE Trans. D-I, vol. J84-D-I, no. 3, pp. 285-293, 2001.
- M. Nagayoshi, H. Murao, and H. Tamaki, "Switching Reinforcement Learning to Mimic an Infant's Motor Development: Application to Two-dimensional Continuous Action Space," Proc. of SICE Annual Conference 2010 (SICE 2010), pp. 243-246, 2010.