Browse > Article

Multi-Dimensional Reinforcement Learning Using a Vector Q-Net - Application to Mobile Robots  

Kiguchi, Kazuo (Department of Advanced Systems Control Engineering, Saga University)
Nanayakkara, Thrishantha (Department Advanced Systems Control Engineering, Saga University)
Watanabe, Keigo (Department Micro System Engineering, Nagoya University)
Fukuda, Toshio (Department Micro System Engineering, Nagoya University)
Publication Information
International Journal of Control, Automation, and Systems / v.1, no.1, 2003 , pp. 142-148 More about this Journal
Abstract
Reinforcement learning is considered as an important tool for robotic learning in unknown/uncertain environments. In this paper, we propose an evaluation function expressed in a vector form to realize multi-dimensional reinforcement learning. The novel feature of the proposed method is that learning one behavior induces parallel learning of other behaviors though the objectives of each behavior are different. In brief, all behaviors watch other behaviors from a critical point of view. Therefore, in the proposed method, there is cross-criticism and parallel learning that make the multi-dimensional learning process more efficient. By ap-plying the proposed learning method, we carried out multi-dimensional evaluation (reward) and multi-dimensional learning simultaneously in one trial. A special neural network (Q-net), in which the weights and the output are represented by vectors, is proposed to realize a critic net-work for Q-learning. The proposed learning method is applied for behavior planning of mobile robots.
Keywords
Reinforcement learning; Q-learning; multi-dimensional evaluation; neural networks; intelligent robot;
Citations & Related Records

Times Cited By SCOPUS : 2
연도 인용수 순위
1 Multiple reward criterion for cooperative behavior acquisition in a multiagent environment /
[ E. Uchibe;M. Asada ] / Proc. of IEEE International Conf. on Systems, Man, and Cybernetics
2 Interaction and intelligeng behavior /
[ M. J. Mataric ] / Ph. D. Dissertation, MIT
3 A neural substrate of prediction and reward /
[ W. Schultz;P. Dayan;P.R. Montague ] / Science   DOI
4 Purposive behavior acquisition for a real robot by vision-based reinforcement learning /
[ M. Asada;S. Noda;S. Tawaratumida;K. Hosoda ] / Machine Learning
5 Estimation of internal parameters of rigid body links of manipulators /
[ C. H. An;C. G. Atkeson;J. M. Hollerbach ] / Aartificial Intelligence Memo 887
6 /
[ R. S. Sutton;A. G. Barto ] / Reinforcement Learning
7 Learning from delayed rewards /
[ C. J. C. H Watkins ] / Ph. D. Dissertation, Cambridge University
8 Reinforcement learning for robots using neural networks /
[ L. J. Lin ] / Ph. D. Dissertation, Cambridge Mellon University
9 Learning architecture for real robotic systems Extension of connectionist q-learning for continuous robot control domain /
[ F. Saito;T. Fukuda ] / Proc. of IEEE International Conference on Robotics and Automation
10 Automatic programming of behavior-based robots using reinforcement learning /
[ S. Mahadevan;J. Connell ] / Proc. of 9th National Conf. on Artificial Intelligence
11 Module-based reinforcement learning: experiments with a real robot /
[ Z. Kalmar;C. Szepesvari;A. Lorincz ] / Machine Learning   DOI   ScienceOn
12 Acquiring robot skills via reinforcement learning /
[ V. Gullapalli;J. A. Franklin;H. Benbrahim ] / IEEE Control Systems Magazine   DOI
13 A sensor-based navigation for a mobile robot using fuzzy logic and reinforcement learning /
[ H. K. Beom;H. S. Cho ] / IEEE Trans. on Systems, Man, and Cybernetics   DOI   ScienceOn