http://dx.doi.org/10.3745/KIPSTB.2002.9B.2.139

Design and Implementation of Robot Soccer Agent Based on Reinforcement Learning

Kim, In-Cheol (Dept. of Information Engineering, Kyounggi University)
Abstract
The robot soccer simulation game is a dynamic multi-agent environment. In this paper we propose a new reinforcement learning approach to each agent's dynamic positioning in such an environment. Reinforcement learning is a form of machine learning in which an agent learns, from indirect and delayed rewards, an optimal policy for choosing sequences of actions that produce the greatest cumulative reward. It therefore differs from supervised learning in that no input-output pairs are presented as training examples. Furthermore, model-free reinforcement learning algorithms such as Q-learning do not require defining or learning any model of the surrounding environment. Nevertheless, these algorithms can learn the optimal policy if the agent can visit every state-action pair infinitely often. However, the biggest problem of monolithic reinforcement learning is that its straightforward application does not scale up to more complex environments, because the state space becomes intractably large. To address this problem, we propose Adaptive Mediation-based Modular Q-Learning (AMMQL) as an improvement on the existing Modular Q-Learning (MQL). While simple modular Q-learning combines the results from each learning module in a fixed way, AMMQL combines them more flexibly by assigning a different weight to each module according to its contribution to rewards. Therefore, in addition to handling the large state space effectively, AMMQL can show higher adaptability to environmental changes than pure MQL. In this paper we use the AMMQL algorithm as the learning method for dynamic positioning of the robot soccer agent, and implement a robot soccer agent system called Cogitoniks.
Keywords
multi-agent system; robot soccer simulation game; reinforcement learning
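
The abstract describes AMMQL only in outline, so the following is a minimal sketch of the general idea rather than the paper's algorithm: several tabular Q-learning modules each observe part of the state, and a mediator combines their Q-values with per-module weights that adapt over time. The class names (`QModule`, `AdaptiveMediator`), the parameters (`alpha`, `gamma`, `beta`), and in particular the weight-update rule (favoring modules whose reward prediction error is small) are assumptions made for illustration; the abstract does not specify how a module's "contribution to rewards" is measured.

```python
import random
from collections import defaultdict


class QModule:
    """One tabular Q-learning module that sees only its own partial state."""

    def __init__(self, n_actions, alpha=0.1, gamma=0.9):
        self.n_actions = n_actions
        self.alpha = alpha  # learning rate
        self.gamma = gamma  # discount factor
        self.q = defaultdict(lambda: [0.0] * n_actions)

    def update(self, state, action, reward, next_state):
        # Standard one-step Q-learning update; returns the TD error.
        target = reward + self.gamma * max(self.q[next_state])
        td_error = target - self.q[state][action]
        self.q[state][action] += self.alpha * td_error
        return td_error


class AdaptiveMediator:
    """Combines module Q-values using weights that adapt during learning."""

    def __init__(self, modules, beta=0.05):
        self.modules = modules
        self.beta = beta  # weight adaptation rate (hypothetical parameter)
        self.weights = [1.0 / len(modules)] * len(modules)

    def combined_values(self, states):
        # Weighted sum of each module's Q-values, one total per action.
        n_actions = self.modules[0].n_actions
        return [
            sum(w * m.q[s][a]
                for w, m, s in zip(self.weights, self.modules, states))
            for a in range(n_actions)
        ]

    def select_action(self, states, epsilon=0.1):
        # Epsilon-greedy choice over the combined values.
        values = self.combined_values(states)
        if random.random() < epsilon:
            return random.randrange(len(values))
        return max(range(len(values)), key=values.__getitem__)

    def learn(self, states, action, reward, next_states):
        # Every module learns from the same action and reward.
        errors = [
            abs(m.update(s, action, reward, ns))
            for m, s, ns in zip(self.modules, states, next_states)
        ]
        # ASSUMPTION: a module that predicted the reward well (small TD
        # error) is treated as contributing more and gains weight.
        scores = [1.0 / (1e-6 + e) for e in errors]
        total = sum(scores)
        self.weights = [
            (1.0 - self.beta) * w + self.beta * sc / total
            for w, sc in zip(self.weights, scores)
        ]
```

In this sketch, each call passes one partial state per module, for example `mediator.learn(("ball_near", "mate_far"), action, reward, ("ball_far", "mate_far"))`, where the state labels are hypothetical. Setting `beta=0` freezes the weights at their equal initial values, which gives a fixed combination for comparison with the adaptive one.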