1 |
Micael L. Littman, Anthony R. Cassandra, Lesile Pack Kaelbling, 'Efficient Dynamic-Programming Updates in Partially Observable Markov Decision Processes,' Brown University Technical Report CS-95-19, 1995
|
2 |
M. Riedmiller, A. Merke, D. Meier, 'Karlsruhe Brainstormers A Reinforcement Learning Approach to Robotic Soccer,' Robocup-2000 : Robot World Cup IV, Springer Verlag, Berlin, 2001
|
3 |
Michael L. Littman, Anthony R. Cassandra, Lesile Pack Kaebling, 'Learning Policies for Partially Observable Environments : Scaling up,' Proceedings of the Eleventh International Conference on Machine Learning, pp.157-163, San Francisco, CA, 1994
|
4 |
Noda Itsuki, Matsubara Hitoshi, Hiraki Kazuo, 'Learning Cooperative Behavior in Multi-agent EnvironmentA Case Study of Choice of Play-Plans in Soccer,' Applied Artificial Intelligence, Vol.12, pp.233-250, 1998
DOI
|
5 |
Norihiko Ono, Kenji Fukumoto, 'Multi-agent Reinforcement Learning : A Modular Approach,' Proceedings of the Second International Conference on Multi-Agent Systems, AAAI Press. pp.252-258, 1996
|
6 |
Veloso M., Pagello E., and Kitano H., 'RoboCup-99 : Robot Soccer World Cup III,' Springer Verlag, Berlin, 2000
|
7 |
Ehsan Foroughi, Fredrik Heintz, Spiros Kapetanakis, Kostas Kostiadis, Johan Kummeneje, Itsuki Noda, Oliver Obst, Pat Riley etc., 'RoboCup Soccer Server Manual 7.06,' 2001
|
8 |
Peter Stone, Richard S. Sutton, Satinder Singh, 'Reinforcement Learning for 3 vs. 2 Keepway,' RoboCup-2000 : Robot World Cup IV, Springer Verlag, Berlin, 2001
|
9 |
Junling Hu, Michael P. Wellman, 'Multi-agent Reinforcement Learning : Theoretical Framework and an Algorithm,' Proceedings of the Fifteenth International Conference on Machine Learning, pp.242-250, 1998
|
10 |
Takayuki Kohri. et al., 'An Adaptive Architecture for Modular Q-Learning,' Journal of AI Research, 1998
|
11 |
Weiss G, 'Distributed Artificial Intelligence Meets Machine Learning,' Springer Verlag, Berlin, 1997
|
12 |
Kui-Hong Park, Yong-Jae Kim, Jong-Hwan Kim, 'Modular Q-Learning based Multi-Agent Cooperation for Robot Soccer,' Robotics and Autonomous Systems Vol.35, pp.109-122, 2001
DOI
ScienceOn
|
13 |
Peter Stone, 'Layered Learning in Multiagent Systems,' MIT Press, 2000
|
14 |
Justin A. Boyan, Andrew W. Moore, 'Generalization in Reinforcement Learning Safely Approximating the Value Function, Advances in Neural Information Processing Systems,' Vol.6, pp.671-678, 1993
|
15 |
Kostas Kostiadis, Huosheng Hu, 'Reinforcement Learning and Co-operation in a Simulatied Multi-agent System,' RoboCup-98 : Robot Soccer World Cup II, pp.366-377, Springer Verlag, Berlin, 1999
|
16 |
Tomohito Andou, 'A Robocup Team which Reinforces Positioning with Observation,' Robocup-97 : Robot World Cup I, pp.373-383, Springer Verlag, Berlin, 1998
|
17 |
Lesile Pack Kaebling, Michael L. Littman, Andrew W. Moore, 'Reinforcement Learning : A Survey,' Journal of AI Research Vol.4, pp.147-166, 1996
|
18 |
Marco Wiering, Rafal Salustowicz, Jurgen Schminhuber, 'Reinforcement Learning Soccer Teams with Incomplete World Models,' Journal of Autonomous Robots, Vol.1, No.12, 1999
DOI
|
19 |
Weiss G, 'Distributed Reinforcement Learning,' Journal of Robotics and Autonomous Systems, Vol.15, pp.135-142, 1995
DOI
ScienceOn
|