http://dx.doi.org/10.3745/KIPSTB.2003.10B.3.249

Multiagent Control Strategy Using Reinforcement Learning

Lee, Hyong-Ill (Department of Software Production, Kimpo College)
Kim, Byung-Cheon (Department of Web Information Engineering, Hankyong National University)
Abstract
The most important problems in a multiagent system are accomplishing a goal through the efficient coordination of several agents and preventing collisions with other agents. In this paper, we propose a new control strategy for efficiently achieving the goal of the prey pursuit problem. Our control method uses reinforcement learning to control the multiagent system and considers the distance as well as the spatial relationship between the agents in the state space of the prey pursuit problem.
Keywords
Reinforcement Learning; Multiagent; Pursuit Problem; Control Strategy