References
- M. L. Minsky, Theory of Neural-Analoy Reinforcement Systems and Application to th Brain-Model Problem, Ph.D.Thesis, Princeton University, Princeton, 1954
- M. L. Minsky, 'Step towards aritificial intelligence,' In Proceedings of the Institute of Radio Engineers, 49, pp.8-30, 1961
- A. G. Barto, D. A. White and D. A. Sofge, 'Reinforcement Learning and adaptive critic methods,' Handbook of Intelligent Control, pp.469-491, 1992
- A. W. Moore and C. G. Atkeson, 'Prioritized sweeping: Reinforcement Learning with less data and less real time,' Machine Leraning, 13, pp.103-130, 1993
- C. W. Anderson, 'Learning to control an inverted pendulum using neural networks,' IEEE Control Systems Magazine, 9, pp.31-37 https://doi.org/10.1109/37.24809
- F. S. Ho, 'Traffic flow modeling and control using artificial neural networks,' IEEE Control Systems, 16(5), pp.16-26, 1996 https://doi.org/10.1109/37.537205
- R. H. Crites and A. G. Barto, 'Improving Elevator Performance Using Reinforcement Learning,' Advances in Neural Information Processing Systems, 8, MIT Press, Cambridge, MA, 1996
- S. P. Singh, 'Transfer of Leraning by Composing Solutions of Elemental Sequential Tasks,' Machine Leraning, 8, pp.323-339, 1992 https://doi.org/10.1007/BF00992700
- C. J. C. H. Watkins, 'Technical note : Q-leraning,' Machine Leraning, 8, pp.279-292
- R. S. Sutton, A. G. Barto, 'Reinforcement Learning : An Introduction,' MIT Press, 1988
- M. Benda, V. Jagannathan and R. Dodhiawala, 'On optimalcooperation of knowledge source-an empirical invarstigation,' Technical Report BCS-G2010-28, Boeing Advanced Technology Center, Boeing Computing Services, Seattle, Washington, July, 1986
- Peter Stone and Manuela Veloso, 'Multiagent System : A Survey from a Machine Learning,' Technical Report CMU-CS-97-193, The University of Carnegie Mellon, December, 1997
- Sandip Sen, Mahendra Sekaran and John Hale, 'Learning to coordinate without sharing information,' National Conference on Aritificial Intelligence, pp.426-431, July, 1994
- Tomas Haynes and Sandip Sen, 'Evloving behavioral strategies in predators and prey,' Adaptation and Learning in Multiagent System, Springer Verlag, Berlin, pp.113-126, 1996
- L. M. Stephens and M. B. Merx, 'The effect of agent control strategy on the performance of a DAI pursuit problem,' In Proceeding of the 1990 Distributed AI Workshop, October, 1990