1 |
M. L. Minsky, Theory of Neural-Analoy Reinforcement Systems and Application to th Brain-Model Problem, Ph.D.Thesis, Princeton University, Princeton, 1954
|
2 |
M. L. Minsky, 'Step towards aritificial intelligence,' In Proceedings of the Institute of Radio Engineers, 49, pp.8-30, 1961
|
3 |
A. W. Moore and C. G. Atkeson, 'Prioritized sweeping: Reinforcement Learning with less data and less real time,' Machine Leraning, 13, pp.103-130, 1993
|
4 |
F. S. Ho, 'Traffic flow modeling and control using artificial neural networks,' IEEE Control Systems, 16(5), pp.16-26, 1996
DOI
ScienceOn
|
5 |
A. G. Barto, D. A. White and D. A. Sofge, 'Reinforcement Learning and adaptive critic methods,' Handbook of Intelligent Control, pp.469-491, 1992
|
6 |
C. W. Anderson, 'Learning to control an inverted pendulum using neural networks,' IEEE Control Systems Magazine, 9, pp.31-37
DOI
ScienceOn
|
7 |
R. S. Sutton, A. G. Barto, 'Reinforcement Learning : An Introduction,' MIT Press, 1988
|
8 |
R. H. Crites and A. G. Barto, 'Improving Elevator Performance Using Reinforcement Learning,' Advances in Neural Information Processing Systems, 8, MIT Press, Cambridge, MA, 1996
|
9 |
C. J. C. H. Watkins, 'Technical note : Q-leraning,' Machine Leraning, 8, pp.279-292
|
10 |
S. P. Singh, 'Transfer of Leraning by Composing Solutions of Elemental Sequential Tasks,' Machine Leraning, 8, pp.323-339, 1992
DOI
|
11 |
M. Benda, V. Jagannathan and R. Dodhiawala, 'On optimalcooperation of knowledge source-an empirical invarstigation,' Technical Report BCS-G2010-28, Boeing Advanced Technology Center, Boeing Computing Services, Seattle, Washington, July, 1986
|
12 |
Peter Stone and Manuela Veloso, 'Multiagent System : A Survey from a Machine Learning,' Technical Report CMU-CS-97-193, The University of Carnegie Mellon, December, 1997
|
13 |
Sandip Sen, Mahendra Sekaran and John Hale, 'Learning to coordinate without sharing information,' National Conference on Aritificial Intelligence, pp.426-431, July, 1994
|
14 |
Tomas Haynes and Sandip Sen, 'Evloving behavioral strategies in predators and prey,' Adaptation and Learning in Multiagent System, Springer Verlag, Berlin, pp.113-126, 1996
|
15 |
L. M. Stephens and M. B. Merx, 'The effect of agent control strategy on the performance of a DAI pursuit problem,' In Proceeding of the 1990 Distributed AI Workshop, October, 1990
|