1 |
Bengio, Y., P. Lamblin, D. Popovici, and H. Larochelle, "Greedy Layer-wise Training of Deep Networks", Advances in Neural Information Processing Systems, Vol.19, 2007, 153.
|
2 |
Fukushima, K., "Neocognitron : A Self-organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position", Biological Cybernetics, Vol.34, No.4, 1980, 193-202.
|
3 |
Hsu, K., H.V. Gupta, and S. Sorooshian, "Artificial Neural Network Modeling of the Rainfall Runoff Process", Water Resources Research, Vol.31, No.10, 1995, 2517-2530.
DOI
|
4 |
Mnih, V., K. Kavukcuoglu, D. Silver, A.A. Rusu, J. Veness, M.G. Bellemare, A. Graves, M. Riedmiller, A.K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis, "Human-level Control Through Deep Reinforcement Learning", Nature, Vol. 518, No.7540, 2015, 529-533.
DOI
|
5 |
Sutton, R.S. and A.G. Barto, Reinforcement Learning : An Introduction, MIT press, Cambridge, 1998.
|
6 |
Sutton, R.S., D. McAllester, S. Singh, and Y. Mansour, "Policy Gradient Methods for Reinforcement Learning with Function Approximation", NIPS, Vol.99, 1999, 1057-1063.
|
7 |
Watkins, C.J.C.H. and P. Dayan, "Q-learning", Machine Learning, Vol.8, No.3, 1992, 279-292, doi : 10.1007/BF00992698(Downloaded April 2, 2016).
DOI
|