1 |
V. Francois-Lavet, P. Henderson, R. Islam, M. G. Bellemare, and J. Pineau, "An introduction to deep reinforcement learning," Foundations and Trends in Machine Learning, vol. 11, no. 3-4, pp. 219-354, 2018.
|
2 |
O. Alemi, J. Françoise, and P. Pasquier, "GrooveNet: real-time music-driven dance movement generation using artificial neural networks," in Workshop on Machine Learning for Creativity in conjunction with the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, Canada, 2017.
|
3 |
A. Raghu, M. Komorowski, L. A. Celi, P. Szolovits, and M. Ghassemi, "Continuous state-space models for optimal sepsis treatment-a deep reinforcement learning approach," in Proceedings of the Machine Learning for Health Care Conference (MLHC), Boston, MA, 2017, pp. 147-163.
|
4 |
R. Garg and D. P. Nayak, "Game of tic-tac-toe: Simulation using Min-Max algorithm," International Journal of Advanced Research in Computer Science, vol. 8, no. 7, pp. 1074-1077, 2017.
|
5 |
C. Jin, Z. Allen-Zhu, S. Bubeck, and M. I. Jordan, "Is Q-learning provably efficient?," Advances in Neural Information Processing Systems, vol. 31, pp. 4863-4873, 2018.
|