1 |
D. Silver, A. Huang, C. J. Maddison, A.Guez, L.t Sifre, G. V. D. Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, et al. Mastering the game of go with deep neural networks and tree search. Nature, Vol 529, No. 7587, pp. 484-489, 2016. https://doi.org/10.1038/nature16961
DOI
|
2 |
R. S. Sutton, A. G. Barto. Reinforcement learning: An introduction, volume 1. MIT press Cambridge, 1998. https://doi.org/10.1016/S1364-6613(99)01331-5
|
3 |
Mnih, Volodymyr, et al. "Playing atari with deep reinforcement learning." NIPS 2013. http://www.cs.toronto.edu/-vmnih/docs/dqn.pdf
|
4 |
A. Amiranashvili, A. Dosovitskiy, V. Koltun and T. Brox, TD OR NOT TD: Analyzing The Role Of Temporal Differencing In Deep Reinforcement Learning, ICLR 2018. http://arxiv.org/abs/1806.01175
|
5 |
S. Gu, T. Lillicrap, Z. Ghahramani, R. E. Turner, S. Levine, Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic, ICLR 2017. http://arxiv.org/abs/1611.02247
|
6 |
T. Lillicrap, J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra, Continuous control with deep reinforcement learning, ICLR 2016. https://arxiv.org/abs/1509.02971
|
7 |
V. Nair and G. E. Hinton, Rectified Linear Units Improve Restricted Boltzmann Machines, ICML 2010. https://www.cs.toronto.edu/-hinton/absps/reluICML.pdf
|
8 |
OpenAI Gym: https://gym.openai.com
|
9 |
Cart-Pole-V0: https://github.com/openai/gym/wiki/Cart-Pole-v0
|
10 |
Cart-Pole-DQN: https://github.com/rlcode/reinforcement-learning-kr/blob/master/2-cartpole/1-dqn/cartpole_dqn.py, 8 Jul. 2017.
|
11 |
Tensorflow: https://github.com/tensorflow/tensorflow, 31 Oct. 2019.
|
12 |
Keras : https://keras.io/api/ Oct. 2019.
|
13 |
G. Sun, G. O. Boateng, H. Huang and W. Jiang, "A Reinforcement Learning Framework for Autonomous Cell Activation and Customized Energy-Efficient Resource Allocation in C-RANs," KSII Transactions on Internet and Information Systems, vol. 13, no. 8, pp. 3821-3841, 2019. https://doi.org/10.3837/tiis.2019.08.001
DOI
|
14 |
R. Mu and X. Zeng, "A Review of Deep Learning Research," KSII Transactions on Internet and Information Systems, vol. 13, no. 4, pp. 1738-1764, 2019. https://doi.org/10.3837/tiis.2019.04.001
DOI
|