Acknowledgement
Grant : 초연결 지능 인프라 원천기술 연구개발
Supported by : 정보통신기술진흥센터
References
- 장수영 외, "심층 강화학습 기술 동향," 전자통신동향분석 34권 제4호, 2019. 8, pp. 1-14. https://doi.org/10.22648/etri.2019.j.340401
- R.S. Sutton et al., Reinforcement Learning: An Introduction, 2nd edition, Cambridge, MA, USA: MIT Press, 2018.
- Y. LeCun et al., "Deep Learning," Nature, vol. 521, May 2015. pp. 436-444. https://doi.org/10.1038/nature14539
- V. Mnih et al., "Playing Atari with Deep Reinforcement Learning," arXiv:1312.5602, Dec. 2013.
- A.S. Polydoros et al., "Survey of Model-based Reinforcement Learning: Applications on Robotics," J. Intell. Robotic Syst., vol. 86, no. 2, Mar. 2017, pp. 153-173. https://doi.org/10.1007/s10846-017-0468-y
- J. Hwangbo et al., "Control of a Quadrotor with Reinforcement Learning," arXiv:1707.5110, July 2017.
- J. Zhang et al., "Query-Efficient Imitation Learning for End-to-End Autonomous Driving," arXiv:1605.06450, May 2016.
- H. Mao et al., "Resource Management with Deep Reinforcement Learning," in Proc. HotNets'16 , Atlanta, CA, USA, Nov. 2016, pp. 50-56.
- H. Mao et al., "Neural Adaptive Video Streaming with Pensieve," in Proc. Conf. SIGCOMM'17 , Los Angeles, CA, USA, Aug. 2017, pp. 197-210.
- H. Mao et al., "Learning Scheduling Algorithms for Data Processing Clusters," arXiv:1810.01963, Oct. 2018.
- 김근영 외, "기계학습을 활용한 5G 통신 동향," 전자통신동향분석 31권 제5호, 2016.10, pp. 1-10. https://doi.org/10.22648/ETRI.2016.J.310501
- Y. Deng et al., "Deep Direct Reinforcement Learning for Financial Signal Representation and Trading," IEEE Trans. Neural Netw. Learning Syst., vol. 28, no. 3, March 2017, pp. 653-664. https://doi.org/10.1109/TNNLS.2016.2522401
- https://www.yna.co.kr/view/AKR20171018151400017?input=1179m
- V. Mnih et al., "Asynchronous Methods for Deep Reinforcement Learning," in Proc. Int Conf. Machine Learning, New York, USA, June 2016, pp. 1928-1937.
- T.P. Lillicrap et al., "Continuous Control with Deep Reinforcement Learning," arXiv:1509:02971, Sept. 2015.
- J. Schulman et al., "Trust Region Policy Optimization," in Proc. Int. Conf. Machin Learning, Lille, France, July 2015, pp. 1889-1897.
- J. Schulman et al., "Proximal Policy Optimization Algorithms," arXiv:1707.06347, Jul. 2017.
- T. Schaul et al., "Prioritized Experience Replay," arXiv: 1511.05952, Nov. 2015.
- Z. Wang et al., "Dueling Network Architectures for Deep Reinforcement Learning," in Proc. Int Conf. Machine Learning, New York, USA, June 2016, pp. 1995-2003.
- H. Hasselt et al., "Deep Reinforcement Learning with Double Q-Learning," in Proc. AAAI Conf. Artif. Intell., Fhoenix, AZ, USA, Feb. 2016, pp. 2094-2100.
- 오일석, 패턴인식, 교보문고, 2008년.
- https://hunkim.github.io/ml/
- I. Goodfellow et al., Deep Learning , MIT Press, 2016.
- L. Espeholt et al., "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures," in Proc. Int. Conf. Machine Learning, Stockholm, Sweden, July 2018, pp. 1407-1416.
- D. Horgan et al., "Distributed Prioritized Experienced Replay," arXiv:1803.00933, March 2018.
- S. Kapturowski et al., "Recurrent Experience Replay in Distributed Reinforcement Learning," in Proc. Int. Conf. Machine Learning , Long Beach, CA, USA, May 2019.
- R. Lowe et al., "Multi-Agent Actor Critic for Mixed Cooperative-Competitive Environments," arXiv:1706.02275, July 2017.
- T. Rashid et al., "QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning," in Proc. Int. Conf. Machine Learning, Stockholm, Sweden, July 2018, pp. 4295-4304.
- S. Li et al., "Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient," in Proc. AAAI Conf. Artif. Intell., Honolulu, HI, USA, Jan. 2019.
- https://nervanasystems.github.io/coach/
- https://github.com/NervanaSystems/coach
- https://www.tensorflow.org/?hl=ko
- https://mxnet.incubator.apache.org/
- https://software.intel.com/en-us/frameworks/tensorflow
- https://gym.openai.com/
- https://github.com/openai/roboschool
- https://github.com/Breakend/gym-extensions
- https://github.com/bulletphysics/bullet3
- http://vizdoom.cs.put.edu.pl/
- http://carla.org/
- https://github.com/deepmind/pysc2
- https://github.com/deepmind/dm_control
- https://opensource.google/projects/dopamine
- P.S. Castro et al., "Dopamine: A Research Framework for Deep Reinforcement Learning," arXiv:1812.06110, Dec. 2018.
- https://github.com/google/dopamine
- https://keras.io/
- https://github.com/keras-rl/keras-rl
- https://github.com/openai/baselines
- tps://www.open-mpi.org
- https://spinningup.openai.com/en/latest/
- https://github.com/openai/spinningup
- https://gym.openai.com/envs/#mujoco
- https://ray.readthedocs.io/en/latest/rllib.html
- E. Liang et al., "RLlib: Abstractions for Distributed Reinforcement Learning," in Proc. Int. Conf. Machine Learning, Stockholm, Sweden, July 2018, pp. 3053-3062.
- https://ray.readthedocs.io/en/latest/index.html#
- https://github.com/ray-project/ray
- https://pytorch.org/
- https://stable-baselines.readthedocs.io/en/master/
- https://github.com/hill-a/stable-baselines
- https://github.com/araffin/rl-baselines-zoo
- https://tensorforce.readthedocs.io/en/latest/
- https://github.com/tensorforce/tensorforce
- https://github.com/mgbellemare/Arcade-Learning-Environment
- https://github.com/microsoft/MazeExplorer
- https://github.com/openai/retro
- https://opensim.stanford.edu
- https://github.com/ntasfi/PyGame-Learning-Environment
- https://github.com/tensorflow/agents
- https://github.com/deepmind/trfl
- https://winderresearch.com/a-comparison-of-reinforcementlearning-frameworks-dopamine-rllib-keras-rl-coach-trfltensorforce-coach-and-more/
- https://medium.com/@vermashresth/a-primer-on-deepreinforcement-learning-frameworks-part-1-6c9ab6a0f555
- https://mc.ai/choosing-a-deep-reinforcement-learning-library/