A Survey on Deep Reinforcement Learning Libraries

Shin, S.J.;Cho, C.L.;Jeon, H.S.;Yoon, S.H.;Kim, T.Y.;

doi:10.22648/ETRI.2019.J.340608

Electronics and Telecommunications Trends (전자통신동향분석)

Volume 34 Issue 6
/
Pages.87-99
/
2019
/
1225-6455(pISSN)

Electronics and Telecommunications Research Institute (한국전자통신연구원)

DOI QR Code

A Survey on Deep Reinforcement Learning Libraries

심층강화학습 라이브러리 기술동향

신승재 (지능네트워크연구실) ;
조충래 (지능네트워크연구실) ;
전홍석 (지능네트워크연구실) ;
윤승현 (지능네트워크연구실) ;
김태연 (지능네트워크연구실)

Published : 2019.12.01

https://doi.org/10.22648/ETRI.2019.J.340608 Citation PDF

Download PDF

⟨ Previous Next ⟩

Abstract

Reinforcement learning is a type of machine learning paradigm that forces agents to repeat the observation-action-reward process to assess and predict the values of possible future action sequences. This allows the agents to incrementally reinforce the desired behavior for a given observation. Thanks to the recent advancements of deep learning, reinforcement learning has evolved into deep reinforcement learning that introduces promising results in various control and optimization domains, such as games, robotics, autonomous vehicles, computing, industrial control, and so on. In addition to this trend, a number of programming libraries have been developed for importing deep reinforcement learning into a variety of applications. In this article, we briefly review and summarize 10 representative deep reinforcement learning libraries and compare them from a development project perspective.

Keywords

Acknowledgement

Grant : 초연결 지능 인프라 원천기술 연구개발

Supported by : 정보통신기술진흥센터

References

장수영 외, "심층 강화학습 기술 동향," 전자통신동향분석 34권 제4호, 2019. 8, pp. 1-14. https://doi.org/10.22648/etri.2019.j.340401
R.S. Sutton et al., Reinforcement Learning: An Introduction, 2nd edition, Cambridge, MA, USA: MIT Press, 2018.
Y. LeCun et al., "Deep Learning," Nature, vol. 521, May 2015. pp. 436-444. https://doi.org/10.1038/nature14539
V. Mnih et al., "Playing Atari with Deep Reinforcement Learning," arXiv:1312.5602, Dec. 2013.
A.S. Polydoros et al., "Survey of Model-based Reinforcement Learning: Applications on Robotics," J. Intell. Robotic Syst., vol. 86, no. 2, Mar. 2017, pp. 153-173. https://doi.org/10.1007/s10846-017-0468-y
J. Hwangbo et al., "Control of a Quadrotor with Reinforcement Learning," arXiv:1707.5110, July 2017.
J. Zhang et al., "Query-Efficient Imitation Learning for End-to-End Autonomous Driving," arXiv:1605.06450, May 2016.
H. Mao et al., "Resource Management with Deep Reinforcement Learning," in Proc. HotNets'16 , Atlanta, CA, USA, Nov. 2016, pp. 50-56.
H. Mao et al., "Neural Adaptive Video Streaming with Pensieve," in Proc. Conf. SIGCOMM'17 , Los Angeles, CA, USA, Aug. 2017, pp. 197-210.
H. Mao et al., "Learning Scheduling Algorithms for Data Processing Clusters," arXiv:1810.01963, Oct. 2018.
김근영 외, "기계학습을 활용한 5G 통신 동향," 전자통신동향분석 31권 제5호, 2016.10, pp. 1-10. https://doi.org/10.22648/ETRI.2016.J.310501
Y. Deng et al., "Deep Direct Reinforcement Learning for Financial Signal Representation and Trading," IEEE Trans. Neural Netw. Learning Syst., vol. 28, no. 3, March 2017, pp. 653-664. https://doi.org/10.1109/TNNLS.2016.2522401
https://www.yna.co.kr/view/AKR20171018151400017?input=1179m
V. Mnih et al., "Asynchronous Methods for Deep Reinforcement Learning," in Proc. Int Conf. Machine Learning, New York, USA, June 2016, pp. 1928-1937.
T.P. Lillicrap et al., "Continuous Control with Deep Reinforcement Learning," arXiv:1509:02971, Sept. 2015.
J. Schulman et al., "Trust Region Policy Optimization," in Proc. Int. Conf. Machin Learning, Lille, France, July 2015, pp. 1889-1897.
J. Schulman et al., "Proximal Policy Optimization Algorithms," arXiv:1707.06347, Jul. 2017.
T. Schaul et al., "Prioritized Experience Replay," arXiv: 1511.05952, Nov. 2015.
Z. Wang et al., "Dueling Network Architectures for Deep Reinforcement Learning," in Proc. Int Conf. Machine Learning, New York, USA, June 2016, pp. 1995-2003.
H. Hasselt et al., "Deep Reinforcement Learning with Double Q-Learning," in Proc. AAAI Conf. Artif. Intell., Fhoenix, AZ, USA, Feb. 2016, pp. 2094-2100.
오일석, 패턴인식, 교보문고, 2008년.
https://hunkim.github.io/ml/
I. Goodfellow et al., Deep Learning , MIT Press, 2016.
L. Espeholt et al., "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures," in Proc. Int. Conf. Machine Learning, Stockholm, Sweden, July 2018, pp. 1407-1416.
D. Horgan et al., "Distributed Prioritized Experienced Replay," arXiv:1803.00933, March 2018.
S. Kapturowski et al., "Recurrent Experience Replay in Distributed Reinforcement Learning," in Proc. Int. Conf. Machine Learning , Long Beach, CA, USA, May 2019.
R. Lowe et al., "Multi-Agent Actor Critic for Mixed Cooperative-Competitive Environments," arXiv:1706.02275, July 2017.
T. Rashid et al., "QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning," in Proc. Int. Conf. Machine Learning, Stockholm, Sweden, July 2018, pp. 4295-4304.
S. Li et al., "Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient," in Proc. AAAI Conf. Artif. Intell., Honolulu, HI, USA, Jan. 2019.
https://nervanasystems.github.io/coach/
https://github.com/NervanaSystems/coach
https://www.tensorflow.org/?hl=ko
https://mxnet.incubator.apache.org/
https://software.intel.com/en-us/frameworks/tensorflow
https://gym.openai.com/
https://github.com/openai/roboschool
https://github.com/Breakend/gym-extensions
https://github.com/bulletphysics/bullet3
http://vizdoom.cs.put.edu.pl/
http://carla.org/
https://github.com/deepmind/pysc2
https://github.com/deepmind/dm_control
https://opensource.google/projects/dopamine
P.S. Castro et al., "Dopamine: A Research Framework for Deep Reinforcement Learning," arXiv:1812.06110, Dec. 2018.
https://github.com/google/dopamine
https://keras.io/
https://github.com/keras-rl/keras-rl
https://github.com/openai/baselines
tps://www.open-mpi.org
https://spinningup.openai.com/en/latest/
https://github.com/openai/spinningup
https://gym.openai.com/envs/#mujoco
https://ray.readthedocs.io/en/latest/rllib.html
E. Liang et al., "RLlib: Abstractions for Distributed Reinforcement Learning," in Proc. Int. Conf. Machine Learning, Stockholm, Sweden, July 2018, pp. 3053-3062.
https://ray.readthedocs.io/en/latest/index.html#
https://github.com/ray-project/ray
https://pytorch.org/
https://stable-baselines.readthedocs.io/en/master/
https://github.com/hill-a/stable-baselines
https://github.com/araffin/rl-baselines-zoo
https://tensorforce.readthedocs.io/en/latest/
https://github.com/tensorforce/tensorforce
https://github.com/mgbellemare/Arcade-Learning-Environment
https://github.com/microsoft/MazeExplorer
https://github.com/openai/retro
https://opensim.stanford.edu
https://github.com/ntasfi/PyGame-Learning-Environment
https://github.com/tensorflow/agents
https://github.com/deepmind/trfl
https://winderresearch.com/a-comparison-of-reinforcementlearning-frameworks-dopamine-rllib-keras-rl-coach-trfltensorforce-coach-and-more/
https://medium.com/@vermashresth/a-primer-on-deepreinforcement-learning-frameworks-part-1-6c9ab6a0f555
https://mc.ai/choosing-a-deep-reinforcement-learning-library/

Electronics and Telecommunications Trends (전자통신동향분석)

A Survey on Deep Reinforcement Learning Libraries

심층강화학습 라이브러리 기술동향

Abstract

Keywords

Acknowledgement

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)