1 |
Sungpill Kim, Deep Learning First Step, pp.17-33, Hanbit Media, 2016.
|
2 |
Taewoo Lee, Jinhoo Ryu, Heemin Park, "Hovering Control of 1-Axial Drone with Reinforcement Learning", Journal of Korea Multimedia Society, Vol.21, No.2, pp.250-260, 2018.
|
3 |
Daniel R. Jiang, Emmanuel Ekwedike, Han Liu, "Feedback-Based Tree Search for Reinforcement Learning", arXiv:1805.05935, 2018.
|
4 |
Jeongsoo Han, "A Study of Adaptive QoS Routing scheme using Policy-gradient Reinforcement Learning", Journal of the Korea Society of Computer and Information, Vol.16, No.2, pp.93-99, 2011.
|
5 |
Jongho Kim, Daesung Kang, Jooyoung Park, "Robot Locomotion via RLS-based Actor-Critic Learning", Journal of Korean Institute of Intelligent Systems, Vol.15, No.7, pp.893-898, 2005.
|
6 |
Arthur Juliani, "Introducing: Unity Machine Learning Agents Toolkit", Unity Blog, https://blogs.unity3d.com/2017/09/19/introducing-unity-machine-learning-agents/, 2017.
|
7 |
Wooil Shim, Taehwa Park, Kyungjoong Kim, "Comparison of Policy Optimization Reinforcement Learning for Simulated Autonomous Car Environment", Korea Information Science Society, pp.833-835, 2018.
|
8 |
Adrian Gonzalez Ramirez, "Neural networks applied to a tower defense video game", Universitat Jaume I, Grau en Disseny i Desenvolupament de Videojocs [94], 2018.
|
9 |
Arthur Juliani, Vincent-Pierre Berges, Esh Vckay, Yuan Gao, Hunter Henry, Marwan Mattar, Danny Lange, "ML-Agents Toolkit Overview", https://github.com/Unity-Technologies/ml-agents/blob/master/docs/ML-Agents-Overview.md, 2017.
|
10 |
Jaehoon Lee, Taerim Kim, Jonggyu Song, Hyunjae Im, "Flight Trajectory Simulation via Reinforcement Learning in Virtual Environment", Journal of the Korea Society for Simulation, Vol.27, No.4, pp.1-8, 2018.
|
11 |
Sonic, "PPO (Proximal Policy Optimization Algorithms) | Machine Learning & QA", Naver Blog, https://cafe.naver.com/soynature/2400, 2017.
|
12 |
Saemaro Moon, Yonglak Choi, "A Study on Application of Reinforcement Learning Algorithm Using Pixel Data", Journal of Information Technology Services, Vol.15, No.4, pp.85-95, 2016.
|
13 |
John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov, "Proximal Policy Optimization Algorithms", OpenAI, arxiv.org/pdf/1707.06347, 2017.
|
14 |
RL Korea, "PG Travel Guide", RLKoreaBlog, https://reinforcement-learning-kr.github.io/2018/06/29/0_pg-travel-guide/#, 2018.
|
15 |
Kyeongnam Kim, "ML-Agents Project Organization Unity ML / Unity", Naver Blog, https://blog.naver.com/kkyy0126/221448746477, 2019.
|
16 |
Arthur Juliani, Vincent-Pierre Berges, Esh Vckay, Yuan Gao, Hunter Henry, Marwan Mattar, Danny Lange, "Getting Started with the 3D Balance Ball Environment", https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Getting-Started-with-Balance-Ball.md#observing-training-progress, 2017.
|
17 |
Arthur Juliani, Vincent-Pierre Berges, Esh Vckay, Yuan Gao, Hunter Henry, Marwan Mattar, Danny Lange, "Training with Proximal Policy Optimization", https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Training-PPO.md, 2017.
|