http://dx.doi.org/10.7583/JKGS.2019.19.6.61

A Study about the Usefulness of Reinforcement Learning in Business Simulation Games using PPO Algorithm  

Liang, Yi-Hong (School of Games, Hongik University)
Kang, Sin-Jin (School of Games, Hongik University)
Cho, Sung Hyun (School of Games, Hongik University)
Abstract
In this paper, we apply reinforcement learning to a management simulation game to check whether game agents can autonomously achieve a given goal. We apply the PPO (Proximal Policy Optimization) algorithm in the Unity Machine Learning Agents (ML-Agents) environment, and the game agent is designed to find a way to play on its own. Simulation experiments on five game scenarios were conducted to verify the usefulness of this approach. The results confirm that the game agent achieves the goal through learning even when the environment variables in the game change.
Keywords
Reinforcement Learning; Proximal Policy Optimization Algorithm; Game Agent
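
The abstract names PPO without describing its update rule. As a rough illustration only, the clipped surrogate objective from the original PPO paper (Schulman et al., 2017) can be sketched in plain Python as below. The function name, the argument names, and the clipping range of 0.2 are assumptions made for this sketch. They are not taken from this study or from the Unity ML-Agents source.

```python
import math


def ppo_clipped_objective(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective over a batch of sampled actions.

    Hypothetical helper for illustration; clip_eps=0.2 is an assumed value,
    not a setting reported in the paper summarized above.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio between the updated policy and the policy
        # that collected the data.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - clip_eps, 1 + clip_eps].
        clipped_ratio = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped_ratio * adv)
    return total / len(advantages)


if __name__ == "__main__":
    # One action made twice as likely with a positive advantage:
    # the gain is capped by the clipped ratio.
    print(ppo_clipped_objective([math.log(2.0)], [0.0], [1.0]))
```

A trainer would maximize this quantity with respect to the policy parameters. The clipping removes the incentive to move the new policy far from the one that generated the samples, which is the property that makes PPO comparatively stable when a game's environment variables are changed between training runs.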