Mean Field Game based Reinforcement Learning for Weapon-Target Assignment

Shin, Min Kyu; Park, Soon-Seo; Lee, Daniel; Choi, Han-Lim

doi:10.9766/KIMST.2020.23.4.337

Journal of the Korea Institute of Military Science and Technology (한국군사과학기술학회지)

Volume 23 Issue 4 / Pages 337-345 / 2020 / 1598-9127 (pISSN) / 2636-0640 (eISSN)

The Korea Institute of Military Science and Technology (한국군사과학기술학회)

평균 필드 게임 기반의 강화학습을 통한 무기-표적 할당

Shin, Min Kyu (Department of Aerospace Engineering, Korea Advanced Institute of Science and Technology) ;
Park, Soon-Seo (Department of Aerospace Engineering, Korea Advanced Institute of Science and Technology) ;
Lee, Daniel (Department of Aerospace Engineering, Korea Advanced Institute of Science and Technology) ;
Choi, Han-Lim (Department of Aerospace Engineering, Korea Advanced Institute of Science and Technology)

신민규 (한국과학기술원 항공우주공학과) ;
박순서 (한국과학기술원 항공우주공학과) ;
이단일 (한국과학기술원 항공우주공학과) ;
최한림 (한국과학기술원 항공우주공학과)

Received : 2020.04.14
Accepted : 2020.06.26
Published : 2020.08.05

https://doi.org/10.9766/KIMST.2020.23.4.337

Abstract

The Weapon-Target Assignment (WTA) problem can be formulated as an optimization problem that minimizes the threat posed by targets. Existing methods trade off optimality against execution time to meet various mission objectives. We propose a multi-agent reinforcement learning algorithm for WTA, based on a mean field game, that solves the problem in real time with near-optimal accuracy. The mean field game is a recently introduced method for relieving the curse of dimensionality in multi-agent learning algorithms. In addition, previous reinforcement learning models for WTA generally do not consider weapon interference, which may be critical in real-world operations. We therefore modify the reward function to discourage weapon trajectories from crossing. The feasibility of the proposed method was verified through a real-time simulation of a WTA problem with multiple targets, which showed that the proposed algorithm can assign weapons to all targets without crossing weapon trajectories.
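To make the reward modification concrete, the following is a minimal sketch and not the authors' implementation. It assumes the classical static WTA objective (the expected surviving target value), straight-line trajectories in a 2D plane, and a linear penalty per pair of crossing trajectories. The function names, the `crossing_weight` parameter, and the data layout are all hypothetical choices made for illustration; the abstract does not specify them.

```python
from itertools import combinations


def surviving_threat(assignment, values, kill_prob):
    """Expected surviving target value (classical static WTA objective).

    assignment[w] is the index of the target chosen for weapon w.
    kill_prob[w][t] is the assumed probability that weapon w destroys target t.
    """
    survive = list(values)
    for w, t in enumerate(assignment):
        survive[t] *= 1.0 - kill_prob[w][t]
    return sum(survive)


def _orient(a, b, c):
    """Signed area test: > 0 if c lies to the left of the directed line a -> b."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])


def segments_cross(p1, q1, p2, q2):
    """True when segments p1-q1 and p2-q2 intersect at an interior point."""
    d1 = _orient(p2, q2, p1)
    d2 = _orient(p2, q2, q1)
    d3 = _orient(p1, q1, p2)
    d4 = _orient(p1, q1, q2)
    return d1 * d2 < 0 and d3 * d4 < 0


def count_crossings(assignment, weapon_pos, target_pos):
    """Number of weapon pairs whose straight-line trajectories cross."""
    count = 0
    for i, j in combinations(range(len(assignment)), 2):
        if segments_cross(weapon_pos[i], target_pos[assignment[i]],
                          weapon_pos[j], target_pos[assignment[j]]):
            count += 1
    return count


def reward(assignment, values, kill_prob, weapon_pos, target_pos,
           crossing_weight=1.0):
    """Negative surviving threat minus an assumed linear crossing penalty."""
    threat = surviving_threat(assignment, values, kill_prob)
    crossings = count_crossings(assignment, weapon_pos, target_pos)
    return -threat - crossing_weight * crossings
```

With two weapons at (0, 0) and (1, 0), two targets at (0, 1) and (1, 1), and equal kill probabilities, the assignments `[0, 1]` and `[1, 0]` leave the same surviving threat, but only the second produces a crossing, so it receives the lower reward. In a learning setting, a penalty of this kind would steer agents toward the non-crossing assignment when the threat reduction is otherwise equal.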

