DOI QR코드

DOI QR Code

Reinforcement leaning based multi-echelon supply chain distribution planning

강화학습 기반의 다단계 공급망 분배계획

  • Kwon, Ick-Hyun (Department of Industrial and Management Engineering, Inje University)
  • 권익현 (인제대학교 산업경영공학과)
  • Received : 2014.07.20
  • Accepted : 2014.12.22
  • Published : 2014.12.31

Abstract

Various inventory control theories have tried to modelling and analyzing supply chains by using quantitative methods and characterization of optimal control policies. However, despite of various efforts in this research filed, the existing models cannot afford to be applied to the realistic problems. The most unrealistic assumption for these models is customer demand. Most of previous researches assume that the customer demand is stationary with a known distribution, whereas, in reality, the customer demand is not known a priori and changes over time. In this paper, we propose a reinforcement learning based adaptive echelon base-stock inventory control policy for a multi-stage, serial supply chain with non-stationary customer demand under the service level constraint. Using various simulation experiments, we prove that the proposed inventory control policy can meet the target service level quite well under various experimental environments.

Keywords

References

  1. Axsater, S., Inventory control, Springer (2006).
  2. Diks, E.B., Kok, A.G. de and Lagodimos, A.G., "Multi-echelon systems: a service measure perspective", European Journal of Operational Research, Vol.95 (1996):pp. 241-263. https://doi.org/10.1016/S0377-2217(96)00120-8
  3. Forteus, E., Foundations of stochastic inventory theory, Stanford University Press (2002).
  4. Gavirneni, S. and Tayur, S., "An efficient procedure for non-stationary inventory control", IIE Transactions, Vol.33 (2001):pp. 83-89.
  5. Jiang, C. and Sheng, Z., "Case-based reinforcement learning for dynamic inventory control in a multi-agent supply-chain system", Expert Systems with Applications, Vol.36 (2009) : pp. 6520-6526. https://doi.org/10.1016/j.eswa.2008.07.036
  6. Kim, C.O., Kwon, I.H. and Baek, J.G., "Asynchronous action-reward learning for nonstationary serial supply chain inventory control", Applied Intelligence, Vol.28, No.1 (2008):pp. 1-16. https://doi.org/10.1007/s10489-007-0038-2
  7. Kwon, I.H., "Distribution planning for a supply chain to satisfy target service level with non-stationary demand", Journal of the Korean Society of Supply Chain Management, Vol.10, No.2 (2010):pp. 81-90.
  8. Kwon, I.H., Kim, C.O., Jun, J. and Lee, J.H., "Case-based myopic reinforcement learning for satisfying target service level in supply chain", Expert Systems with Applications, Vol.35 (2008) : pp. 389-397. https://doi.org/10.1016/j.eswa.2007.07.002
  9. Simchi-Levi, D., Kaminsky, P. and Simchi-Levi, E., Designing and Managing the Supply Chain, McGraw-Hill (2008).
  10. Sutton, R.S. and Barto, A.G., Reinforcement learning, MIT Press (1998).
  11. Tersine, R.J., Principles of inventory and materials management, Prentice-Hall (1994).
  12. Vollmann, T.E., Berry, W.L., Whybark, D.C. and Jacobs, F.R., Manufacturing planning and control for supply chain management, McGraw-Hill (2011).
  13. Zipkin, P.H., Foundation of inventory management, McGraw-Hill (2000).