Capacitated Fab Scheduling Approximation using Average Reward TD(λ) Learning based on System Feature Functions


  • Choi, Jin-Young (Division of Industrial and Information Systems Engineering, Ajou University)
  • Received : 2011.10.06
  • Accepted : 2011.12.15
  • Published : 2011.12.31

Abstract

In this paper, we propose a logical control-based actor-critic algorithm as an efficient approach to approximating the capacitated fab scheduling problem. We apply the average reward temporal-difference learning method to estimate the relative value functions of system states, while avoiding deadlock situations through Banker's algorithm. We evaluate the suggested algorithm on the Intel mini-fab re-entrant line, performing a numerical experiment on randomly generated sample system configurations. We show that the suggested method achieves superior performance compared to other well-known heuristics.
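
The critic in this kind of formulation maintains a linearly parameterized relative value function over system feature functions together with a running estimate of the long-run average reward, in the spirit of the average cost TD(λ) method of Tsitsiklis and Van Roy [17]. The sketch below is illustrative only: the feature map phi, the simulated transition function step, and the admissible_actions routine (assumed to return only deadlock-free actions, e.g., as screened by Banker's algorithm) are hypothetical placeholders, not interfaces from the paper.

    import numpy as np

    def avg_reward_td_lambda(phi, step, admissible_actions, s0, n_features,
                             n_steps=100_000, alpha=0.01, beta=0.001, lam=0.7,
                             epsilon=0.1, seed=0):
        """Illustrative average reward TD(lambda) critic with a linear
        relative value function, paired with an epsilon-greedy actor.

        phi(s)                -> feature vector of length n_features
        step(s, a)            -> (next_state, reward) from one simulated transition
        admissible_actions(s) -> list of deadlock-free actions in state s
                                 (e.g., pre-screened by Banker's algorithm)
        """
        rng = np.random.default_rng(seed)
        theta = np.zeros(n_features)   # weights of the relative value function
        rho = 0.0                      # running estimate of the average reward
        trace = np.zeros(n_features)   # eligibility trace
        s = s0
        for _ in range(n_steps):
            # One-sample lookahead over admissible actions; for stochastic
            # transitions this previews a single sampled outcome per action
            # (an illustrative simplification).
            candidates = [(a, *step(s, a)) for a in admissible_actions(s)]
            if rng.random() < epsilon:
                _, s_next, r = candidates[rng.integers(len(candidates))]
            else:
                _, s_next, r = max(candidates,
                                   key=lambda c: c[2] + theta @ phi(c[1]))
            # Average reward TD(lambda) update of the critic.
            delta = r - rho + theta @ phi(s_next) - theta @ phi(s)
            rho += beta * (r - rho)          # track the long-run average reward
            trace = lam * trace + phi(s)     # accumulate eligibility
            theta += alpha * delta * trace
            s = s_next
        return theta, rho

In this sketch, rho tracks the average reward gain while theta parameterizes the differential (relative) value function; the eligibility trace with parameter λ propagates temporal-difference errors backward over recently visited features.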

References

  1. Bertsekas, D. P. and Tsitsiklis, J. N.; Neuro-Dynamic Programming, Athena Scientific, 1996.
  2. Choi, J. Y. and Reveliotis, S. A.; "Relative value function approximation for the capacitated re-entrant line scheduling problem," IEEE Trans. Autom. Science and Eng., 2(3) : 285-299, 2005. https://doi.org/10.1109/TASE.2005.849085
  3. Choi, J. Y. and Kim, S. B.; "Computationally efficient neuro-dynamic programming approximation method for the capacitated re-entrant line scheduling problem," International Journal of Production Research (accepted).
  4. Jolliffe, I. T.; Principal Component Analysis, Springer-Verlag, 2002.
  5. Kumar, P. R.; "Scheduling manufacturing systems of re-entrant lines," in Stochastic Modeling and Analysis of Manufacturing Systems, D. D. Yao, Ed. Berlin, Germany : Springer-Verlag, 325-360, 1994.
  6. Kumar, P. R.; "Scheduling semiconductor manufacturing plants," IEEE Control Syst. Mag., 14(6) : 33-40, 1994.
  7. Kumar, S. and Kumar, P. R.; "Fluctuation smoothing policies are stable for stochastic re-entrant lines," Discrete Event Dynamic Systems : Theory and Applications, 6 : 361-370, 1996. https://doi.org/10.1007/BF01797136
  8. Kumar, S. and Kumar, P. R.; "Queueing network models in the design and analysis of semiconductor wafer fabs," IEEE Trans. Robot. Automat., 17(5) : 548-561, 2001. https://doi.org/10.1109/70.964657
  9. Lu, S. H. and Kumar, P. R.; "Distributed scheduling based on due dates and buffer priorities," IEEE Trans. Autom. Control, 36(12) : 1406-1416, 1991. https://doi.org/10.1109/9.106156
  10. Lu, S. H., Ramaswamy, D., and Kumar, P. R.; "Efficient scheduling policies to reduce mean and variance of cycle-time in semiconductor manufacturing plants," IEEE Trans. Semicond. Manuf., 7(3) : 374-385, 1994. https://doi.org/10.1109/66.311341
  11. Puterman, M. L.; Markov Decision Processes : Discrete Stochastic Dynamic Programming, New York : Wiley, 1994.
  12. Reveliotis, S. A.; "The destabilizing effect of blocking due to finite buffering capacity in multi-class queueing networks," IEEE Trans. Autom. Control, 45(3) : 585-588, 2000. https://doi.org/10.1109/9.847750
  13. Reveliotis, S. A.; Real-time management of resource allocation systems, Springer, 2005.
  14. Rossetti, M. D., Hill, R. R., Johansson, B., Dunkin, A., and Ingalls, R. G.; "A simulation-based approximate dynamic programming approach for the control of the Intel mini-fab benchmark model," In Proceedings of the 2009 Winter Simulation Conference, 2009.
  15. Sutton, R. S. and Barto, A. G.; Reinforcement Learning : An Introduction, MIT Press, 1998.
  16. Tsitsiklis, J. N. and Van Roy, B.; "Feature-based methods for large scale dynamic programming," Machine Learning, 22 : 59-94, 1996.
  17. Tsitsiklis, J. N. and Van Roy, B.; "Average cost temporal-difference learning," Automatica, 35 : 1799-1808, 1999. https://doi.org/10.1016/S0005-1098(99)00099-0
  18. Wein, L. M.; "Scheduling semiconductor wafer fabrication," IEEE Trans. Semicond. Manufact., 1(3) : 115-130, 1988. https://doi.org/10.1109/66.4384