[KSCI] Korea Science Citation Index Service

Capacitated Fab Scheduling Approximation using Average Reward TD( ${\lambda}$ ) Learning based on System Feature Functions

Choi, Jin-Young (Division of Industrial and Information Systems Engineering, Ajou University)

Publication Information

Journal of Korean Society of Industrial and Systems Engineering / v.34, no.4, 2011 , pp. 189-196 More about this Journal

Abstract

In this paper, we propose a logical control-based actor-critic algorithm as an efficient approach for the approximation of the capacitated fab scheduling problem. We apply the average reward temporal-difference learning method for estimating the relative value functions of system states, while avoiding deadlock situation by Banker's algorithm. We consider the Intel mini-fab re-entrant line for the evaluation of the suggested algorithm and perform a numerical experiment by generating some sample system configurations randomly. We show that the suggested method has a prominent performance compared to other well-known heuristics.

Keywords

Fab Scheduling Problem; Actor-critic; Temporal-difference; Average Reward; Banker's Algorithm; Feature Functions;

Citations & Related Records

Reference

1	Kumar, P. R.; "Scheduling manufacturing systems of re-entrant lines," in Stochastic Modeling and Analysis of Manufacturing Systems, D. D. Yao, Ed. Berlin, Germanyy : Springer-Verlag, 325-360, 1994.
2	Kumar, P. R.; "Scheduling semiconductor manufacturing plants," IEEE Control Syst. Mag, 14(6) : 33-40, 1994.
3	Kumar, S. and Kumar, P. R.; "Fluctuation smoothing policies are stable for stochastic re-entrant lines," Discrete-Event Dynam. Syst., : Theory and Applicat., 6 : 361-370, 1996. DOI ScienceOn
4	Kumar, S. and Kumar, P. R.; "Queueing network models in the design and analysis of semiconductor wafer fabs," IEEE Trans. Robot. Automat., 17(5) : 548-561, 2001. DOI ScienceOn
5	Lu, S. H. and Kumar, P. R.; "Distributed scheduling based on due dates and buffer priorities," IEEE Trans. Autom. Control, 36(12) : 1406-1416, 1991. DOI ScienceOn
6	Lu, S. H., Ramaswamy, D., and Kumar, P. R.; "Efficient scheduling policies to reduce mean and variance of cycle-time in semiconductor manufacturing plants," IEEE Trans. Semicond. Manuf, 7(3) : 374-385, 1994. DOI ScienceOn
7	Puterman, M. L.; Markov Decision Processes : Discrete Stochastic Dynamic Programming, New York : Wiley, 1994.
8	Reveliotis, S. A.; "The destabilizing effect of blocking due to finite buffering capacity in multi-class queueing networks," IEEE Trans. Autom. Control, 45(3) : 585-588, 2000. DOI ScienceOn
9	Reveliotis, S. A.; Real-time management of resource allocation systems, Springer, 2005.
10	Rossetti, M. D., Hill, R. R., Johansson, B., Dunkin, A., and Ingalls, R. G.; "A simulation-based approximate dynamic programming approach for the control of the intel mini-fab benchmark model," In the proc. of the 2009 winter sim. conf., 2009.
11	Sutton, R. S. and Barto, A. G.; Reinforcement Learning (An Introduction), MIT Press, 1999.
12	Tsitsiklis, J. N. and Roy, B. V.; "Feature-based methods for large scale dynamic programming," Machine Learning, 22 : 59-94, 1996.
13	Tsitsiklis, J. N. and Roy, B. V.; "Average cost temporal-difference learning," Automatica, 35 : 1799-1808, 1999. DOI ScienceOn
14	Wein, L. M.; "Scheduling semiconductor wafer fabrication," IEEE Trans. Semicond. Manufact., 1(3) : 115-130, 1988. DOI ScienceOn
15	Jolliffe, I. T.; Principal Component Analysis, Springer-Verlag, 2002.
16	Bertsekas, D. P. and Tsitsiklis, J. N.; Neuro-Dynamic Programming, Athena Scientific, 1996.
17	Choi, J. Y. and Reveliotis, S. A.; "Relative value function approximation for the capacitated re-entrant line scheduling problem," IEEE Trans. Autom. Science and Eng, 2(3) : 285-299, 2005. DOI ScienceOn
18	Choi, J. Y. and Kim, S. B.; "Computationally efficient neuro-dynamic programming approximation method for the capacitated re-entrant line scheduling problem," International Journal of Production Research (accepted).

KSCI

Capacitated Fab Scheduling Approximation using Average Reward TD() Learning based on System Feature Functions 시스템 특성함수 기반 평균보상 TD() 학습을 통한 유한용량 Fab 스케줄링 근사화

Capacitated Fab Scheduling Approximation using Average Reward TD( ${\lambda}$ ) Learning based on System Feature Functions