Browse > Article
http://dx.doi.org/10.3837/tiis.2016.03.002

A Multi-objective Optimization Approach to Workflow Scheduling in Clouds Considering Fault Recovery  

Xu, Heyang (School of Computer Science and Engineering, University of Electronic Science and Technology of China)
Yang, Bo (School of Computer Science and Engineering, University of Electronic Science and Technology of China)
Qi, Weiwei (School of Computer Science and Engineering, University of Electronic Science and Technology of China)
Ahene, Emmanuel (School of Computer Science and Engineering, University of Electronic Science and Technology of China)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.10, no.3, 2016 , pp. 976-995 More about this Journal
Abstract
Workflow scheduling is one of the challenging problems in cloud computing, especially when service reliability is considered. To improve cloud service reliability, fault tolerance techniques such as fault recovery can be employed. Practically, fault recovery has impact on the performance of workflow scheduling. Such impact deserves detailed research. Only few research works on workflow scheduling consider fault recovery and its impact. In this paper, we investigate the problem of workflow scheduling in clouds, considering the probability that cloud resources may fail during execution. We formulate this problem as a multi-objective optimization model. The first optimization objective is to minimize the overall completion time and the second one is to minimize the overall execution cost. Based on the proposed optimization model, we develop a heuristic-based algorithm called Min-min based time and cost tradeoff (MTCT). We perform extensive simulations with four different real world scientific workflows to verify the validity of the proposed model and evaluate the performance of our algorithm. The results show that, as expected, fault recovery has significant impact on the two performance criteria, and the proposed MTCT algorithm is useful for real life workflow scheduling when both of the two optimization objectives are considered.
Keywords
Cloud computing; workflow scheduling; fault recovery; multi-objective optimization; heuristic-based algorithm;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 M. Armbrust, A. Fox, R. Griffith, A.D. Joseph, R. Katz, A. Konwinski, G. Lee, D. Patterson, A. Rabkin, I. Stoica, M. Zaharia, “A view of cloud computing,” Communications of the ACM, vol. 53, no. 4, pp. 50-58, 2010. Article (CrossRef Link).   DOI
2 X. Yang, B. Nasser, M. Surridge, S. Middleton, “A business-oriented Cloud federation model for real-time applications,” Future Generation Computer Systems, vol. 28, no. 8, pp. 1158-1167, 2012. Article (CrossRef Link).   DOI
3 K. Bessai, S. Youcef, A. Oulamara, C. Godart, "Bi-criteria strategies for business processes scheduling in cloud environments with fairness metrics," in Proc. of IEEE Seventh International Conference on Research Challenges in Information Science, pp. 1-10, 2013. Article (CrossRef Link).
4 S. Abrishami, M. Naghibzadeh, D.H. Epema, “Deadline-constrained workflow scheduling algorithms for infrastructure as a service clouds,” Future Generation Computer Systems, vol. 29, no. 1, pp. 158-169, 2013. Article (CrossRef Link).   DOI
5 M. Malawski, G. Juve, E. Deelman, J. Nabrzyski, “Algorithms for cost- and deadline-constrained provisioning for scientific workflow ensembles in IaaS clouds,” Future Generation Computer Systems, vol.48, pp. 1-18, 2015. Article (CrossRef Link).   DOI
6 L. Zeng, B. Veeravalli, X. Li, "ScaleStar: Budget conscious scheduling precedence-constrained many-task workflow applications in cloud," in Proc. of IEEE 26th International Conference on Advanced In-formation Networking and Applications, pp. 534-541, 2012. Article (CrossRef Link).
7 H. Arabnejad, J.G. Barbosa, “A budget constrained scheduling algorithm for workflow applications,” Journal of Grid Computing, vol. 12, no.4, pp. 665-679, 2014. Article (CrossRef Link).   DOI
8 R. Sakellariou, H. Zhao, E. Tsiakkouri, M.D. Dikaiakos, “Scheduling workflows with budget constraints,” Integrated Research in Grid Computing, pp. 189-202, 2007. Article (CrossRef Link).
9 Y. Yuan, X. Li, Q. Wang, X. Zhu, “Deadline division-based heuristic for cost optimization in workflow scheduling,” Information Sciences, vol. 179, no. 15, pp. 2562-2575, 2009. Article (CrossRef Link).   DOI
10 M. Mao, M. Humphrey, "Auto-scaling to minimize cost and meet application deadlines in cloud workflows," in Proc. of 2011 Inter-national Conference for High Performance Computing, Networking, Storage and Analysis. Article (CrossRef Link).
11 M.A. Rodriguez, R. Buyya, “Deadline based resource provisioning and scheduling algorithm for scientific workflows on clouds,” IEEE Transactions on Cloud Computing, vol. 2, no. 2, pp. 222-235, 2014. Article (CrossRef Link).   DOI
12 R.N. Calheiros, R. Buyya, “Meeting deadlines of scientific workflows in public clouds with tasks replication,” IEEE Transactions on Parallel and Distributed Systems, vol. 25, no. 7, pp. 1787-1796, 2013. Article (CrossRef Link).   DOI
13 H. Xu, B. Yang, “An incentive-based heuristic job scheduling algorithm for utility grids,” Future Generation Computer Systems, vol. 49, pp. 1-7, 2015. Article (CrossRef Link).   DOI
14 B. Yang, F. Tan, Y. Dai, S. Guo, "Performance evaluation of cloud service considering fault recovery," in Proc. of the First International Conference on Cloud Computing, pp. 571-576, 2009. Article (CrossRef Link).
15 B. Yang, F. Tan, Y. Dai, “Performance evaluation of cloud service considering fault recovery,” The Journal of Supercomputing, vol. 65, no. 1, pp. 426-444, 2013. Article (CrossRef Link).   DOI
16 C.Q. Wu, X. Lin, D. Yu, W. Xu, L. Li, “End-to-end delay minimization for scientific workflows in clouds under budget constraint,” IEEE Transactions on Cloud Computing, vol. 3, no. 2, pp. 169-181, 2015. Article (CrossRef Link).   DOI
17 H. Topcuoglu, S. Hariri, M. Wu, “Performance-effective and low-complexity task scheduling for heterogeneous computing,” IEEE Transactions on Parallel and Distributed Systems, vol. 13, no. 3, pp. 260-274, 2002. Article (CrossRef Link).   DOI
18 A.C. Zhou, B. He, "Simplified resource provisioning for workflows in IaaS clouds," in Proc. of IEEE 6th International Conference on Cloud Computing Technology and Science, pp. 650-655, 2014. Article (CrossRef Link).
19 J. Zhang, J. Luo, F. Dong, “Scheduling of scientific workflow in non-dedicated heterogeneous multicluster platform,” Journal of Systems and Software, vol. 86, no. 7, pp. 1806-1818, 2013. Article (CrossRef Link).   DOI
20 C. Lin, S. Lu, "Scheduling scientific workflows elastically for cloud computing," in Proc. of IEEE 4th International Conference on Cloud Computing, pp. 246-247, 2011. Article (CrossRef Link).
21 Y.W. Ahn, A.M.K. Cheng, J. Baek, M. Jo, H.H. Chen, “An auto-scaling mechanism for virtual resources to support mobile, pervasive, real-time healthcare applications in cloud computing,” IEEE Network, vol. 27, pp. 62-68, 2013. Article (CrossRef Link).   DOI
22 D. Jung, T. Suh, H. Yu, J. Gil, “A workflow scheduling technique using genetic algorithm in spot instance-based cloud,” KSII Transactions on Internet and Information Systems, vol.8, no. 9, pp. 3126-3145, 2014. Article (CrossRef Link).   DOI
23 J. Sahni, D. Vidyarthi, “A cost-effective deadline-constrained dynamic scheduling algorithm for scientific workflows in a cloud environment,” IEEE Transactions on Cloud Computing, preprint, 2015. Article (CrossRef Link).
24 D. Poola, K. Ramamohanarao, R. Buyya, “Fault-tolerant workflow scheduling using spot instances on clouds,” Procedia Computer Science, vol. 29, pp. 523-533, 2014. Article (CrossRef Link).   DOI
25 D. Poola, S.K. Garg, R.Buyya, Y. Yang, K. Ramamohanarao, "Robust scheduling of scientific workflows with deadline and budget constraints in clouds," in Proc. of IEEE 28th International Conference on Advanced Information Networking and Applications, pp. 858-865, 2014. Article (CrossRef Link).
26 S. Jang, V. Taylor, X. Wu, M. Prajugo, E. Deelman, G. Mehta, K. Vahi, "Performance prediction-based versus load-based site selection: Quantifying the difference," in Proc. of the 18th International Conference on Parallel and Distributed Computing Systems, pp. 148-153, 2005. Article (CrossRef Link)
27 M. De Felice, X. Yao, “Short-term load forecasting with neural network ensembles: A comparative study,” IEEE Computational Intelligence Magazine, vol. 6, no. 3, pp. 47-56, 2011. Article (CrossRef Link).   DOI
28 G. Juve, A. Chervenak, E. Deelman, S. Bharathi, “Characterizing and profiling scientific workflows,” Future Generation Computer Systems, vol. 29, no. 3, pp. 682-692, 2013. Article (CrossRef Link).   DOI
29 Y.S Dai, G. Levitin, K.S. Trivedi, “Performance and reliability of tree-structured grid services considering data dependence and failure correlation,” IEEE Transactions on Computers, vol. 56, no. 7, pp. 925–936, 2007. Article (CrossRef Link).   DOI
30 Y.K. Kwok, I. Ahmad, “Static scheduling algorithms for allocating directed task graphs to multiprocessors,” ACM Computing Surveys, vol. 34, no. 4, pp. 406-470, 1999. Article (CrossRef Link).   DOI
31 Workflow Generator, https://confluence.pegasus.isi.edu/display/pegasus/WorkflowGenerator, 2014.
32 S. Bharathi, A. Chervenak, E. Deelman, G. Mehta, M. Su, K. Vahi, "Characterization of scientific workflows," in Proc. of Third Workshop on Workflows in Support of Large Scale Science, pp. 1-10, 2008. Article (CrossRef Link).
33 R.N. Calheiros, R. Ranjan, A. Beloglazov, C. De Rose, R. Buyya, “CloudSim: A toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms,” Software-Practice and Experience, vol. 41, pp. 23-50, 2011. Article (CrossRef Link).   DOI
34 B. Yang, H. Hu, S. Guo, “Cost-oriented task allocation and hardware redundancy policies in heterogeneous distributed computing systems considering software reliability,” Computers & Industrial Engineering, vol. 56, no. 4, pp. 1687-1696, 2009. Article (CrossRef Link).   DOI
35 A. Verma, S. Kaushal, "Bi-criteria priority based particle swarm optimization workflow scheduling algorithm for cloud," in Proc. of 2014 Recent Advances in Engineering and Computational Sciences, pp. 1-6, 2014. Article (CrossRef Link).
36 K. Deb. Multi-objective optimization. in Search Methodologies, pp. 403-449, Springer, 2014. Article (CrossRef Link).
37 R.T. Marler, J.S. Arora, “Survey of multi-objective optimization methods for engineering,” Structural and Multidisciplinary Optimization, vol. 26, pp. 369-395, 2004. Article (CrossRef Link).   DOI