Browse > Article
http://dx.doi.org/10.3837/tiis.2020.12.006

Controller Backup and Replication for Reliable Multi-domain SDN  

Mao, Junli (The 54th Research Institute of CETC)
Chen, Lishui (The 54th Research Institute of CETC)
Li, Jiacong (China Telecom Corporation Limited Research Institute)
Ge, Yi (State Key Laboratory of Networking and Switching Technology Beijing University of Posts and Telecommunications)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.14, no.12, 2020 , pp. 4725-4747 More about this Journal
Abstract
Software defined networking (SDN) is considered to be one of the most promising paradigms in the future. To solve the scalability and performance problem that a single and centralized controller suffers from, the distributed multi-controller architecture is adopted, thus forms multi-domain SDN. In a multi-domain SDN network, it is of great importance to ensure a reliable control plane. In this paper, we focus on the reliability problem of multi-domain SDN against controller failure from perspectives of backup controller deployment and controller replication. We firstly propose a placement algorithm for backup controllers, which considers both the reliability and the cost factors. Then a controller replication mechanism based on shared data storage is proposed to solve the inconsistency between the active and standby controllers. We also propose a shared data storage layout method that considers both reliability and performance. Besides, a fault recovery and repair process is designed based on the controller backup and shared data storage mechanism. Simulations show that our approach can recover and repair controller failure. Evaluation results also show that the proposed backup controller placement approach is more effective than other methods.
Keywords
SDN; Backup Controller Placement; Shared Data Storage; Failure Recovery;
Citations & Related Records
연도 인용수 순위
  • Reference
1 H. Kim and N. Feamster, "Improving network management with software defined networking," IEEE Communications Magazine, vol. 51, no. 2, pp. 114-119, Feb. 2013.   DOI
2 P. H. Isolani, J. A. Wickboldt, C. B. Both, J. Rochol, and L. Z. Granville, "Interactive monitoring, visualization, and configuration of OpenFlow-based SDN," in Proc. of 2015 IFIP/IEEE International Symposium on Integrated Network Management (IM), pp. 207-215, May 2015.
3 T. Hu, Z. Guo, P. Yi, T. Baker, and J. Lan, "Multi-controller Based Software-Defined Networking: A Survey," IEEE Access, vol. 6, pp. 159a80-15996, Mar. 2018,   DOI
4 T. Hu, Z. Guo, P. Yi, T. Baker, and J. Loutievski, M. Zhu, R. Ramanathan, Y. Iwata and H. Inoue, "Onix: A Distributed Control Platform for Large-scale Production Networks," in Proc. of the 9th USENIX OSDI, pp. 351-364, Oct. 2010.
5 A. Tootoonchian and Y. ganjali, "HyperFlow: A Distributed Control Plane for OpenFlow," USENIX INM/WREN, 2010.
6 P. Lin, S. Wolff, Y. Wang, A. Xu, and Z. Chen, "A west-east bridge based SDN inter-domain testbed," IEEE Communications Magazine, vol. 53, no. 2, pp. 190-197, Feb. 2015.   DOI
7 Y. Jimenez, C. Cervello-Pastor and A. J. Garcia, "On the controller placement for designing distributed SDN control layer," in Proc. of Networking Conference, IFIP. Trondheim, pp. 1-9, June 2014.
8 Fan Y , Ouyang T, " Reliability-Aware Controller Placements in Software Defined Networks," in Proc. of 2019 IEEE 21st International Conference on High Performance Computing and Communications; IEEE 17th International Conference on Smart City; IEEE 5th International Conference on Data Science and Systems, pp. 2133-2140, 2019.
9 Y. Hu, W. Wang, X. Gong, X. Que, and S. Cheng, "On reliability-optimized controller placement for Software-Defined Networks," Communications, vol. 11, no. 2, pp. 38-54, Feb. 2014.
10 L. F. Muller, R. R. Oliveira, M. C. Luizelli, L. P. Gaspary, and M. P. Barcellos, "Survivor: An enhanced controller placement strategy for improving SDN survivability," in Proc. of 2014 IEEE Global Communications Conference, pp. 1909-1915, Dec. 2014.
11 Y. Zhang, N. Beheshti, and M. Tatipamula, "On Resilience of Split-Architecture Networks," in Proc. of 2011 IEEE Global Telecommunications Conference-GLOBECOM 2011, pp. 1-6, 2011.
12 Y. Wang, Q. Zhong, X. Qiu, and W. Li, "Resource Allocation for Reliable Communication Between Controllers and Switches in SDN," Journal of Network & Systems Management, pp. 966-992, 2018.   DOI
13 P. Fonseca, R. Bennesby, E. Mota, and A. Passito, "A replication component for resilient OpenFlow-based networking," in Proc. of Network Operations and Management Symposium, pp. 933-939, 2012.
14 P. Fonseca, R. Bennesby, E. Mota, and A. Passito, "Resilience of SDNs based on active and passive replication mechanisms," in Proc. of IEEE Global Communications Conference, pp. 188-2193, 2013.
15 E. S. Spalla, D. R. Mafioletti, A. B. Liberato, G. Ewald, C. E. Rothenberg, L. Camargos, R. S. Villace, and M. artinello, "AR2C2: Actively replicated controllers for SDN resilient control plane," in Proc. of NOMS 2016-2016 IEEE/IFIP Network Operations and Management Symposium, pp. 189-196, 2016.
16 N. Katta, H. Zhang, M. Freedman, and J. Rexford, "Ravana: Controller fault-tolerance in softwaredefined networking," in Proc. of The Symposium on SDN Research (SOSR), no. 4, pp. 1-12, 2015.
17 F. A. Botelho, F. M. V. Ramos, D. Kreutz, and A. N. Bessani, "On the Feasibility of a Consistent and Fault-Tolerant Data Store for SDNs," Second European Workshop on Software Defined Networks, IEEE Computer Society, pp. 38-43, 2013.
18 J. Kennedy and R. Eberhart, "Particle swarm optimization," in Proc. of ICNN'95 - International Conference on Neural Networks, vol.4, pp. 1942-1948, 1995.
19 F. Botelho, A. Bessani, F. M. V. Ramos, and P. Ferreira, "On the Design of Practical Fault-Tolerant SDN Controllers," in Proc. of European Workshop on Software Defined Networks, pp. 73-78, 2014.
20 M. M. Tajiki, M. Shojafar, B. Akbari, S. Salsano, and M. Martinello, "Software defined service function chaining with failure consideration for fog computing," Concurrency and Computation Practice and Experience, 2019.
21 S. Orlowski, R. Wessaly, M.Pioro, and A. Tomaszewski, "SNDlib 1.0-Survivable Network Design Library," Networks, vol. 55, no. 3, pp. 276-286, 2019.   DOI
22 N. McKeown, T. Anderson, H. Balakrishnan, G. M. Parulkar, L. L. Peterson, J. Rexford, S. Shenker, and J. S. Turner, "Openflow: enabling innovation in campus networks," Computer Communication Review, vol. 38, no.2, pp. 69-74, 2008.
23 O. N. Foundation. (2011) Openflow switch specification version 1.2.0 (wire protocol 0x04), [Online]. Available: https://www.opennetworking.org/images/stories/downloads/sdnresources/
24 A. Medina, A. Lakhina, I. Matta, and J. Byers, "BRITE: an approach to universal topology generation," in Proc. of Ninth International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, pp. 346-353, 2001.
25 P. Gill, N. Jain, and N. Nagappan, "Understanding network failures in data centers: measurement, analysis, and implications," in Proc. of the ACM SIGCOMM 2011, pp. 350-361, 2011.
26 W. Ding, H. Yu, and S. Luo, "Enhancing the reliability of services in NFV with the cost-efficient redundancy scheme," in Proc. of 2017 IEEE International Conference on Communications, pp. 1-6, 2017.
27 B. Heller, R. Sherwood, and N. McKeown, "The controller placement problem," Acm Sigcomm Computer Communication Review, vol. 42, no. 4, pp. 7-12, 2013.
28 J. Zhang, Z. Wang, C. Peng, L. Zhang, T. Huang, and Y. Liu, "RABA: Resource-Aware Backup Allocation for A Chain of Virtual Network Functions," in Proc. of IEEE INFOCOM 2019 - IEEE Conference on Computer Communications, pp. 1918-1926, 2019.
29 J. Kong, I. Kim, X. Wang, Q. Zhang, H.C. Cankaya, and W. Xie, T. Ikeuchi and J. P. Jue, "Guaranteed-Availability Network Function Virtualization with Network Protection and VNF Replication," in Proc. of GLOBECOM 2017-2017 IEEE Global Communications Conference, pp. 1-6, 2017.
30 J. Fan, M. Jiang, and C. Qiao, "Carrier-grade availability-aware mapping of Service Function Chains with on-site backups," in Proc. of 2017 IEEE/ACM 25th International Symposium on Quality of Service (IWQoS), pp. 1-10, 2017.
31 LINDO Systems, Inc. [Online]. Available: http://www.lindo.com/