Browse > Article
http://dx.doi.org/10.3837/tiis.2014.03.003

FaST: Fine-grained and Scalable TCP for Cloud Data Center Networks  

Hwang, Jaehyun (Bell Labs, Alcatel-Lucent)
Yoo, Joon (Department of Software Design & Management, Gachon University)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.8, no.3, 2014 , pp. 762-777 More about this Journal
Abstract
With the increasing usage of cloud applications such as MapReduce and social networking, the amount of data traffic in data center networks continues to grow. Moreover, these appli-cations follow the incast traffic pattern, where a large burst of traffic sent by a number of senders, accumulates simultaneously at the shallow-buffered data center switches. This causes severe packet losses. The currently deployed TCP is custom-tailored for the wide-area Internet. This causes cloud applications to suffer long completion times towing to the packet losses, and hence, results in a poor quality of service. An Explicit Congestion Notification (ECN)-based approach is an attractive solution that conservatively adjusts to the network congestion in advance. This legacy approach, however, lacks scalability in terms of the number of flows. In this paper, we reveal the primary cause of the scalability issue through analysis, and propose a new congestion-control algorithm called FaST. FaST employs a novel, virtual congestion window to conduct fine-grained congestion control that results in improved scalability. Fur-thermore, FaST is easy to deploy since it requires only a few software modifications at the server-side. Through ns-3 simulations, we show that FaST improves the scalability of data center networks compared with the existing approaches.
Keywords
Scalable congestion control; cloud data center networks; virtual congestion window;
Citations & Related Records
연도 인용수 순위
  • Reference
1 J. Dean and S. Ghemawat, "MapReduce: Simplified Data Processing on Large Clusters," in Proc. of USENIX OSDI, 2004. .
2 D. Beaver, S. Kumar, H. C. Li, J. Sobel, and P. Vajgel, "Finding a Needle in Haystack: Facebook's Photo Storage," in Proc. of USENIX OSDI, 2010.
3 G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels, "Dynamo: Amazon's Highly Available Key-value Store," in Proc. of ACM SOSP, 2007. .
4 A. Phanishayee, E. Krevat, V. Vasudevan, D. G. Anderson, G. R. Ganger, G. A. Gibson, and S. Seshan, "Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems," in Proc. of USENIX FAST, 2008.
5 Y. Chen, R. Griffith, J. Liu, R. H. Katz, and A. D. Joseph, "Understanding TCP Incast Throughput Collapse in Datacenter Networks," in Proc. of ACM WREN, 2009.
6 V. Vasudevan, A. Phanishayee, H. Shah, E. Krevat, D. G. Anderson, G. R. Ganger, G. A. Gibson, and B. Mueller, "Safe and Effective Finegrained TCP Retransmissions for Datacenter Commu-nication," in Proc. of ACM SIGCOMM, 2009.
7 M. Alizadeh, A. Greenberg, D. A. Maltz, J. Padhye, P. Patel, B. Prabhakar, S. Sengupta, and M. Sridharan, "Data Center TCP (DCTCP)," in Proc. of ACM SIGCOMM, 2010.
8 H. Wu, Z. Feng, C. Guo, and Y. Zhang, "ICTCP: Incast Congestion Control for TCP in Data Center Networks," in Proc. of ACM CoNEXT, 2010.
9 J. Hwang, J. Yoo, and N. Choi, "IA-TCP: A Rate Based Incast-Avoidance Algorithm for TCP in Data Center Networks," in Proc. of IEEE ICC, 2012.
10 B. Vamanan, J. Hasan, and T. N. Vijaykumar, "Deadline-Aware Datacenter TCP (D2TCP)," in Proc. of ACM SIGCOMM, 2012.
11 C. Wilson, H. Ballani, T. Karagiannis, and A. Rowstron, "Better Never than Late: Meeting Deadlines in Datacenter Networks," in Proc. of ACM SIGCOMM, 2011.
12 D. Zats, T. Das, P. Mohan, and R. Katz, "DeTail: Reducing the Flow Completion Time Tail in Datacenter Networks," in Proc. of ACM SIGCOMM, 2012.
13 K. Ramakrishnan, S. Floyd, and D. Black, "The addition of explicit congestion notification (ECN) to IP," RFC 3168, IETF, 2001
14 D. Nagle, D. Serenyi, and A. Matthews, "The Panasas ActiveScale Storage Cluster-Delivering Scalable High Bandwidth Storage," in Proc. of ACM/IEEE SC2004 Conference, 2004.
15 The ns-3 discrete-event network simulator. [Online]. Available: http://www.nsnam.org/.
16 D. D. Clark, "Window and Acknowledgement Strategy in TCP," RFC 813, IETF, Jul. 1982.
17 T. Benson, A. Akella, and D. A. Maltz, "Network Traffic Characteristics of Data Centers in the Wild," in Proc. of ACM IMC, 2010.
18 R. Jain, "The Art of Computer Systems Performance Analysis," John Wiley & Sons, 1991.