Myrinet을 위한 흐름 제어 기능을 갖는 UDP

UDP with Flow Control for Myrinet

  • 발행 : 2003.10.01

초록

클러스터와 같은 네트워크 컴퓨팅 환경에서는 신속하고 신뢰성이 보장되는 데이타 전송이 요구된다. 일반적으로 신뢰성을 보장하기 위해서 사용되는 전송 프로토콜은 TCP이다. TCP는 신뢰성을 보장하기 위해서 혼잡 제어, 흐름 제어, 재전송 둥을 수행한다. 본 논문은 클러스터의 하부 네트워크로서 많이 사용되는 Myrinet을 분석한 결과, 네트워크 혼잡으로 인한 패킷 손실이 Myrinet에서는 발생하지 않음을 새롭게 보인다. 또한 Myrinet에서는 패킷의 순서 뒤바뀜과 손실이 발생하지 않음을 확인한다. 따라서 TCP의 혼잡 제어, 패킷 순서화, 재전송 등과 같은 신뢰성을 위한 기법들은 불필요한 오버헤드를 발생시킨다. 본 논문은 Myrinet에서 신뢰성을 보장하기 위한 최소한의 기능이 흐름 제어임을 보이고, TCP보다 오버헤드가 적은 UDP에 흐름 제어만을 구현한 RUM(Reliable UDP on Myrinet)을 제안한다. 성능 측정결과, RUM은 신뢰성을 보장함과 동시에, TCP보다 최대 45% 높은 처리량을 보이며 UDP와 비슷한 낮은 단방향 지연시간을 가짐을 알 수 있다.

Network-based computing such as cluster computing requires a reliable high-speed transport protocol. TCP is a representative reliable transport protocol on the Internet, which implements many mechanisms, such as flow control, congestion control, and retransmission, for reliable packet delivery. This paper, however, finds out that Myrinet does not incur any packet losses caused by network congestion. In addition, we ascertain that Myrinet supports reliable and ordered packet delivery. Consequently, most of reliable routines implemented in TCP produce unnecessarily additional overheads on Myrinet. In this paper, we show that we can attain the reliability only by flow control on Myrinet and propose a new reliable protocol based on UDP named RUM (Reliable UDP on Myrinet) that performs a flow control. As a result, RUM achieves a higher throughput by 45% than TCP and shows a similar one-way latency to UDP.

키워드

참고문헌

  1. N. Boden, D. Coben, R. Feldeman, A. Kulawik, C. Seitz, J. Seizovic, and W.-K. Su, 'Myrinet-a gigabit-per-second local area network,' IEEE Micro, February 1995 https://doi.org/10.1109/40.342015
  2. P. Kermani and L. Kleinrock, 'Virtual Cutthrough; A New Computer communication switching technique,' Computer Networks, Vol. 3, pp. 267-286, 1979 https://doi.org/10.1016/0376-5075(79)90032-1
  3. The VITA Standards Organization (VSO), Myrinet-on-VME Protocol Specification Draft Standard, VITA 26-199x Draft 1.1, August 1998
  4. S.J. Golestani, 'Congestion-free Communication in high-speed packet networks,' IEEE Trans. on Communications, Vol. 39, No. 12, pages 1802-1812, December, 1991 https://doi.org/10.1109/26.120166
  5. W. Stevens, TCP/IP Illustrated, Volume I: The Protocols, Addison-Wesley, 1994
  6. D. Dunning, G. Regnier, G. NcAlpine, D. Cameron, B. Chubert, F. Berry, A.M. Merritt, E. Gronke, and C. Dodd, 'The Virtual Interface Architecture,' IEEE Micro, Vol. 18, No. 2, pp. 66-76, March/April 1998 https://doi.org/10.1109/40.671404
  7. Myricom Inc., The GM Message Passing System, http;//www.myri.com, January 2000
  8. Information Networks Division;Hewlett-Packard Company, Netperf: A Network performance Benchmark, http;//netperf.org/netperf/NetperfPage.html, February 1995
  9. S. Pakin, M. Lauria, and A. Chien, 'High Performance Messaging on Workstations : Illinois Fast Messages (FM) For Myrinet,' Proceedings of Supercomputing '95, 1995 https://doi.org/10.1145/224170.224360
  10. J. Chase, A. Callatin, and K. Yocum, 'End-System Optimization for High-Speed TCP,' IEEE Communications Magazine, Vol. 39, No. 4, pp. 68-75, April 2001 https://doi.org/10.1109/35.917506
  11. E. He, J. Leigh, O. Yu, and T.A. DeFanti, 'Reliable Blast UDP: Predictable High Performance Bulk Data Transfer,' Proceedings of IEEE Cluster Computing 2002, Chicago, Illinois, September 2002 https://doi.org/10.1109/CLUSTR.2002.1137760
  12. C. Yoo, H.-W. Jin, and S.-C. Kwon, 'Asynchronous UDP,' IEICE Transactions on Communications, Vol. E84-B, No. 12, pp. 3243-3251 December 2001
  13. H.-W. Jin, C. Yoo, and S.-K. Park, 'Stepwise optimizations of UDP/IP on a Gigabit Network,' Proceedings of the 8th International Euro-Par Conference on Parallel Processing. LNCS, Vol. 2400, pp. 745-748, August 2002
  14. H.-W. Jin and C. Yoo, 'Analysis and Enhancement of Pipelining the Protocol Overheads for a High Throughput,' Proceedings of The 2003 International Conference on Parallel and Distributed Processing Techniques and Applications, pp. 935-941, June 2003