Browse > Article

Design and Implementation of an SCI-Based Network Cache Coherent NUMA System for High-Performance PC Clustering  

Oh Soo-Cheol (부산대학교 컴퓨터공학과)
Chung Sang-Hwa (부산대학교 컴퓨터공학과)
Abstract
It is extremely important to minimize network access time in constructing a high-performance PC cluster system. For PC cluster systems, it is possible to reduce network access time by maintaining network cache in each cluster node. This paper presents a Network Cache Coherent NUMA (NCC-NUMA) system to utilize network cache by locating shared memory on the PCI bus, and the NCC-NUMA card which is core module of the NCC-NUMA system is developed. The NCC-NUMA card is directly plugged into the PCI slot of each node, and contains shared memory, network cache, shared memory control module and network control module. The network cache is maintained for the shared memory on the PCI bus of cluster nodes. The coherency mechanism between the network cache and the shared memory is based on the IEEE SCI standard. According to the SPLASH-2 benchmark experiments, the NCC-NUMA system showed improvements of 56% compared with an SCI-based cluster without network cache.
Keywords
Network Cache; CC-NUMA; Shared Memory; SCI; PC Cluster; PCI;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Steven Cameron Woo, Moriyoshi Ohara, Evan Torrie, Jaswinder Pal Singh, and Anoop Gupta. 'The SPLASH-2 Programs: Characterization and Methodological Considerations,' In Proceedings of the 22nd International Symposium on Computer Architecture, pp. 24-36, Santa Margherita Ligure, Italy, June 1995
2 Anant Agarwal, Ricardo Bianchini, David Chaiken, Kirk L. Johnson, David Kranz, John Kubiatowicz, Beng-Hong Lim, Kenneth Machkenize, and Donald Yeung, 'The MIT Alewife Machine: A Large-Scale Distributed-Memory Multiprocessor,' MIT/ LCS Memo TM-454, Messachusetts Institute of Technology, 1991
3 R. Clark. 'SCI Interconnect Chipset and Adapter: Building Large Scale Enterprise Servers with Pent-ium Pro SHV Nodes,' White Paper, Data General Corporation, 1999
4 http://www-1.ibm.com/servers/eserver/xseries/numa/ index.html
5 http://www.plxtech.com
6 P. Keleher, S. Dwarkadas, A.L. Cox, and W. Zwaenepoel, 'TreadMarks: Distributed Shared Memory on Standard Workstations and Operating Systems,' Proceedings of the Winter 94 Usenix Conference, pp. 115-131, January 1994
7 Jeffrey Kuskin, David Ofelt, Mark Heinrich, John Heinlein, Richard Simoni, Kourosh Gharachorloo, John Chapin, David Nakahira, Joel Baxter, Mark Horowitz, Anoop Gupta, Mendel Rosenblum, and John Hennessy, 'The Stanford FLASH Multiprocessor,' Proceedings of the 21st Annual International Symposium on Computer Architecture, 1994   DOI
8 Georg Acher, Wolfgang Karl, and Markus Lebe-recht, 'The TUM PCI/SCI Adapter,' Scalable Coherent Interface/SCI, Architecture and Software for High-Performance Compute Clusters, LNCS State-of-the-Art Survey, October 1999
9 Mario Trams, Wolfgang Rehm, Daniel Balkanski, Stanislav Simeonov, 'Memory Management in a combined VIA/SCI Hardware,' IPDPS 2000 Workshops, pp. 4-15 Cancun, Mexico, May 2000
10 M. Schulz, J. Tao, C. Trinitis, and W. Karl, 'SMiLE: An Integrated, Multi-Paradigm Software Infrastructure for SCI-based Clusters,' 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid), Berlin, Germany , May, 2002   DOI
11 Emmanuel Cecchet, 'Memory Mapped Networks: A New Deal for Distributed Shared Memories? The SciFS Experience,' IEEE International Conference on Cluster Computing (CLUSTER'02), September, 2002   DOI
12 H. Ong and P. A. Farrell, 'Performance Comparison of LAM/MPI, MPICH, and MVICH on a Linux Cluster connected by a Gigabit Ethernet Network,' Proceedings of the 4th Annual Linux Showcase & Conference, Atlanta, Georgia, USA, October 2000
13 Sang-Hwa Chung, Soo-Cheol Oh, Se-Jin Park, Han-Kook Jang, Chi-Jung Ha, 'A CC-NUMA Prototype Card for SCI-Based PC Clustering,' Proceedings of IEEE International Conference on Cluster Computing, Nov. 2000   DOI
14 Sang-Hwa Chung, Soo-Cheol Oh, 'An SCI-Based PC Cluster Utilizing Coherent Network Cache,' Cluster Computing, Vol. 6, Issue. 2, pp. 153-159, Apr. 2003   DOI
15 http://www.dolphinics.no/dolphin2/interconnect/index.html
16 http://www.myri.com
17 IEEE Standard for Scalable Coherent Interface (SCI), IEEE Computer Society, August 1993