Implementation and Performance Evaluation of Software Distributed Shared Memory for SMP Clusters

SMP 클러스터를 위한 소프트웨어 분산 공유메모리의 구현 및 성능 측정

  • 이동현 (한국과학기술원 전자전산학과) ;
  • 이상권 (한국과학기술원 전자전산학과) ;
  • 박소연 (한국과학기술원 전자전산학과) ;
  • 맹승렬 (한국과학기술원 전자전산학과)
  • Published : 2003.08.01

Abstract

Low-cost commodity SMP(Symmetric Multiprocessor) is widely used as a node of cluster system. In this paper, we implement and evaluate the performance of SDSM system for SMP clusters. Our SDSM system provides HLRC(Home-based Lazy Release Consistency) memory consistency model. Our protocol utilize shared memory within same SMP node, so that page fetch and message passing through network can be reduced. It is implemented on 8 node of 2-way Pentium-III SMP interconnected with 100Mbps Fast Ethernet, and uses TCP/IP for transport/network layer protocol. The experiment with eight applications shows that our SMP protocol achieves maximum 33% speedup improvement and 13%-52% reduction of page fetch compared with uniprocessor protocol.

가격대비 성능이 좋은 저가의 상업용 SMP가 클러스터 시스템의 노드로 많이 사용되고 있다. 본 논문에서는 이러한 SMP 클러스터 상에서 KDSM을 확장해 소프트웨어 분산공유메모리를 구현하고 성능을 평가하였다. 본 논문의 SDSM 시스템은 HLRC 메모리 모델을 제공한다. 또한 같은 SMP 노드내에서 실행되는 프로세스 간에는 메모리 공유를 통해 페이지 획득 및 메시지 전달을 줄여 성능을 향상시켰다. 100Mbps Fast Ethernet으로 연결된 8노드의 2-way 펜티엄-III SMP 클러스터 상에서 구현되었고 통신계층은 TCP/IP를 사용한다. 8개의 응용프로그램을 실행시켜 얻은 성능 평가에서는 기존의 단일프로세스 프로토콜과 비교해 최대 33%의 성능 향상과 13%-52%의 페이지 획득 감소가 나타났다.

Keywords

References

  1. K. Li and P. Hudak. Memory coherence in shared virtual memory systems. In Proceeding of the 5th Annual ACM Symposium on Principles of Distributed Computing, August 1986 https://doi.org/10.1145/10590.10610
  2. P. Keleher and A. L. Cox and W. Zwaenepoel. Lazy release consistency for software distributed shared memory . In Proceedings of the 19th Annual Int'l Symposium on Computer Architecture, May 1992 https://doi.org/10.1145/139669.139676
  3. Y. Zhou, L. Iftode, and K. Li. Performance Evaluation of Two Home Based Lazy Release Consistency Protocols for Shared Virtual Memory Systems. In Proceedings of USENIX OSDI, October 1996 https://doi.org/10.1145/238721.238763
  4. A. Cox and S. Dwarkadas and P. Keleher and H. Lu and R. Rajamony and W. Zwaenepoel. Software versus hardware shared memory implementation: A case study. In Proceedings of the 21st Annual Int'l Symposium on Computer Architecture, April 1997 https://doi.org/10.1109/ISCA.1994.288157
  5. A. Erlichson and N. Nuckolls and G. Ghesson and J. Hennessy. SoftFLASH: analyzing the performance of clustered distributed virtual shared memory. In Proceedings of the 7th International Conference on Architectural Support for Programming Languages and Operating Systems, October 1997 https://doi.org/10.1145/237090.237187
  6. R. Stets and S. Dwarkadas and N. Hardavellas and G. Hunt and L. Kontothanassis and S. Parthasarathy and M. Scott. Cashmere 2L: Software coherent shared memory on a clustered remote write network. In Proceedings of the 16th ACM Symposium on Operating Systems Principles, October 1997 https://doi.org/10.1145/268998.266675
  7. D. J. Scale and K. Gharachorloo and A. Aggarwal. Fine grain software distributed shared memory on SMP clusters. In Proceedings of the 4th IEEE Symposium on High Performance Computer Architecture, February 1998 https://doi.org/10.1109/HPCA.1998.650552
  8. R. Samanta and A. Bilas and L. Iftode and J.P. Singh. Home based SVM protocols for SMP clusters: design and performance. In Proceedings of the 4th IEEE Symposium on High Performance Computer Architecture, February 1998 https://doi.org/10.1109/HPCA.1998.650551
  9. S. K. Lee and H. C. Yun and J. W. Lee and S. R. Maeng. Adaptive prefetching technique for shared virtual memory. In Proceedings of 3rd International Workshop on Software Distributed Shared Memory Systems, May 2001
  10. H. C. Yun and S. K. Lee and J. W. Lee and S. R. Maeng. An efficient lock protocol for home based lazy release consistency. In Proceedings of the 3rd International Workshop on Software Distributed Shared Memory Systems, May 2001
  11. K. Gharachorloo, D. Lenoski, P. Gibbons, A. Gupta and J. Hennessy. Memory Consistency and Event Ordering in Scalable Shared Memory Multiprocessors. In Proceedings of the 17th ISCA, May 1990 https://doi.org/10.1145/325164.325102
  12. A. Bilas. Improving the performance of shared virtual memory on system area networks. PhD thesis, Princeton University, November 1998
  13. W. Hu and W. Shi and Z. Tang. Reducing System Overheads in Home based Software DSMs. In Proceedings of the Second Merged Symp. IPPP/SPDP 1999. 1999
  14. S. Woo, M. Ohara, E. Torrie, J. Singh, and A. Gupta. The SPLASH 2 Programs: Characterization and Methodological Considerations. In Proceedings of the 22nd ISCA, May 1995
  15. D. Bailey and J. Barton and T. Lasinski and H. +Simon. The NAS Parallel Benchmarks. Technical Report 103863, NASA July 1993
  16. Honghui Lu and Sandhya Dwarkadas and Alan L. Cox and Willy Zwaenepoel. Quantifying the Performance Differences between PVM and TreadMarks. Journal of Parallel and Distributed Computing, 43(2):65 78, 1997 https://doi.org/10.1006/jpdc.1997.1332
  17. P. Keleher. CVM:The Coherent Virtual Machine. Technical Report 93 215, Department of CS, University of Maryland, September 1995