Browse > Article

Call-Site Tracing-based Shared Memory Allocator for False Sharing Reduction in DSM Systems  

Lee, Jong-Woo (숙명여자대학교 멀티미디어과학과)
Abstract
False sharing is a result of co-location of unrelated data in the same unit of memory coherency, and is one source of unnecessary overhead being of no help to keep the memory coherency in multiprocessor systems. Moreover. the damage caused by false sharing becomes large in proportion to the granularity of memory coherency. To reduce false sharing in a page-based DSM system, it is necessary to allocate unrelated data objects that have different access patterns into the separate shared pages. In this paper we propose call-site tracing-based shared memory allocator. shortly CSTallocator. CSTallocator expects that the data objects requested from the different call-sites may have different access patterns in the future. So CSTailocator places each data object requested from the different call-sites into the separate shared pages, and consequently data objects that have the same call-site are likely to get together into the same shared pages. We use execution-driven simulation of real parallel applications to evaluate the effectiveness of our CSTallocator. Our observations show that by using CSTallocator a considerable amount of false sharing misses can be additionally reduced in comparison with the existing techniques.
Keywords
False Sharing; Distributed Shared Memory; Dynamic Memory Allocation; Call Site Tracing;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Steven Cameron Woo, Moriyoshi Ohara, Evan Torrie, Jswinder Pal Singh, and Anoop Gupta. The SPLASH2 Programs: Characterization and Methodological Considerations. In Proceedings of the 22nd Annual International Symposium on Computer Architecture, pages 24-36, June 1995
2 Maged M. Michael. Scalable Lock-Free Dynamic Memory Allocation. In Proceedings of the ACM SIGPLAN 2004 Conference on Programming Language Design and Implementation(PLDI'04), June 2004
3 J. E. Veenstra. MINT Tutorial and User Manual. Technical Report TR452, Computer Science Department, University of Rochester, July 1993
4 J. E.Veenstra and R.J.Fowler. 'MINT : A Front End for Efficient Simulation of Shared-Memory Multiprocessors,' Proc., 2nd Int'l Workshop on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, pp. 201-207, Jan. 1994   DOI
5 J. P. Singh, W. Weber, and A. Gupta. SPLASH: Stanford Parallel Applications for Shared-Memory. ACM SIGARCH Computer Architecture News, 20(1):5-44, March 1992   DOI
6 Emery D. Berger, Kathryn S. McKinley, Robert D. Blumofe, and Paul R Wilson. Hoard: A scalable memory allocator for multithreaded applications. In Proceedings of the 9th International Conference on Architectural Support for Programming Languages and Operating Systems, pages 117-128, November 2000
7 한부형, 조성제, 조유근, 분산 공유 메모리 시스템에서 거짓 공유 제거 및 통신량 감소 기법. 정보과학회논문지, 25(10):1100-1108, October 1998
8 조성제. 분산 공유 메모리 시스템에서 메모리 참조 패턴에 근거한 거짓 공유 감소 기법. 정보처리논문지, 7(4):1082-1091, April 2000   과학기술학회마을
9 한부형, 조성제. 다중처리기 시스템에서 거짓 공유 완화를 위한 메모리 할당 기법. 정보과학회논문지, 27(4): 383-393, April 2000
10 Emery D. Berger. Memory Management for High-Performance Applications. PhD thesis, University of Texas at Austin, August 2002
11 JongWoo Lee and Yookun Cho. An Effective Shared Memory Allocator for Reducing False Sharing in NUMA Multiprocessors. In Proceedings of 1996 IEEE 2nd International Conference on Algorithms & Architectures for Parallel Processing($JCA^3PP$ '96), pages 373-382, June 1996
12 Roger L. Adema and Carla Schlatter Ellis. Memory Allocation Constructs to Complement NUMA Memory Management. In Proceedings of the 3rd IEEE Symposium on Parallel and Distributed Processing, December 1991
13 이종우, 김문희, 한 장희, 지대구, 윤종완, 김장선, 분산 공유 메모리 시스템에서 동적 공유 메모리 할당 기법이 거짓 공유에 미치는 영향. 정보과학회논문지, 24(12): 1257-1269, December 1997
14 Andrew S. Tanenbaum. Distributed Operating Systems, chapter 6, pages 333-345. PRENTICE HALL,1995
15 이종우, 조유근. NUMA 다중 처리기에서 조정 가능한 지연 카운터를 이용한 페이지 복사 기법. 전자공학회논문지, 33(6):23-33, June 1996   과학기술학회마을
16 Josep Torrellas, Monica S. Lam, and John L. Hennessy. Shared Data Placement Optimizations to Reduce Multiprocessor Cache Miss Rates. In Proceedings of the 1990 International Conference on Parallel Processing, volume II(Software), pages 266-270, August 1990
17 Susan J. Eggers and Tor E. Jeremiassen. Eliminating False Sharing. In Proceedings of the 1991 International Conference on Parallel Processing, volume I(Architecture), pages 377-381, August 1991
18 이종우, 조유근. NUMA 다중 처리기에서 거짓 공유를 줄이는 공유 메모리 할당 기법. 정보처리과학회논문지, 23(5):487-497, May 1996