Browse > Article

Empirical Modeling for Cache Miss Rates in Multiprocessors  

Lee, Kang-Woo (동국대학교 정보통신공학과)
Yang, Gi-Joo (동국대학교 정보통신공학과)
Park, Choon-Shik (동국대학교 정보통신공학과)
Abstract
This paper introduces an empirical modeling technique. This technique uses a set of sample results which are collected from a few small scale simulations. Empirical models are developed by applying a couple of statistical estimation techniques to these samples. We built two types of models for cache miss rates in Symmetric Multiprocessor systems. One is for the changes of input data set size while the specification of target system is fixed. The other is for the changes of the number of processors in target system while the input data set size is fixed. To develop accurate models, we built individual model for every kind of cache misses for each shared data structure in a program. The final model is then obtained by integrating them. Besides, combined use of Least Mean Squares and Robust Estimations enhances the quality of models by minimizing the distortion due to outliers. Empirical modeling technique produces extremely accurate models without analysis on sample data. In addition, since only snail scale simulations are necessary, once a set of samples can be collected, empirical method can be adopted in any research areas. In 17 cases among 24 trials, empirical models present extremely low prediction errors below $1\%$. In the remaining cases, the accuracy is excellent, as well. The models sustain high quality even when the behavioral characteristics of programs are irregular and the number of samples are barely enough.
Keywords
Empirical Modeling; Prediction Model; Statistical Estimation Methods; Cache Misses; Symmetric Multiprocessors;
Citations & Related Records
연도 인용수 순위
  • Reference
1 M. Parashar, S. Hariri, 'Compile-Time Performance Prediction of HPF/Fortran 90D,' IEEE Performance Evaluation, pp. 57-73, Spring 1996   DOI   ScienceOn
2 M. Dubois, J. Skeppstedt, L. Ricciulli, K. Ramamurthy, P. Stenstrom, 'Detection and Elimination of Useless Misses in Multiprocessors,' Proc. of 20th Annual International Symp. on Computer Architecture, pp. 88-97, May 1993   DOI
3 M. McLaughlin, 'Market Share: Servers, Worldwide, 2004,' Gartner Research, 2004
4 G. Weiss, M. Chuba, 'The Future of the Server: A Five-year Outlook,' Gartner Research, Jul. 2003
5 M. Brorsson, F. Dahlgren, H. Nilsson, and P. Stenstrom, 'The CacheMire Test Bench--A Flexible and Effective Approach for Simulation of Multiprocessors,' Proc. of 26th Annual IEEE International Simulation Symposium, pp. 41-49, Apr. 1993
6 D. Culler, J. P. Singh, A. Gupta, 'Parallel Computer Architecture: A Hardware/Software Approach,' Morgan Kaufmann, 1998
7 M. Dubois, and J. C. Wang, 'Shared Block Contention in a Cache Coherence protocol,' IEEE Transactions on Computers, Vol. 40 No.5, pp. 317-328, May 1991   DOI   ScienceOn
8 A. Gupta, W. Weber, 'Cache Invalidation Patterns in Shared Memory Multiprocessors,' IEEE Trans. on Computers 41(7): pp. 794-810, Jul. 1992   DOI   ScienceOn
9 E. Rothberg, J. P. Singh, A. Gupta, 'Working Sets, Cache Sizes, and Node Granularity Issues for Large-Scale Multiprocessors,' Proc. of 20th Ann. Int'l. Symp, on Computer Architecture, pp. 14-25, May 1993   DOI
10 V. A. Aho, J. E. Hopcroft, J. D. Ullman, 'Data Structures and Algorithms,' Addison-Wesley Publishing Company, 1983
11 S. C. Woo, M. Ohara, E. Torrie, J. P. Singh, A. Gupta, 'The SPLASH-2 Programs: Characterization and Methodological Consideration,' Proc. of 22nd Ann. Int. Symp. on Computer Architecture, pp. 24-36, May 1995   DOI
12 www.intel.com, 'Transitioning to the Intel Itanium Architecture,' Nov. 2003
13 M. I. Hubley, M. Ricjardson, 'Windows 2000 Symmetric Multiprocessing(SMP): Perspective,' Gartner Research, Oct, 2002
14 J. P. Singh, W-D. Weber, A. Gupta, 'SPLASH: Stanford Parallel Applications for SharedMemory,' Compo Arch. News, 20(1):5-44 Mar. 1992   DOI
15 G. Weerasinghe, I. Antonios, L. Lipsky, 'An Analytic Performance Model of Parallel Systems that Perform N Tasks Using P Processors that can Fail,' IEEE Int'l. Symp. on Network Computing and Applications, pp. 310-319, 2001
16 R. E. Matick, 'Comparison of Analytic Performance Models using Closed Mean-value Analysis versus Open-Queueing Theory for Estimating Cycles per Instruction of Memory Hierarchies,' IBM Journal of Research and Development, Vol. 47, Issue 4, pp. 495-517, Jul.2003   DOI   ScienceOn
17 G. Simsion, G. Witt, 'Data Modeling Essentials (The Morgan Kaufmann Series in Data Management Systems) 3rd Edition,' Morgan Kaufmann, 2004
18 A. Snavely, L. Carrington, N. Wolter, J. Labarta, R. Badia, A. Purkastha, 'A Framework for Performance Modeling and Prediction,' Conference on High Performance Networking and Computing, Proceedings of the 2002 ACM/IEEE Conference on Supercomputing, pp. 1-17, 2002
19 D. Kerbyson, A. Hoisie, H. Wasserman, 'Modeling the Performance of Large-Scale Systems,' IEEE Proc. on Software, 150(4), pp. 214-221, Aug. 2003   DOI   ScienceOn
20 I. Gluhovsky, B. O'Krafta, 'Comprehensive Multiprocessor Cache Miss Rate Generation using Multivariate Models,' ACM Trans. on Computer Systems, Vol. 23, No.2, pp. 111-145, May 2005   DOI   ScienceOn
21 R. Hogg, 'Adaptive Robust Estimation,' Journal of American Statistics Association, 69, pp. 909-927, 1974   DOI   ScienceOn
22 R. L. Launer, G. N. Wilkinson, 'Robustness in Statistics,' Academy Press, 1978
23 W. Press, B. Flannery, S. Teukolsky, W. Vetterling, 'Numerical Recipes,' Cambridge University Press, 1986