References
- A.Smith, 'Cache memories,' ACM Computing Surveys, vol.14,pp. 473-530, Sep. 1982 https://doi.org/10.1145/356887.356892
- N. Jouppi, 'Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers,' In Proceedings of the 17 th Annual International Symposium in Computer Architecture, pp.364-373, 1990 https://doi.org/10.1109/ISCA.1990.134547
- J. Baer and T. Chen, 'An effective on-chip preloading scheme to reduce data access penalty,' In Proceedings of Supercomputing '91, pp.176-186, 1991 https://doi.org/10.1145/125826.125932
- J. Baer and T.Chen, 'Reducing memory latency via non-blocking and prefetching caches,' In Proceedings of the 5th International Conference on Architectural Support for Programming Languages and Operating Systems, pp.51-61, Oct. 1992 https://doi.org/10.1145/143365.143486
- J. Fu and J. Patel, 'Data prefetching In multiprocessor vector cache memories,' In Proceedings of the 18th Annual International Symposium on Computer Architecture, pp.54-63, 1991
- J. Fu, J. Patel and B. Janssens, 'Stride directed prefetching in scalar processors,' In Proceedings of the 25th International Symposium on Micro-achitecture, pp.102-110, 1992 https://doi.org/10.1145/144953.145006
- F. Dahlgren, M. Dubois and P. Stenstrom, 'Fixed and Adaptive sequential prefetching in shared memory multiprocessors,' In Proceedings of the International Conference on Parallel Processing, pp.56-63,1993 https://doi.org/10.1109/ICPP.1993.92
- A. Porterfield, 'Software methods for improvement of cache performance on supercomputer applications,' In Technical Report COMP TR-89-93, Rice University
- E. Gornish, E. Granston and A. Veidenbaum, 'Compiler-directed data prefetching in multiprocessors with memory hierarchies,' In Proceedings of 1990 International Conference on Supercomputing, pp.354-368, 1990 https://doi.org/10.1145/77726.255176
- T. Mowry and A. Gupta, 'Tolerating latency through software-controlled prefetching in shared-memory multiprocessors,' Journal of Parallel and Distributed Computing, Vol.12, no.2, pp.87-106, 1991 https://doi.org/10.1016/0743-7315(91)90014-Z
- T. Mowry, M. Lam and A. Gupta, 'Design and evaluation of a compiler algorithm for prefetching,' In proceedings of the 5th International Conference on Architectural Support for Programming Languages and Operating Systems, pp.62-73, 1992 https://doi.org/10.1145/143365.143488
- J. E. Veenstra and R. J. Fowler, 'MINT: A Front End for Efficient Simulation of Shared-Memory Multiprocessors,' In Proceeding of 2nd International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS), pp 201-207, Jan. 1994 https://doi.org/10.1109/MASCOT.1994.284422
- J. Archibald and J-L. Baer, 'Cache Coherence Protocols : Evaluation Using a Multiprocessors Simulation Model,' ACM Transactions on Computer Systems, Vol. 4, No. 4, pp 273-298, Nov. 1986 https://doi.org/10.1145/6513.6514
- S.C. Woo, M. Ohara, E. Torrie, J.P. Singh and A. Gupta, 'The SPLASH-2 Programs: Characterization and Methodological Considerations,' In Proceedings of the 22th Annual International Symposium on Computer Architecture, pp 24-25, June 1995
- Vipin Kumar, Ananth Grama, Anshul Gupta and George Karypis, 'Introduction to Parallel Computing (Design and Analysis of Algorithms),' The Benjamin/Cummings Publishing Company, Inc., pp.169, pp.179, pp.380, 1994