DOI QR코드

DOI QR Code

Two-Level Scratchpad Memory Architectures to Achieve Time Predictability and High Performance

  • Liu, Yu (Department of Electrical and Computer Engineering, Virginia Commonwealth University) ;
  • Zhang, Wei (Department of Electrical and Computer Engineering, Virginia Commonwealth University)
  • Received : 2014.07.01
  • Accepted : 2014.11.20
  • Published : 2014.12.30

Abstract

In modern computer architectures, caches are widely used to shorten the gap between processor speed and memory access time. However, caches are time-unpredictable, and thus can significantly increase the complexity of worst-case execution time (WCET) analysis, which is crucial for real-time systems. This paper proposes a time-predictable two-level scratchpad-based architecture and an ILP-based static memory objects assignment algorithm to support real-time computing. Moreover, to exploit the load/store latencies that are known statically in this architecture, we study a Scratch-pad Sensitive Scheduling method to further improve the performance. Our experimental results indicate that the performance and energy consumption of the two-level scratchpad-based architecture are superior to the similar cache based architecture for most of the benchmarks we studied.

Keywords

References

  1. L. Wehmeyer and P. Marwedel, "Influence of onchip scratchpad memories on WCET prediction," in Proceedings of the 4th International Workshop on Worst-Case Execution Time (WCET) Analysis, Sicily, Italy, 2004, p. 1-4.
  2. O. Avissar, R. Barua, and D. Stewart, "An optimal memory allocation scheme for scratch-pad-based embedded systems," ACM Transactions on Embedded Computing Systems, vol. 1, no. 1, pp. 6-26, 2002. https://doi.org/10.1145/581888.581891
  3. S. Steinke, L. Wehmeyer, B. S. Lee, and P. Marwedel, "Assigning program and data objects to scratchpad for energy reduction," in Proceedings of the Design, Automation and Test in Europe Conference and Exhibition, Paris, France, 2002, pp. 409-415.
  4. J. F. Deverge and I. Puaut, "WCET-directed dynamic scratchpad memory allocation of data," in Proceedings of the 19th Euromicro Conference on Real-Time Systems (ECRTS2007), Pisa, Italy, 2007, pp. 179-190.
  5. S. Metzlaff, S. Uhrig, J. Mische, and T. Ungerer, "Predictable dynamic instruction scratchpad for simultaneous multithreaded processors," in Proceedings of the 9th Workshop on MEmory Performance: DEaling with Applications, Systems and Architecture (MEDEA2008), Toronto, Canada, 2008, pp. 38-45.
  6. R. Banakar, S. Steinke, B. S. Lee, M. Balakrishnan, and P. Marwedel, "Comparison of cache-and scratch pad based memory systems with respect to performance, area and energy consumption," Technical Report 762, University of Dortmund, Germany, 2001.
  7. Y. Liu and W. Zhang, "Exploiting multi-level scratchpad memories for time-predictable multicore computing," in Proceedings of the IEEE 30th International Conference on Computer Design (ICCD2012), Montreal, Canada, 2012, pp. 61-66.
  8. R. Banakar, S. Steinke, B. S. Lee, M. Balakrishnan, and P. Marwedel, "Scratchpad memory: design alternative for cache on-chip memory in embedded systems," in Proceedings of the 10th International Symposium on Hardware/Software Codesign (CODES2002), Estes Park, CO, 2002, pp. 73-78.
  9. J. Whitham and N. Audsley, "Implementing time-predictable load and store operations," in Proceedings of the 7th ACM International Conference on Embedded Software (EMSOFT2009), Grenoble, France, 2009, pp. 265-274.
  10. T. Lundqvist and P. Stenstrom, "Timing anomalies in dynamically scheduled microprocessors," in Proceedings of the 20th Real-Time Systems Symposium, Phoenix, AZ, 1999, pp. 12-21.
  11. C. R. Hardnett, K. V. Palem, R. M. Rabbah, and W. F. Wong, "Scheduling load operations on VLIW machines," Technical Report GITCC-01-015, Georgia Institute of Technology, Atlanta, GA, 2001.
  12. Trimaran, http://www.trimaran.org.
  13. P. Shivakumar and N. P. Jouppi, "Cacti 3.0: an integrated cache timing, power, and area model," Technical Report 2001/2, Compaq Computer Corporation, Harris County, TX, 2001.
  14. G. Ascia, V. Catania, M. Palesi, and D. Patti, "EPIC-Explorer: a parameterized VLIW-based platform framework for design space exploration," in Proceedings of the 1st Workshop on Embedded Systems for Real-Time Multimedia (ESTImedia), Newport Beach, CA, 2003, pp. 65-72.
  15. CPLEX, http://www.ilog.com/products/cplex/.
  16. Malardalen Real-Time Research Center, "WCET project - benchmarks," http://www.mrtc.mdh.se/projects/wcet/benchmarks.html.

Cited by

  1. Thread-level priority assignment in global multiprocessor scheduling for DAG tasks vol.113, 2016, https://doi.org/10.1016/j.jss.2015.12.004
  2. GPU-SAM: Leveraging multi-GPU split-and-merge execution for system-wide real-time support vol.117, 2016, https://doi.org/10.1016/j.jss.2016.02.009