DOI QR코드

DOI QR Code

Parallelism for Nested Loops with Simple Subscripts

  • Jeong, Sam-Jin (Division of Information and Communication Engineering, BaekSeok University)
  • Published : 2008.12.31

Abstract

In this paper, we propose improved loop splitting method for maximizing parallelism of single loops with non-constant dependence distances. By using the iteration and distance for the source of the first dependence, and by our defined theorems, we present generalized and optimal algorithms for single loops with non-uniform dependences (MPSL). By the extension of the MPSL method, we also apply to exploit parallelism from nested loops with simple subscripts, based on cycle shrinking and loop interchanging method. The algorithms generalize how to transform general single loops with non-uniform dependences as well as nested loops with simple subscripts into parallel loops.

Keywords

References

  1. W. Zhang, G. Chen, M. Kandeemir, and M. Karakoy, "Interprocedural Optimizations for Improvi ng Data Cache Performance of Array-Intensive Embedded Applications," in DAC 2003, Anaheim, Califomia, 2003.
  2. D. S. Park, M. H. Choi, " Interprocedural Transformations for Parallel Computing," in Journal of Korean Multimedia Society, vol. 9, no. 12, pp.1700- 1708, Dec., 2006.
  3. C. D. Ploychronopoulos, "Compiler optimizations for enhancing parallelism and their impact on architecture design," in IEEE Trans. computers, vol. 37, no. 8, pp. 991-1004, Aug. 1988. https://doi.org/10.1109/12.2249
  4. D. E. Knuth, The Art of Computer Programming, vol. 2: Seminumerical Algorithms, Reading, MA: Addison-Wesley, 1981.
  5. H. Zima and B. Chapman, Supercompilers for Parallel and Vector Computers, Addison-Wesley, 1991.
  6. U. Banerjee, R. Eigenmann, A. Nicolau, and D. A. Padua, "Automatic program parallelization," in Proceedings of the IEEE, vol. 81, no. 2, pp.211-243, Feb 1993. https://doi.org/10.1109/5.214548
  7. M. J. Wolfe, Optimizing Supercompilers for Supercomputers, Cambridge, MA: MIT Press, 1989.
  8. S. J. Jeong, "A Loop Transformation for Parallelism from Single Loops," in International Journal of Contents, vol. 2, No.4, pp.8- 11 , Dec. 2006.
  9. W. Shang, T. O' Keefe, and J. A. B. Fortes, "On loop transfonnations for generalized cycle shrinking," in IEEE Trans. Parallel and Distributed Systems, vol. 5, no. 2, pp. 193-204, Feb. 1994. https://doi.org/10.1109/71.265946
  10. M. E. Wolfe, and M. S. Lam, "A loop transformation theory and algorithm to maximize parallelism," in IEEE Trans. Parallel and Distributed Systems, vol. 2, no. 4, pp. 452-471 , Oct. 1991. https://doi.org/10.1109/71.97902