An Asynchronous Algorithm for Balancing Unpredictable Workload on Distributed-Memory Machines

  • Published : 1998.12.15

Abstract

It is challenging to parallelize problems with irregular computation and communication. In this paper, we propose an asynchronous algorithm for balancing unpredictable workload on distributed-memory machines. By using an initial workload estimate, we first partition the computations such that the workload is distributed evenly across the processors. In addition, we perform task migrations dynamically for adapting to the evolving workload. To demonstrate the usefulness of our load balancing strategy, we conducted experiments on an IBM SP2 and a Cray T3D. Experimental results show that our task migration strategy can balance unpredictable workload with little overhead. Our code using C and MPI is portable onto other distributed-memory machines.

Keywords

References

  1. Proc. of Workshop on Solving Irregular Problems on Distributed-Memory Machines, IPPS'96 Ranka, S.(ed.)
  2. Proc. of Workshop on Parallel Algorithms for Irregularly Structured Problems, Irregular'96 Ferreira, A.;Rolim, J.(ed.)
  3. Proc. of Workshop on Solving Irregularly Structured Problems in Parallel, Irregular' 97 Ferreiraa, A.;Rolim, J.(ed.)
  4. Proceedings of IEEE v.84 no.7 High Performance Computing for Vision Wang, C.;Bhat, P.;Prasanna, V.
  5. Proc. of Image Understanding Workshop Detection of Buildings form Monocular Views of Aerial Scenes Using Perceptual Grouping and Shadows Huertas, A.;Lin, C.;Nevatia, R.
  6. IEEE Transactions on Pattern Analysis and Machine Intelligence v.6 no.6 Matching Images Using Linear Features Medioni, G.;Nevatia, R.
  7. Proc. of International Conference on Pattern Recognition Scalable Parallel Implementations of Perceptual Grouping on Connection Machine CM-5 Prasanna, V.;Wang, C.
  8. Proc. of International Parallel Processing Symposium Practical Parallel Algorithms for Dynamic Data Redistribution, Median Finding, and Selection Bader, D.;JaJa, J.
  9. Parallel Computing v.19 An Efficient Heuristic Scheme for Dynamic Remapping of Parallel Computations Choudhary, A.;Narahari, B.;Krishnamurti, R.
  10. Proc. of Scalable High-Performance Computing Conference Performance Evaluation of Load Balancing Algorithms for Parallel Single-Phase Iterative PDE Solvers Chrisochoides, N.;Mansour, N.;Fox, G.
  11. Proc. of Scalable High-Performance Computing Conference Load Balancing Algorithms for the Parallel Community Climate Model Foster, I.;Toonen, B.
  12. IEEE Transactions on Parallel and Distributed Systems v.4 no.9 Load Balancing Requirements in Parallel Implementations of Image Feature Extraction Tasks Gerogiannis, D.;Orphanoudakis, S.
  13. Proc. of Parallel Processing Symposium Implementation and Results of Hypothesis Testing from the C3I Parallel Benchmark Suite VanVoorst, B.;Jha, R.;Pires, L.;Muhammad, M.
  14. Proc. of Symposium on Principles and Practice of Parallel Programming Cilk: An Efficient Multithreaded Runtime System Blumofe, R.;Joerg, C.;Kuszmaul, B.;Leiserson, C.;Randall, K.;Zhou, Y.
  15. Proc. of Workshop on Languages, Compilers, and Runtime Systems for Scalable Computers Runtime Support for Portable Distributed Data Structures Wen, C.;Chakrabarti, S.;Deprit, E.;Krishnamurthy, A.;Yelick, K.
  16. IBM Systems Journal v.34 no.2 SP2 System Architecture Agerwala, T.;Martin, J.;Mirza, J.;Sadler, D.;Dias, D.;Snir, M.
  17. Cray T3D System Architecture Overview Cray Research, Inc.