A framework for adaptive collective communications for heterogeneous hierarchical computing systems

作者:

Highlights:

摘要

Collective communication operations are widely used in MPI applications and play an important role in their performance. However, the network heterogeneity inherent to grid environments represent a great challenge to develop efficient high performance computing applications. In this work we propose a generic framework based on communication models and adaptive techniques for dealing with collective communication patterns on grid platforms. Toward this goal, we address the hierarchical organization of the grid, selecting the most efficient communication algorithms at each network level. Our framework is also adaptive to grid load dynamics since it considers transient network characteristics for dividing the nodes into clusters. Our experiments with the broadcast operation on a real-grid setup indicate that an adaptive framework allows significant performance improvements on MPI collective communications.

论文关键词:Grid computing,Performance modeling,Adaptive techniques,Polyalgorithms,Collective communication,MPI

论文评审过程:Received 1 November 2006, Revised 1 March 2007, Available online 20 July 2007.

论文官网地址:https://doi.org/10.1016/j.jcss.2007.07.010