On optimal message vector length for block single parallel partition algorithm in a three-dimensional ADI solver
作者:
Highlights:
•
摘要
It has long been recognized that many direct parallel tridiagonal solvers are only efficient for solving a single tridiagonal equation of large sizes, and they become inefficient when naively used in a three-dimensional ADI solver. In order to improve the parallel efficiency of an ADI solver using a direct parallel solver, we implement the single parallel partition (SPP) algorithm in conjunction with message vectorization, which aggregates several communication messages into one to reduce the communication costs. The measured performances show that the longest allowable message vector length (MVL) is not necessarily the best choice. To understand this observation and optimize the performance, we propose an improved model that takes the cache effect into consideration. The optimal MVL for achieving the best performance is shown to depend on number of processors and grid sizes. Similar dependence of the optimal MVL is also found for the popular block pipelined method.
论文关键词:Tridiagonal equation,Single parallel partition,ADI scheme,Message vectorization,Optimal message vector length
论文评审过程:Available online 4 September 2009.
论文官网地址:https://doi.org/10.1016/j.amc.2009.08.052