论文标题
Jack2:一个新的高级通信库,用于并行迭代方法
JACK2: a new high-level communication library for parallel iterative methods
论文作者
论文摘要
在本文中,我们解决了设计分布式应用程序的问题,该应用程序旨在运行经典和异步迭代。 MPI库非常受欢迎,并且在科学界广泛使用,但是异步的迭代方法对有效管理通信请求和缓冲区的有效管理引起了不可忽略的困难。此外,引入了收敛检测问题,它需要实现各种最新终止方法之一,这在大多数计算环境中不一定是高度可靠的。我们在这里提出了一个基于MPI的通信库,该库以非侵入性的方式处理所有这些问题,为实现经典和异步迭代提供了独特的界面。关于我们实现最佳沟通速率并确保准确收敛检测的方法,很少有细节。两个超级计算机的实验结果证实了引入的低间接交流成本以及我们图书馆的有效性。
In this paper, we address the problem of designing a distributed application meant to run both classical and asynchronous iterations. MPI libraries are very popular and widely used in the scientific community, however asynchronous iterative methods raise non-negligible difficulties about the efficient management of communication requests and buffers. Moreover, a convergence detection issue is introduced, which requires the implementation of one of the various state-of-the-art termination methods, which are not necessarily highly reliable for most computational environments. We propose here an MPI-based communication library which handles all these issues in a non-intrusive manner, providing a unique interface for implementing both classical and asynchronous iterations. Few details are highlighted about our approach to achieve best communication rates and ensure accurate convergence detection. Experimental results on two supercomputers confirmed the low overhead communication costs introduced, and the effectiveness of our library.