论文标题

TaskTorrent:C ++中的基于任务的轻巧分布式运行时系统

TaskTorrent: a Lightweight Distributed Task-Based Runtime System in C++

论文作者

Cambier, Léopold, Qian, Yizhou, Darve, Eric

论文摘要

我们介绍TaskTorrent,这是C ++中基于任务的轻巧分布式运行时。 TaskTorrent使用参数化任务图来表示任务DAG,并单面活动消息异步触发远程任务。结果,任务DAG是完全分布和并行发现的。它是C ++ 14库,仅取决于MPI。我们解释了API和实施。我们针对Starpu和Scalapack进行了一系列基准测试。微基准表明,与其他解决方案相比,它的开销很小。然后,我们将其应用于两个大型线性代数问题。 TaskTorrent的尺度非常好,到达数千个核心,表现出良好的弱和强尺度。

We present TaskTorrent, a lightweight distributed task-based runtime in C++. TaskTorrent uses a parametrized task graph to express the task DAG, and one-sided active messages to trigger remote tasks asynchronously. As a result the task DAG is completely distributed and discovered in parallel. It is a C++14 library and only depends on MPI. We explain the API and the implementation. We perform a series of benchmarks against StarPU and ScaLAPACK. Micro benchmarks show it has a minimal overhead compared to other solutions. We then apply it to two large linear algebra problems. TaskTorrent scales very well to thousands of cores, exhibiting good weak and strong scalings.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源