论文标题
TaskTorrent:C ++中的基于任务的轻巧分布式运行时系统
TaskTorrent: a Lightweight Distributed Task-Based Runtime System in C++
论文作者
论文摘要
我们介绍TaskTorrent,这是C ++中基于任务的轻巧分布式运行时。 TaskTorrent使用参数化任务图来表示任务DAG,并单面活动消息异步触发远程任务。结果,任务DAG是完全分布和并行发现的。它是C ++ 14库,仅取决于MPI。我们解释了API和实施。我们针对Starpu和Scalapack进行了一系列基准测试。微基准表明,与其他解决方案相比,它的开销很小。然后,我们将其应用于两个大型线性代数问题。 TaskTorrent的尺度非常好,到达数千个核心,表现出良好的弱和强尺度。
We present TaskTorrent, a lightweight distributed task-based runtime in C++. TaskTorrent uses a parametrized task graph to express the task DAG, and one-sided active messages to trigger remote tasks asynchronously. As a result the task DAG is completely distributed and discovered in parallel. It is a C++14 library and only depends on MPI. We explain the API and the implementation. We perform a series of benchmarks against StarPU and ScaLAPACK. Micro benchmarks show it has a minimal overhead compared to other solutions. We then apply it to two large linear algebra problems. TaskTorrent scales very well to thousands of cores, exhibiting good weak and strong scalings.