论文标题
基于仿真的HPC应用程序的性能预测:HPL的案例研究
Simulation-Based Performance Prediction of HPC Applications: A Case Study of HPL
论文作者
论文摘要
我们提出了一种基于模拟的方法,用于在高性能计算平台上对并行应用的性能建模。我们的方法可以实现全系统性能建模:(1)硬件平台由抽象但高保真的模型表示; (2)在功能级别上模拟计算和通信组件,其中模拟器允许使用组件本机接口;这将导致(3)快速准确模拟完整的HPC应用程序,对应用程序源代码进行了最小的修改。这种硬件/软件混合建模方法可提供低开销,快速且准确的Exascale模拟,并且可以在标准客户端平台(台式机或笔记本电脑)上轻松进行。我们使用高性能Linpack(HPL)演示了方法的功能和可扩展性,该基准用于排名前500列表中的超级计算机。我们的结果表明,我们的建模方法可以准确有效地预测HPL在TOP500列表超级计算机的规模上的性能。例如,Frontera上HPL的模拟少于五个小时,错误率为4%。
We propose a simulation-based approach for performance modeling of parallel applications on high-performance computing platforms. Our approach enables full-system performance modeling: (1) the hardware platform is represented by an abstract yet high-fidelity model; (2) the computation and communication components are simulated at a functional level, where the simulator allows the use of the components native interface; this results in a (3) fast and accurate simulation of full HPC applications with minimal modifications to the application source code. This hardware/software hybrid modeling methodology allows for low overhead, fast, and accurate exascale simulation and can be easily carried out on a standard client platform (desktop or laptop). We demonstrate the capability and scalability of our approach with High Performance LINPACK (HPL), the benchmark used to rank supercomputers in the TOP500 list. Our results show that our modeling approach can accurately and efficiently predict the performance of HPL at the scale of the TOP500 list supercomputers. For instance, the simulation of HPL on Frontera takes less than five hours with an error rate of four percent.