论文标题
线性时间传播动力学系统的Galerkin模型降低的计算结合公式
A compute-bound formulation of Galerkin model reduction for linear time-invariant dynamical systems
论文作者
论文摘要
这项工作旨在推进线性时间不变(LTI)动力学系统的基于投影基于投影的减少订单模型(ROM)的计算方法。对于此类系统,当前的实践依赖于将状态表达为等级1张量的ROM公式(即向量),从而导致计算内核是内存带宽绑定的,因此不适合在现代多核和混合计算节点上可扩展性能。在解决许多经常研究时,这种弱点可能尤其有限,那里需要进行大量模拟。这项工作介绍了用于LTI动力系统的Galerkin ROM的重新印象,称为Rank-2 Galerkin,该系统将ROM问题的性质从内存带宽转换为计算界限。我们介绍了配方及其实施的细节,并通过使用数值实验来证明其实用性,作为测试用例,在轴对称域中对弹性地震剪切波进行了模拟。我们为各种线程和问题大小量化和分析性能和缩放结果。最后,我们提出了使用Rank-2 Galerkin ROM进行蒙特卡洛采样研究的端到端演示。我们表明,等级2 Galerkin ROM比等级-1 Galerkin ROM(当前的实践)高一个数量级,并且比完整阶模型高约970倍,同时保持该领域的平均值和统计数据的准确性。
This work aims to advance computational methods for projection-based reduced order models (ROMs) of linear time-invariant (LTI) dynamical systems. For such systems, current practice relies on ROM formulations expressing the state as a rank-1 tensor (i.e., a vector), leading to computational kernels that are memory bandwidth bound and, therefore, ill-suited for scalable performance on modern many-core and hybrid computing nodes. This weakness can be particularly limiting when tackling many-query studies, where one needs to run a large number of simulations. This work introduces a reformulation, called rank-2 Galerkin, of the Galerkin ROM for LTI dynamical systems which converts the nature of the ROM problem from memory bandwidth to compute bound. We present the details of the formulation and its implementation, and demonstrate its utility through numerical experiments using, as a test case, the simulation of elastic seismic shear waves in an axisymmetric domain. We quantify and analyze performance and scaling results for varying numbers of threads and problem sizes. Finally, we present an end-to-end demonstration of using the rank-2 Galerkin ROM for a Monte Carlo sampling study. We show that the rank-2 Galerkin ROM is one order of magnitude more efficient than the rank-1 Galerkin ROM (the current practice) and about 970X more efficient than the full order model, while maintaining accuracy in both the mean and statistics of the field.