论文标题

线路辐射转移的LOC程序

LOC program for line radiative transfer

论文作者

Juvela, M.

论文摘要

辐射转移建模是许多天体物理模拟的一部分,用于进行合成观测和协助分析观测。我们集中于星际介质发出的无线电线的建模。与高分辨率模型有关,这可能是重大的计算挑战。 我们的目标是一个线辐射转移(RT)程序,该程序可以充分利用多核CPU和GPU。并行化对于加快计算和实现个人计算机来处理大型建模任务至关重要。 该程序LOC基于射线追踪,并使用标准的加速Lambda迭代(ALI)方法来更快地收敛。该程序在1D和3D网格上工作。 1D版本利用对称性来加快RT计算。 3D版本可与OctRee网格一起使用,并且为了使用大型模型启用计算,可用于低内存使用情况。 测试表明,LOC给出了与其他RT代码一致的结果,达到约2%以内。这是代码对代码差异的典型特征,这通常与模型设置的不同解释有关。 LOC运行时间与蒙特卡洛代码的时间相比有利。在1D测试中,LOC运行在GPU上的速度比单个CPU核心快约20。尽管进行了复杂的路径计算,但对于使用OCTREE离散化的3D模型也观察到了高达〜10的速度。 GPU允许在大规模模拟星际云的大规模模拟中遇到的数亿个单元的模型计算。 LOC显示出良好的性能和准确性,并且能够在个人计算机上处​​理许多RT建模任务。它以Python的形式编写,并以计算密集型零件为编译的OpenCL内核,也可以用作替代RT实现的进一步实验的平台。

Radiative transfer modelling is part of many astrophysical simulations and is used to make synthetic observations and to assist analysis of observations. We concentrate on the modelling of the radio lines emitted by the interstellar medium. In connection with high-resolution models, this can be significant computationally challenge. Our goal is a line radiative transfer (RT) program that makes good use of multi-core CPUs and GPUs. Parallelisation is essential to speed up computations and to enable the tackling of large modelling tasks with personal computers. The program LOC is based on ray-tracing and uses standard accelerated lambda iteration (ALI) methods for faster convergence. The program works on 1D and 3D grids. The 1D version makes use of symmetries to speed up the RT calculations. The 3D version works with octree grids and, to enable calculations with large models, is optimised for low memory usage. Tests show that LOC gives results that are in agreement with other RT codes to within ~2%. This is typical of code-to-code differences, which often are related to different interpretations of the model set-up. LOC run times compare favourably with those of Monte Carlo codes. In 1D tests, LOC runs were by up to a factor ~20 faster on a GPU than on a single CPU core. In spite of the complex path calculations, up to ~10 speed-up was observed also for 3D models using octree discretisation. GPUs enable calculations of models with hundreds of millions of cells, as encountered in the context of large-scale simulations of interstellar clouds. LOC shows good performance and accuracy and and is able to handle many RT modelling tasks on personal computers. Being written in Python, with the computing-intensive parts implemented as compiled OpenCL kernels, it can also a serve as a platform for further experimentation with alternative RT implementations.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源