论文标题
使用概率自动编码器的银河多频天的贝叶斯分解
Bayesian decomposition of the Galactic multi-frequency sky using probabilistic autoencoders
论文作者
论文摘要
银河系的全天空观察显示出银河系和非半乳酸弥漫发射,例如从星际物质或宇宙微波背景(CMB)。不同的发射器部分叠加在测量中,部分地相互掩盖,有时它们在一定光谱范围内占据主导地位。从光谱数据中的基本辐射组件分解是一个信号重建问题,通常与详细的物理建模和实质性的计算工作相关。我们旨在构建一种有效且自我实施的算法,检测基本光谱信息包含银河系全天空数据涵盖从$γ$ ray到无线电波的光谱频段。利用信息理论的原理,我们开发了一种最新的变量自动编码器,专门针对高斯噪声统计。我们首先得出一个通用生成过程,该过程从低维发射特征引入到观察到的高维数据。我们使用贝叶斯方法制定这些特征的后验分布,并通过变异推断近似此后部。该算法有效地编码了十个潜在特征图中35个银河发射数据集的信息。这些包含以高保真度重建初始数据所需的基本信息,并根据算法对数据再生的重要性进行排名。三个最重要的特征图编码天体物理成分:(1)密集的星际介质(ISM),(2)ISM的热和稀释区域和(3)CMB。机器辅助和数据驱动的尺寸降低光谱数据可以揭示编码输入数据的物理特征。我们的算法只能从天空亮度值中提取密集和稀释的银河系和CMB。
All-sky observations of the Milky Way show both Galactic and non-Galactic diffuse emission, for example from interstellar matter or the cosmic microwave background (CMB). The different emitters are partly superimposed in the measurements, partly they obscure each other, and sometimes they dominate within a certain spectral range. The decomposition of the underlying radiative components from spectral data is a signal reconstruction problem and often associated with detailed physical modeling and substantial computational effort. We aim to build an effective and self-instructing algorithm detecting the essential spectral information contained Galactic all-sky data covering spectral bands from $γ$-ray to radio waves. Utilizing principles from information theory, we develop a state-of-the-art variational autoencoder specialized on the adaption to Gaussian noise statistics. We first derive a generic generative process that leads from a low-dimensional set of emission features to the observed high-dimensional data. We formulate a posterior distribution of these features using Bayesian methods and approximate this posterior with variational inference. The algorithm efficiently encodes the information of 35 Galactic emission data sets in ten latent feature maps. These contain the essential information required to reconstruct the initial data with high fidelity and are ranked by the algorithm according to their significance for data regeneration. The three most significant feature maps encode astrophysical components: (1) The dense interstellar medium (ISM), (2) the hot and dilute regions of the ISM and (3) the CMB. The machine-assisted and data-driven dimensionality reduction of spectral data is able to uncover the physical features encoding the input data. Our algorithm is able to extract the dense and dilute Galactic regions, as well as the CMB, from the sky brightness values only.