论文标题
通过深神经网络对CMB的时间订购数据进行分类
Classifying CMB time-ordered data through deep neural networks
论文作者
论文摘要
宇宙微波背景(CMB)已在广泛的多物范围内进行测量。 Atacama Cosmology望远镜(ACT)等电弧分钟分辨率的实验已促进了原发性和继发性各向异性的测量,从而导致了巨大的科学发现。这样的发现需要仔细的数据选择,以消除行为不佳的探测器和不必要的污染物。 ACT使用的当前数据分类方法取决于专家评估和微调的几个统计参数。该方法是高度耗时的,带或特定于季节的方法,这使得它对于将来的CMB实验降低了效率。在这项工作中,我们提出了一个监督的机器学习模型,以对CMB实验的检测器进行分类。该模型对应于深度卷积神经网络。我们使用2008季节(148 GHz)测试了对实际ACT数据的方法,作为ACT数据选择软件提供的标签的培训。该模型学会了直接从原始数据开始对时间流进行分类。对于训练期间考虑的季节和频率,我们发现分类器的精度为99.8%。对于2008季的220和280 GHz数据,我们分别获得了精度的99.4%和97.5%。最后,我们对2009年和2010年的148 GHz数据进行了跨季节测试,我们的模型的精度分别为99.8%和99.5%。我们的模型比当前管道快10倍,使其有可能适合实时实现。
The Cosmic Microwave Background (CMB) has been measured over a wide range of multipoles. Experiments with arc-minute resolution like the Atacama Cosmology Telescope (ACT) have contributed to the measurement of primary and secondary anisotropies, leading to remarkable scientific discoveries. Such findings require careful data selection in order to remove poorly-behaved detectors and unwanted contaminants. The current data classification methodology used by ACT relies on several statistical parameters that are assessed and fine-tuned by an expert. This method is highly time-consuming and band or season-specific, which makes it less scalable and efficient for future CMB experiments. In this work, we propose a supervised machine learning model to classify detectors of CMB experiments. The model corresponds to a deep convolutional neural network. We tested our method on real ACT data, using the 2008 season, 148 GHz, as training set with labels provided by the ACT data selection software. The model learns to classify time-streams starting directly from the raw data. For the season and frequency considered during the training, we find that our classifier reaches a precision of 99.8%. For 220 and 280 GHz data, season 2008, we obtained 99.4% and 97.5% of precision, respectively. Finally, we performed a cross-season test over 148 GHz data from 2009 and 2010 for which our model reaches a precision of 99.8% and 99.5%, respectively. Our model is about 10x faster than the current pipeline, making it potentially suitable for real-time implementations.