论文标题

积极的深度学习方法,用于发现大型光谱调查中感兴趣的对象

Active deep learning method for the discovery of objects of interest in large spectroscopic surveys

论文作者

Škoda, Petr, Podsztavek, Ondřej, Tvrdík, Pavel

论文摘要

路线望远镜的当前档案包含数百万个管道加工的光谱,这些光谱可能从未被人眼看到。但是,大多数具有有趣物理特性的稀有物体只能通过视觉分析其特征性光谱特征来识别。互动可视化与现代机器学习技术的正确组合为发现此类对象开辟了新的方法。我们应用深度卷积网络支持的主动学习分类,以自动识别数百万个光谱档案中的复杂发射线形状。 我们使用了由定制设计的深卷积神经网络驱动的基于池的不确定性采样主动学习,其12层灵感来自VGGNET,ALEXNET和ZFNET,但适用于一维功能向量。 Lamost DR2调查中的410万个光谱代表了未标记的泳池集。该网络的初始训练是在ondEjov天文台的2000万Perek望远镜左右在该区域获得的大约13000个光谱集进行的,该光谱大部分包含BE和相关的早期型恒星。高斯模糊弥补了OndùEjov中间分辨率与杆型低分辨率光谱仪之间的差异。 经过几次迭代后,该网络能够成功识别出小于6.5%的误差的发射线恒星。使用虚拟天文台的技术可视化结果,我们发现了948个发射线对象的新候选物的1013个光谱,除664个光谱中,在Simbad中列出了549个对象,在Simbad中列出了2644个光谱和2291个对象的2291个对象,这些对象在Wen Hou的中国较早论文中鉴定出来。详细讨论了具有异常光谱属性的最有趣的对象。

Current archives of the LAMOST telescope contain millions of pipeline-processed spectra that have probably never been seen by human eyes. Most of the rare objects with interesting physical properties, however, can only be identified by visual analysis of their characteristic spectral features. A proper combination of interactive visualisation with modern machine learning techniques opens new ways to discover such objects. We apply active learning classification supported by deep convolutional networks to automatically identify complex emission-line shapes in multi-million spectra archives. We used the pool-based uncertainty sampling active learning driven by a custom-designed deep convolutional neural network with 12 layers inspired by VGGNet, AlexNet, and ZFNet, but adapted for one-dimensional feature vectors. The unlabelled pool set is represented by 4.1 million spectra from the LAMOST DR2 survey. The initial training of the network was performed on a labelled set of about 13000 spectra obtained in the region around H$α$ by the 2m Perek telescope of the Ondřejov observatory, which mostly contains spectra of Be and related early-type stars. The differences between the Ondřejov intermediate-resolution and the LAMOST low-resolution spectrographs were compensated for by Gaussian blurring. After several iterations, the network was able to successfully identify emission-line stars with an error smaller than 6.5%. Using the technology of the Virtual Observatory to visualise the results, we discovered 1013 spectra of 948 new candidates of emission-line objects in addition to 664 spectra of 549 objects that are listed in SIMBAD and 2644 spectra of 2291 objects identified in an earlier paper of a Chinese group led by Wen Hou. The most interesting objects with unusual spectral properties are discussed in detail.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源