论文标题
贝叶斯分类,异常检测和使用与微生物组应用的网络输入的生存分析
Bayesian classification, anomaly detection, and survival analysis using network inputs with application to the microbiome
论文作者
论文摘要
虽然对单个网络的研究是良好的,但技术进步现在可以相对轻松地收集多个网络。越来越多的任何地方都可以通过脑成像,基因共表达数据或微生物组测量来创建几个到达数千个网络。反过来,这些网络被视为在建模中使用的潜在功能。但是,由于网络本质上是非欧几里得人士,因此如何最好地将它们纳入标准建模任务并不明显。在本文中,我们提出了一个贝叶斯建模框架,该框架为二进制分类,异常检测和使用网络输入提供了统一的方法。我们通过其成对差异在高斯过程的内核中编码网络,并讨论了可以插入模型的几种可证明正面确定核的选择。尽管我们的方法广泛适用,但我们在这里尤其是通过微生物组研究的动机(在这种情况下,网络分析是捕获时间和空间中微生物分类群相互联系的标准方法)及其减少早产和改善产前护理个性化的潜力。
While the study of a single network is well-established, technological advances now allow for the collection of multiple networks with relative ease. Increasingly, anywhere from several to thousands of networks can be created from brain imaging, gene co-expression data, or microbiome measurements. And these networks, in turn, are being looked to as potentially powerful features to be used in modeling. However, with networks being non-Euclidean in nature, how best to incorporate them into standard modeling tasks is not obvious. In this paper, we propose a Bayesian modeling framework that provides a unified approach to binary classification, anomaly detection, and survival analysis with network inputs. We encode the networks in the kernel of a Gaussian process prior via their pairwise differences and we discuss several choices of provably positive definite kernel that can be plugged into our models. Although our methods are widely applicable, we are motivated here in particular by microbiome research (where network analysis is emerging as the standard approach for capturing the interconnectedness of microbial taxa across both time and space) and its potential for reducing preterm delivery and improving personalization of prenatal care.