Paper title
On the Identifiability of Phylogenetic Networks under a Pseudolikelihood model
Paper authors
Paper abstract
The Tree of Life is the graphical structure that represents the evolutionary process from single-cell organisms at the origin of life to the vast biodiversity we see today. Reconstructing this tree from genomic sequences is challenging because of the variety of biological forces that shape the signal in the data, and many of these processes, such as incomplete lineage sorting and hybridization, can produce confounding information. Here, we present the mathematical version of the identifiability proofs of phylogenetic networks under the pseudolikelihood model in SNaQ. We establish that the ability to detect different hybridization events depends on the number of nodes on the hybridization blob, with small blobs (corresponding to closely related species) being the hardest to detect. Our work focuses on level-1 networks, but it draws attention to the importance of identifiability studies on phylogenetic inference methods for broader classes of networks.