论文标题
在森林中采摘的樱桃:两种系统发育树的无根杂交数量的新特征
Cherry picking in forests: A new characterization for the unrooted hybrid number of two phylogenetic trees
论文作者
论文摘要
系统发育网络是一种特殊的图形类型,可以推广系统发育树,并用于模拟非真正的进化过程,例如重组和杂交。在本文中,我们考虑{\ em无根}系统发育网络,即简单的,连接的图$ \ nathcal {n} =(v,e)$带叶子$ x $的$ x $,用于$ x $的某些物种,其中每个内部顶点in $ \ nathcal {n} $ in $ \ nestion {n} $ inter {n} $都有三个。一种用于构建此类系统发育网络的方法是将$ \ Mathcal {p} $的系统发育树的$作为输入,并寻找一个网络$ \ Mathcal {n} $,其中包含$ \ Mathcal {p} $中的每个树,并将其最小化的量子$ r(ntercal $ r(\ ratecal)最小化。这样的网络始终存在,最佳网络$ \ MATHCAL {n} $的数量$ r(\ Mathcal {n})$称为$ \ Mathcal {p} $的混合数。在本文中,如果$ \ Mathcal {p} $由两棵树组成,则为混合编号提供一个新的特征。这种表征是根据两棵树的樱桃摘要序列给出的,尽管为了证明我们的特征是我们需要更一般地定义两个森林的序列。樱桃采摘序列已经过深入研究,以用于植根的系统发育树的集合,但是我们的新序列是该概念的第一个变体,可以应用于无根环境。由于两棵树的混合数等于两棵树之间的著名树一分为二和重新连接距离,因此我们的新特征还提供了一种理解这一重要树距离的替代方法。
Phylogenetic networks are a special type of graph which generalize phylogenetic trees and that are used to model non-treelike evolutionary processes such as recombination and hybridization. In this paper, we consider {\em unrooted} phylogenetic networks, i.e. simple, connected graphs $\mathcal{N}=(V,E)$ with leaf set $X$, for $X$ some set of species, in which every internal vertex in $\mathcal{N}$ has degree three. One approach used to construct such phylogenetic networks is to take as input a collection $\mathcal{P}$ of phylogenetic trees and to look for a network $\mathcal{N}$ that contains each tree in $\mathcal{P}$ and that minimizes the quantity $r(\mathcal{N}) = |E|-(|V|-1)$ over all such networks. Such a network always exists, and the quantity $r(\mathcal{N})$ for an optimal network $\mathcal{N}$ is called the hybrid number of $\mathcal{P}$. In this paper, we give a new characterization for the hybrid number in case $\mathcal{P}$ consists of two trees. This characterization is given in terms of a cherry picking sequence for the two trees, although to prove that our characterization holds we need to define the sequence more generally for two forests. Cherry picking sequences have been intensively studied for collections of rooted phylogenetic trees, but our new sequences are the first variant of this concept that can be applied in the unrooted setting. Since the hybrid number of two trees is equal to the well-known tree bisection and reconnection distance between the two trees, our new characterization also provides an alternative way to understand this important tree distance.