论文标题
词汇和语音语言历史联合推断的离散矩阵演化的系统发育模型
A Phylogenetic Model of the Evolution of Discrete Matrices for the Joint Inference of Lexical and Phonological Language Histories
论文作者
论文摘要
我们提出了沿系统发育树的基质演变的模型,其中转换会影响基质的整行行或列。这代表了语言数据的词汇和语音方面的变化,通过允许出现新单词并进行系统的语音变化以影响整个词汇。我们实施了一种顺序的蒙特卡洛方法来从后验分布中采样,并共同推断出代表同源出生和语音转换的系统发育,模型参数和潜在变量。我们成功地将此方法应用于中等大小的合成和真实数据。
We propose a model of the evolution of a matrix along a phylogenetic tree, in which transformations affect either entire rows or columns of the matrix. This represents the change of both lexical and phonological aspects of linguistic data, by allowing for new words to appear and for systematic phonological changes to affect the entire vocabulary. We implement a Sequential Monte Carlo method to sample from the posterior distribution, and infer jointly the phylogeny, model parameters, and latent variables representing cognate births and phonological transformations. We successfully apply this method to synthetic and real data of moderate size.