论文标题
丰富的皮特曼流程
Enriched Pitman-Yor processes
论文作者
论文摘要
在贝叶斯非参数中,存在各种各样的离散先验,包括Dirichlet流程及其概括,这些过程如今已成为公认的工具。尽管取得了显着的进步,但很少有针对建模在产品空间上的观测值的量身定制的建议,例如$ \ Mathbb {r}^p $。确实,对于多元随机度量,大多数可用的先验缺乏灵活性,并且不允许在空间之间单独的分区结构。我们介绍了一个旨在解决这些问题的离散的非参数先验,称为富集的Pitman-Yor流程(EPY)。该小说先验的理论特性得到了广泛的研究。我们讨论了其与丰富的迪里奇过程的形式联系并进行了归一化的随机度量,我们描述了一个突破的表示形式,并获得了后定律和所涉及的urn方案的封闭形式表达式。在第二位,我们表明了几种现有方法,包括具有尖峰和平板基础测量和混合模型的混合物的差异方法,这些方法隐含地依赖于EPY的特殊情况,因此,这些案例构成了许多贝叶斯非参数检验的统一概率框架。有趣的是,我们的统一配方将使我们能够自然扩展这些模型,同时保持其分析性障碍。作为例证,我们将EPY用于生态学和电子商务应用中的功能聚类中的物种抽样问题。
In Bayesian nonparametrics there exists a rich variety of discrete priors, including the Dirichlet process and its generalizations, which are nowadays well-established tools. Despite the remarkable advances, few proposals are tailored for modeling observations lying on product spaces, such as $\mathbb{R}^p$. Indeed, for multivariate random measures, most available priors lack flexibility and do not allow for separate partition structures among the spaces. We introduce a discrete nonparametric prior, termed enriched Pitman-Yor process (EPY), aimed at addressing these issues. Theoretical properties of this novel prior are extensively investigated. We discuss its formal link with the enriched Dirichlet process and normalized random measures, we describe a square-breaking representation and we obtain closed-form expressions for the posterior law and the involved urn schemes. In second place, we show that several existing approaches, including Dirichlet processes with a spike and slab base measure and mixture of mixtures models, implicitly rely on special cases of the EPY, which therefore constitutes a unified probabilistic framework for many Bayesian nonparametric priors. Interestingly, our unifying formulation will allow us to naturally extend these models while preserving their analytical tractability. As an illustration, we employ the EPY for a species sampling problem in ecology and for functional clustering in an e-commerce application.