论文标题
另一个查看半空间深度:带有应用的标志半空间
Another look at halfspace depth: Flag halfspaces with applications
论文作者
论文摘要
半空间深度是多变量空间中非参数统计量的精心研究的工具,自然会诱导分位数的多元概括。相对于度量的点的半空间深度定义为包含给定点的闭合半空间的immim质量。通常,达到该量的封闭半空间不必存在。我们介绍了一个旗帜半空间 - 封闭的半空间与其内部之间的中介。我们证明,半空间的深度也可以在标志半空间方面等效,并且始终存在一个旗帜半空间,其边界通过任何给定的点$ x $传递,并且质量完全等于$ x $的半空间深度。标志半空间使我们能够获得有关半空间深度的理论结果,而无需将绝对连续的措施与包含原子的措施区分开,就像以前经常进行的那样。 FLAG半空间的概念用于陈述对随机样本的半空中中位数的维度的结果。我们证明,在轻度条件下,样本半空间中位数的尺寸为$ d $ - 变量数据不能为$ d-1 $,对于$ d = 2 $,样本半空间中间集必须是二维凸多边形,或一个数据点。后一个结果保证了样品半空间中间形式的计算算法在中值集的中位数$ d = 2 $的情况下,t tukeyregion r package tukeyregion也是如此。
The halfspace depth is a well studied tool of nonparametric statistics in multivariate spaces, naturally inducing a multivariate generalisation of quantiles. The halfspace depth of a point with respect to a measure is defined as the infimum mass of closed halfspaces that contain the given point. In general, a closed halfspace that attains that infimum does not have to exist. We introduce a flag halfspace - an intermediary between a closed halfspace and its interior. We demonstrate that the halfspace depth can be equivalently formulated also in terms of flag halfspaces, and that there always exists a flag halfspace whose boundary passes through any given point $x$, and has mass exactly equal to the halfspace depth of $x$. Flag halfspaces allow us to derive theoretical results regarding the halfspace depth without the need to differentiate absolutely continuous measures from measures containing atoms, as was frequently done previously. The notion of flag halfspaces is used to state results on the dimensionality of the halfspace median set for random samples. We prove that under mild conditions, the dimension of the sample halfspace median set of $d$-variate data cannot be $d-1$, and that for $d=2$ the sample halfspace median set must be either a two-dimensional convex polygon, or a data point. The latter result guarantees that the computational algorithm for the sample halfspace median form the R package TukeyRegion is exact also in the case when the median set is less-than-full-dimensional in dimension $d=2$.