混合距离定义“像我的人”

论文标题

混合距离定义“像我的人”

A blended distance to define "people-like-me"

论文作者

Fopma, Anaïs, Cai, Mingyang, van Buuren, Stef, Vink, Gerko

论文摘要

曲线匹配是一种依赖预测平均匹配的预测技术，该技术与基于预测距离最相似的捐赠者匹配。即使这种方法导致了高预测的准确性，但预测距离可能会使匹配看起来令人信服，因为匹配捐赠者的概况可能与目标的概况有根本不同。为了平衡这一点，可以通过将预测距离与玛哈拉诺省距离组合到“混合距离”度量中来考虑捐助者和目标之间的相似性。在两项模拟研究中评估了该措施的特性。模拟研究I评估了在不同数据生成条件下混合距离的性能。结果表明，朝着马哈拉诺邦距离融合会导致偏见，覆盖和预测能力方面的性能较差。仿真研究II在估算单个值的情况下评估混合度量。结果表明，混合的财产是偏见变化的权衡。给马哈拉氏症距离增加重量会导致弹药的差异较小，但准确性也较小。主要的结论是，通过预测距离实现的高预测准确性必须使捐赠者概况的变化。

Curve matching is a prediction technique that relies on predictive mean matching, which matches donors that are most similar to a target based on the predictive distance. Even though this approach leads to high prediction accuracy, the predictive distance may make matches look unconvincing, as the profiles of the matched donors can substantially differ from the profile of the target. To counterbalance this, similarity between the curves of the donors and the target can be taken into account by combining the predictive distance with the Mahalanobis distance into a `blended distance' measure. The properties of this measure are evaluated in two simulation studies. Simulation study I evaluates the performance of the blended distance under different data-generating conditions. The results show that blending towards the Mahalanobis distance leads to worse performance in terms of bias, coverage, and predictive power. Simulation study II evaluates the blended metric in a setting where a single value is imputed. The results show that a property of blending is the bias-variance trade off. Giving more weight to the Mahalanobis distance leads to less variance in the imputations, but less accuracy as well. The main conclusion is that the high prediction accuracy achieved with the predictive distance necessitates the variability in the profiles of donors.

下载PDF全文

下载文献需遵守相关版权规定

论文标题