论文标题

培训人口偏见的数据诊断算法的风险

Risk of Training Diagnostic Algorithms on Data with Demographic Bias

论文作者

Abbasi-Sureshjani, Samaneh, Raumanns, Ralf, Michels, Britt E. J., Schouten, Gerard, Cheplygina, Veronika

论文摘要

机器学习应用程序中的关键挑战之一是有公平的预测。在各个领域中,最近有许多示例令人信服地表明,接受偏见数据集训练的算法很容易导致错误或歧视性结论。在临床应用中,这一点更为至关重要,在临床应用中,预测算法的设计主要基于有限或给定的一组医学图像和人口统计学变量,例如年龄,性别和种族。在这项工作中,我们对MICCAI 2018诉讼进行了调查,以调查医学图像分析应用中的共同做法。令人惊讶的是,我们发现专注于诊断的论文很少描述所使用的数据集的人口统计学,并且诊断纯粹基于图像。为了强调考虑诊断任务中人口统计的重要性,我们使用了可公开可用的皮肤病变数据集。然后,我们证明,尽管训练集相对平衡,但基于年龄和性别的亚组的曲线下区域(AUC)的分类器的性能在0.76至0.91之间。此外,我们表明可以通过在对抗性训练设置中明确使用人口统计变量来学习无偏的特征,从而导致每个亚组得分平衡。最后,我们讨论了这些结果的含义,并为进一步的研究提供了建议。

One of the critical challenges in machine learning applications is to have fair predictions. There are numerous recent examples in various domains that convincingly show that algorithms trained with biased datasets can easily lead to erroneous or discriminatory conclusions. This is even more crucial in clinical applications where the predictive algorithms are designed mainly based on a limited or given set of medical images and demographic variables such as age, sex and race are not taken into account. In this work, we conduct a survey of the MICCAI 2018 proceedings to investigate the common practice in medical image analysis applications. Surprisingly, we found that papers focusing on diagnosis rarely describe the demographics of the datasets used, and the diagnosis is purely based on images. In order to highlight the importance of considering the demographics in diagnosis tasks, we used a publicly available dataset of skin lesions. We then demonstrate that a classifier with an overall area under the curve (AUC) of 0.83 has variable performance between 0.76 and 0.91 on subgroups based on age and sex, even though the training set was relatively balanced. Moreover, we show that it is possible to learn unbiased features by explicitly using demographic variables in an adversarial training setup, which leads to balanced scores per subgroups. Finally, we discuss the implications of these results and provide recommendations for further research.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源