论文标题
标记说明在生物医学图像分析中很重要
Labeling instructions matter in biomedical image analysis
论文作者
论文摘要
生物医学图像分析算法验证取决于参考数据集的高质量注释,该标记指令是关键。尽管它们的重要性,但他们的优化仍然没有得到探索。在这里,我们介绍了对标签指令及其对该领域注释质量的影响的首次系统研究。通过对Miccai协会注册的专业实践和国际比赛的全面检查,我们发现了注释者对标签说明的标签需求及其当前质量和可用性之间的差异。基于对来自四家专业公司的156个注释者和708个亚马逊机械Turk(MTURK)人群工人的14,040张图像的分析,使用具有不同信息密度级别的说明,我们进一步发现,包括示例性图像显着增强注释性能与单纯的描述相比,同时扩展文本得分并不扩展文本得分并不伸展文本得分。最后,专业注释者的表现不断优于mturk人群工人。我们的研究提高了对生物医学图像分析标签指令中质量标准的需求的认识。
Biomedical image analysis algorithm validation depends on high-quality annotation of reference datasets, for which labeling instructions are key. Despite their importance, their optimization remains largely unexplored. Here, we present the first systematic study of labeling instructions and their impact on annotation quality in the field. Through comprehensive examination of professional practice and international competitions registered at the MICCAI Society, we uncovered a discrepancy between annotators' needs for labeling instructions and their current quality and availability. Based on an analysis of 14,040 images annotated by 156 annotators from four professional companies and 708 Amazon Mechanical Turk (MTurk) crowdworkers using instructions with different information density levels, we further found that including exemplary images significantly boosts annotation performance compared to text-only descriptions, while solely extending text descriptions does not. Finally, professional annotators constantly outperform MTurk crowdworkers. Our study raises awareness for the need of quality standards in biomedical image analysis labeling instructions.