论文标题
通过卷积神经网络改善医疗图像分割中的校准和分布外检测
Improving Calibration and Out-of-Distribution Detection in Medical Image Segmentation with Convolutional Neural Networks
论文作者
论文摘要
卷积神经网络(CNN)已证明是强大的医学图像分割模型。在这项研究中,我们解决了有关这些模型的一些主要未解决的问题。具体而言,在小型医学图像数据集上对这些模型的培训仍然具有挑战性,许多研究促进了转移学习等技术。此外,这些模型在测试时臭名昭著,用于产生过度自信的预测和默默失败的情况。在本文中,我们主张多任务学习,即,在几个不同的数据集上训练单个模型,涵盖了几个不同感兴趣的器官和不同的成像方式。我们表明,不仅单个CNN学会自动识别上下文并准确地分割每个上下文中感兴趣的器官,而且与在每个数据集中分别训练的专用模型相比,这种联合模型通常具有更准确且更好地校准的预测。我们的实验表明,多任务学习可以超过医疗图像分割任务中的转移学习。对于检测OOD数据,我们提出了一种基于CNN特征图的光谱分析的方法。我们表明,代表不同成像方式和/或不同感兴趣的器官的不同数据集具有不同的光谱签名,这些频谱签名可用于识别测试图像是否类似于用于训练模型的图像。我们表明,基于预测不确定性,这种方法比OOD检测要准确得多。本文提出的方法显着提高了基于CNN的医学图像分割模型的准确性和可靠性。
Convolutional Neural Networks (CNNs) have shown to be powerful medical image segmentation models. In this study, we address some of the main unresolved issues regarding these models. Specifically, training of these models on small medical image datasets is still challenging, with many studies promoting techniques such as transfer learning. Moreover, these models are infamous for producing over-confident predictions and for failing silently when presented with out-of-distribution (OOD) data at test time. In this paper, we advocate for multi-task learning, i.e., training a single model on several different datasets, spanning several different organs of interest and different imaging modalities. We show that not only a single CNN learns to automatically recognize the context and accurately segment the organ of interest in each context, but also that such a joint model often has more accurate and better-calibrated predictions than dedicated models trained separately on each dataset. Our experiments show that multi-task learning can outperform transfer learning in medical image segmentation tasks. For detecting OOD data, we propose a method based on spectral analysis of CNN feature maps. We show that different datasets, representing different imaging modalities and/or different organs of interest, have distinct spectral signatures, which can be used to identify whether or not a test image is similar to the images used to train a model. We show that this approach is far more accurate than OOD detection based on prediction uncertainty. The methods proposed in this paper contribute significantly to improving the accuracy and reliability of CNN-based medical image segmentation models.