论文标题

通过识别和消除混杂因素来优化胸部X射线进行图像分析

Optimising Chest X-Rays for Image Analysis by Identifying and Removing Confounding Factors

论文作者

Aslani, Shahab, Lilaonitkul, Watjana, Gnanananthan, Vaishnavi, Raj, Divya, Rangelov, Bojidar, Young, Alexandra L, Hu, Yipeng, Taylor, Paul, Alexander, Daniel C, Jacob, Joseph

论文摘要

在COVID-19大流行期间,在COVID-19诊断的紧急环境中进行的大量成像量导致临床CXR获取的差异很大。在所使用的CXR投影,添加图像注释以及临床图像的旋转程度中可以看到这种变化。图像分析社区试图通过开发自动化的Covid-19诊断算法来减轻大流行期间过度拉伸放射学部门的负担,而COVID-19诊断算法是CXR成像。大量公开可用的CXR数据集已被利用,以改善Covid-19诊断的深度学习算法。然而,公开可用数据集中临床可获得的CXR的可变质量可能会对算法性能产生深远的影响。 COVID-19诊断可以通过图像标签等图像上的非动物特征的算法来推断。这些成像快捷方式可能是数据集特异性的,并限制了AI系统的普遍性。因此,了解和纠正CXR图像中的关键潜在偏差是CXR图像分析之前的重要第一步。在这项研究中,我们提出了一种简单有效的逐步方法,以预处理19 Covid-19胸部X射线数据集以消除不希望的偏见。我们进行消融研究以显示每个单个步骤的影响。结果表明,使用我们提出的管道可能会使基线共证算法的准确性提高到13%。

During the COVID-19 pandemic, the sheer volume of imaging performed in an emergency setting for COVID-19 diagnosis has resulted in a wide variability of clinical CXR acquisitions. This variation is seen in the CXR projections used, image annotations added and in the inspiratory effort and degree of rotation of clinical images. The image analysis community has attempted to ease the burden on overstretched radiology departments during the pandemic by developing automated COVID-19 diagnostic algorithms, the input for which has been CXR imaging. Large publicly available CXR datasets have been leveraged to improve deep learning algorithms for COVID-19 diagnosis. Yet the variable quality of clinically-acquired CXRs within publicly available datasets could have a profound effect on algorithm performance. COVID-19 diagnosis may be inferred by an algorithm from non-anatomical features on an image such as image labels. These imaging shortcuts may be dataset-specific and limit the generalisability of AI systems. Understanding and correcting key potential biases in CXR images is therefore an essential first step prior to CXR image analysis. In this study, we propose a simple and effective step-wise approach to pre-processing a COVID-19 chest X-ray dataset to remove undesired biases. We perform ablation studies to show the impact of each individual step. The results suggest that using our proposed pipeline could increase accuracy of the baseline COVID-19 detection algorithm by up to 13%.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源