自我监督图像重建方法的验证和概括性无效MRI

论文标题

自我监督图像重建方法的验证和概括性无效MRI

Validation and Generalizability of Self-Supervised Image Reconstruction Methods for Undersampled MRI

论文作者

Yu, Thomas, Hilbert, Tom, Piredda, Gian Franco, Joseph, Arun, Bonanno, Gabriele, Zenkhri, Salim, Omoumi, Patrick, Cuadra, Meritxell Bach, Canales-Rodríguez, Erick Jorge, Kober, Tobias, Thiran, Jean-Philippe

论文摘要

深度学习方法已成为重建先生的采样的最新技术。特别是对于地面真理不可行或不可能的情况，要获取完全采样的数据，重建的自我监管的机器学习方法正在越来越多地使用。但是，在验证此类方法及其概括性方面的潜在问题仍未得到充实。在本文中，我们调查了自我监督算法验证的重要方面，用于重建未采样的MR图像：对前瞻性重建的定量评估，前瞻性和回顾性重建之间的潜在差异，常用的定量衡量标准的适用性和通用性。研究了两种基于自我监督的denoising和先验的深层图像的自我监督算法。将这些方法与使用体内和幻影数据的最小二乘拟合以及压缩传感重建进行了比较。他们的普遍性通过前瞻性采样的数据与培训不同的数据进行了测试。我们表明，相对于回顾性重建/地面真相，前瞻性重建可能表现出严重的失真。此外，与感知度量相比，与像素定量指标的定量指标可能无法准确捕获感知质量的差异。此外，所有方法均显示出泛化的潜力。但是，比其他变化更受解剖/对比度变化的影响。我们进一步表明，无参考图像指标与人类对图像质量的评级很好地对应，以研究概括性。最后，我们证明了一个调整良好的压缩感测重建和学到的denoising在所有数据上都相似地执行。

Deep learning methods have become the state of the art for undersampled MR reconstruction. Particularly for cases where it is infeasible or impossible for ground truth, fully sampled data to be acquired, self-supervised machine learning methods for reconstruction are becoming increasingly used. However potential issues in the validation of such methods, as well as their generalizability, remain underexplored. In this paper, we investigate important aspects of the validation of self-supervised algorithms for reconstruction of undersampled MR images: quantitative evaluation of prospective reconstructions, potential differences between prospective and retrospective reconstructions, suitability of commonly used quantitative metrics, and generalizability. Two self-supervised algorithms based on self-supervised denoising and the deep image prior were investigated. These methods are compared to a least squares fitting and a compressed sensing reconstruction using in-vivo and phantom data. Their generalizability was tested with prospectively under-sampled data from experimental conditions different to the training. We show that prospective reconstructions can exhibit significant distortion relative to retrospective reconstructions/ground truth. Furthermore, pixel-wise quantitative metrics may not capture differences in perceptual quality accurately, in contrast to a perceptual metric. In addition, all methods showed potential for generalization; however, generalizability is more affected by changes in anatomy/contrast than other changes. We further showed that no-reference image metrics correspond well with human rating of image quality for studying generalizability. Finally, we showed that a well-tuned compressed sensing reconstruction and learned denoising perform similarly on all data.

下载PDF全文

下载文献需遵守相关版权规定

论文标题