屏幕集：用于高分辨率合成乳房X线X射击扫描的多视图深度卷积神经网络

论文标题

屏幕集：用于高分辨率合成乳房X线X射击扫描的多视图深度卷积神经网络

SCREENet: A Multi-view Deep Convolutional Neural Network for Classification of High-resolution Synthetic Mammographic Screening Scans

论文作者

Seyyedi, Saeed, Wong, Margaret J., Ikeda, Debra M., Langlotz, Curtis P.

论文摘要

目的：开发和评估多视图深度学习方法的准确性，以分析数字乳腺乳清合成筛查案例的高分辨率合成乳房X线照片，并评估图像分辨率和训练集大小的准确性。材料和方法：在回顾性研究中，在我们机构中获得的21,264次筛查数字乳房合成（DBT）检查以及相关的放射学报告。来自这些考试的2D合成乳房X线照片图像，具有不同的分辨率和数据集大小，用于训练多视图深卷积神经网络（MV-CNN），以将筛选图像分类为BI-RADS类（BI-RADS 0、1和1），然后在持有考试的评估之前对其进行评估。结果：BI-RADS 0与非BI-RADS 0类的接收器操作特性曲线（AUC）下的面积为0.912，在整个数据集中训练有训练的MV-CNN。该模型的准确性为84.8％，召回95.9％，精度为95.0％。当相同的模型以50％和25％的图像训练时（AUC = 0.877，p = 0.010和0.834，p = 0.009）时，该AUC值降低。同样，当使用1/2和1/4采样的图像训练相同的模型时，性能下降（AUC = 0.870，p = 0.011和0.813，p = 0.009）。结论：这种深度学习模型将高分辨率合成乳房X线摄影扫描分为正常与需要使用数以万计的高分辨率图像进行进一步检查。较小的训练数据集和较低的分辨率图像都会显着下降。

Purpose: To develop and evaluate the accuracy of a multi-view deep learning approach to the analysis of high-resolution synthetic mammograms from digital breast tomosynthesis screening cases, and to assess the effect on accuracy of image resolution and training set size. Materials and Methods: In a retrospective study, 21,264 screening digital breast tomosynthesis (DBT) exams obtained at our institution were collected along with associated radiology reports. The 2D synthetic mammographic images from these exams, with varying resolutions and data set sizes, were used to train a multi-view deep convolutional neural network (MV-CNN) to classify screening images into BI-RADS classes (BI-RADS 0, 1 and 2) before evaluation on a held-out set of exams. Results: Area under the receiver operating characteristic curve (AUC) for BI-RADS 0 vs non-BI-RADS 0 class was 0.912 for the MV-CNN trained on the full dataset. The model obtained accuracy of 84.8%, recall of 95.9% and precision of 95.0%. This AUC value decreased when the same model was trained with 50% and 25% of images (AUC = 0.877, P=0.010 and 0.834, P=0.009 respectively). Also, the performance dropped when the same model was trained using images that were under-sampled by 1/2 and 1/4 (AUC = 0.870, P=0.011 and 0.813, P=0.009 respectively). Conclusion: This deep learning model classified high-resolution synthetic mammography scans into normal vs needing further workup using tens of thousands of high-resolution images. Smaller training data sets and lower resolution images both caused significant decrease in performance.

下载PDF全文

下载文献需遵守相关版权规定

论文标题