使用记忆有效的FCN从全尺寸CT图像进行器官分割

论文标题

使用记忆有效的FCN从全尺寸CT图像进行器官分割

Organ Segmentation From Full-size CT Images Using Memory-Efficient FCN

论文作者

Wang, Chenglong, Oda, Masahiro, Mori, Kensaku

论文摘要

在这项工作中，我们提出了一种与多种内存优化技术合并的充分卷积网络（FCN），以减少训练阶段的运行时GPU内存需求。在医学图像分割任务中，子体积的种植已成为常见的预处理。将亚参数（或较小的贴片量）裁剪以减少GPU记忆需求。但是，较小的贴片量捕获了较少的空间上下文，从而导致精度降低。作为一项试点研究，这项工作的目的是提出一种记忆效率的FCN，使我们能够直接在无需子量裁剪的情况下直接在全尺寸CT图像上训练模型，同时保持分割精度。我们从体系结构和实施中优化我们的网络。随着计算硬件的开发，例如图形处理单元（GPU）和张量处理单元（TPU），现在深度学习应用程序能够在可接受的时间内训练具有大型数据集的网络。在这些应用中，使用完全卷积网络（FCN）的语义细分也与计算机视觉和医疗图像处理领域的传统图像处理方法有了重大改进。但是，与计算机视觉任务中使用的一般颜色图像不同，医学图像的尺度大于3D计算机断层扫描（CT）图像，微型CT图像和组织病理学图像等颜色图像。对于培训这些医学图像，计算资源的巨大需求成为一个严重的问题。在本文中，我们提出了一个记忆效率的FCN，以解决临床CT图像中器官分割问题中高GPU记忆需求挑战。实验结果表明，我们的GPU记忆需求约为基线体系结构的40％，参数量约为基线的30％。

In this work, we present a memory-efficient fully convolutional network (FCN) incorporated with several memory-optimized techniques to reduce the run-time GPU memory demand during training phase. In medical image segmentation tasks, subvolume cropping has become a common preprocessing. Subvolumes (or small patch volumes) were cropped to reduce GPU memory demand. However, small patch volumes capture less spatial context that leads to lower accuracy. As a pilot study, the purpose of this work is to propose a memory-efficient FCN which enables us to train the model on full size CT image directly without subvolume cropping, while maintaining the segmentation accuracy. We optimize our network from both architecture and implementation. With the development of computing hardware, such as graphics processing unit (GPU) and tensor processing unit (TPU), now deep learning applications is able to train networks with large datasets within acceptable time. Among these applications, semantic segmentation using fully convolutional network (FCN) also has gained a significant improvement against traditional image processing approaches in both computer vision and medical image processing fields. However, unlike general color images used in computer vision tasks, medical images have larger scales than color images such as 3D computed tomography (CT) images, micro CT images, and histopathological images. For training these medical images, the large demand of computing resource become a severe problem. In this paper, we present a memory-efficient FCN to tackle the high GPU memory demand challenge in organ segmentation problem from clinical CT images. The experimental results demonstrated that our GPU memory demand is about 40% of baseline architecture, parameter amount is about 30% of the baseline.

下载PDF全文

下载文献需遵守相关版权规定

论文标题