通过增强学习选择合成样品

论文标题

通过增强学习选择合成样品

Synthetic Sample Selection via Reinforcement Learning

论文作者

Ye, Jiarong, Xue, Yuan, Long, L. Rodney, Antani, Sameer, Xue, Zhiyun, Cheng, Keith, Huang, Xiaolei

论文摘要

综合现实的医学图像为基于深度学习的医学图像识别系统中培训数据短缺提供了可行的解决方案。但是，用于数据增强目的的合成图像的质量控制不足，并且某些生成的图像不现实，并且可能包含与真实图像混合时扭曲数据分布的误导性特征。因此，这些合成图像在医学图像识别系统中的有效性在没有质量保证的情况下随机添加时无法保证它们。在这项工作中，我们提出了基于增强学习（RL）的合成样品选择方法，该方法学会选择包含可靠和信息特征的合成图像。基于变压器的控制器通过验证分类精度作为奖励通过近端策略优化（PPO）进行培训。选定的图像与原始培训数据混合，以改善对图像识别系统的训练。为了验证我们的方法，我们以病理图像识别为例，并在两个组织病理学图像数据集上进行广泛的实验。在宫颈数据集和淋巴结数据集的实验中，当利用由我们的RL框架选择的高质量合成图像时，图像分类性能分别提高了8.1％和2.3％。我们提出的合成样品选择方法是一般的，并且具有有限的注释，可以提高各种医学图像识别系统的性能。

Synthesizing realistic medical images provides a feasible solution to the shortage of training data in deep learning based medical image recognition systems. However, the quality control of synthetic images for data augmentation purposes is under-investigated, and some of the generated images are not realistic and may contain misleading features that distort data distribution when mixed with real images. Thus, the effectiveness of those synthetic images in medical image recognition systems cannot be guaranteed when they are being added randomly without quality assurance. In this work, we propose a reinforcement learning (RL) based synthetic sample selection method that learns to choose synthetic images containing reliable and informative features. A transformer based controller is trained via proximal policy optimization (PPO) using the validation classification accuracy as the reward. The selected images are mixed with the original training data for improved training of image recognition systems. To validate our method, we take the pathology image recognition as an example and conduct extensive experiments on two histopathology image datasets. In experiments on a cervical dataset and a lymph node dataset, the image classification performance is improved by 8.1% and 2.3%, respectively, when utilizing high-quality synthetic images selected by our RL framework. Our proposed synthetic sample selection method is general and has great potential to boost the performance of various medical image recognition systems given limited annotation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题