论文标题

具有深度特征的视觉单词袋(bovw) - 乳腺肿瘤数据集有限数据集的补丁分类模型

Bag of Visual Words (BoVW) with Deep Features -- Patch Classification Model for Limited Dataset of Breast Tumours

论文作者

Tripathi, Suvidha, Singh, Satish Kumar, Kuan, Lee Hwee

论文摘要

当前,计算复杂性限制了使用卷积神经网络对高分辨率Gigapixel图像的训练。因此,将这些图像分为斑块或瓷砖。因此,这些高分辨率贴片是用歧视性信息编码的;在这些贴片上对CNN进行了训练,以执行补丁级的预测。但是,补丁级预测的问题是,病理学家通常在图像级上注释,而不是在斑块级别上。由于这种限制,大多数补丁可能不包含足够的相关功能。通过这项工作,我们试图通过使用视觉单词袋(BOVW)作为一种正规化来提高可推广性,将斑块描述能力纳入深层框架中。使用此假设,我们旨在构建一个基于斑块的分类器,以区分四类的乳房活检图像贴片(正常,良性,\ textit {intu {intu}癌,侵入性癌)。任务是使用CNN合并高质量的深度功能,以在图像中描述相关信息,同时使用视觉单词袋(BOVW)同时丢弃无关的信息。所提出的方法通过通过预训练的CNN获得从WSI和显微镜图像获得的斑块,以提取特征。 BOVW用作特征选择器,以选择CNN功能之间的大多数判别功能。最后,所选的功能集被归类为四个类之一。混合模型在选择预萃取的预训练模型方面提供了灵活性。该管道是端到端的,因为它不需要补丁预测的后处理来选择区分贴片。我们将观察结果与BACH-2018挑战数据集中的Resnet50,Densenet169和InceptionV3等最新方法进行了比较。我们提出的方法比所有三种方法显示出更好的性能。

Currently, the computational complexity limits the training of high resolution gigapixel images using Convolutional Neural Networks. Therefore, such images are divided into patches or tiles. Since, these high resolution patches are encoded with discriminative information therefore; CNNs are trained on these patches to perform patch-level predictions. However, the problem with patch-level prediction is that pathologist generally annotates at image-level and not at patch level. Due to this limitation most of the patches may not contain enough class-relevant features. Through this work, we tried to incorporate patch descriptive capability within the deep framework by using Bag of Visual Words (BoVW) as a kind of regularisation to improve generalizability. Using this hypothesis, we aim to build a patch based classifier to discriminate between four classes of breast biopsy image patches (normal, benign, \textit{In situ} carcinoma, invasive carcinoma). The task is to incorporate quality deep features using CNN to describe relevant information in the images while simultaneously discarding irrelevant information using Bag of Visual Words (BoVW). The proposed method passes patches obtained from WSI and microscopy images through pre-trained CNN to extract features. BoVW is used as a feature selector to select most discriminative features among the CNN features. Finally, the selected feature sets are classified as one of the four classes. The hybrid model provides flexibility in terms of choice of pre-trained models for feature extraction. The pipeline is end-to-end since it does not require post processing of patch predictions to select discriminative patches. We compared our observations with state-of-the-art methods like ResNet50, DenseNet169, and InceptionV3 on the BACH-2018 challenge dataset. Our proposed method shows better performance than all the three methods.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源