论文标题

有效的半监督组织学图像分类的教师链链

Teacher-Student chain for efficient semi-supervised histology image classification

论文作者

Shaw, Shayne, Pajak, Maciej, Lisowska, Aneta, Tsaftaris, Sotirios A, O'Neil, Alison Q

论文摘要

深度学习显示出数字病理领域的巨大潜力。自动化的数字病理系统可以作为第二读者,在大型筛查研究中进行初始分类或协助报告。但是,详尽注释大型组织学图像数据库是昂贵的,因为医学专家是一种稀缺的资源。在本文中,我们应用了Yalniz等人提出的半监督教师知识蒸馏技术。 (2019)量化结直肠癌的预后特征的任务。我们通过将这种方法扩展到一系列学生来获得准确的提高,在该链中,每个学生的预测都用于培训下一个学生,即学生成为老师。使用链方法,仅标记为0.5%的数据(未标记池中的剩余99.5%),我们匹配100%标记数据的培训的准确性。在标记数据的百分比较低的情况下,可以看到类似的准确性提高,即使最初的标记训练集选择不良,也可以恢复准确性。总之,这种方法显示出减轻注释负担的希望,从而增加了自动化数字病理系统的负担能力。

Deep learning shows great potential for the domain of digital pathology. An automated digital pathology system could serve as a second reader, perform initial triage in large screening studies, or assist in reporting. However, it is expensive to exhaustively annotate large histology image databases, since medical specialists are a scarce resource. In this paper, we apply the semi-supervised teacher-student knowledge distillation technique proposed by Yalniz et al. (2019) to the task of quantifying prognostic features in colorectal cancer. We obtain accuracy improvements through extending this approach to a chain of students, where each student's predictions are used to train the next student i.e. the student becomes the teacher. Using the chain approach, and only 0.5% labelled data (the remaining 99.5% in the unlabelled pool), we match the accuracy of training on 100% labelled data. At lower percentages of labelled data, similar gains in accuracy are seen, allowing some recovery of accuracy even from a poor initial choice of labelled training set. In conclusion, this approach shows promise for reducing the annotation burden, thus increasing the affordability of automated digital pathology systems.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源