论文标题
有效的半监督组织学图像分类的教师链链
Teacher-Student chain for efficient semi-supervised histology image classification
论文作者
论文摘要
深度学习显示出数字病理领域的巨大潜力。自动化的数字病理系统可以作为第二读者,在大型筛查研究中进行初始分类或协助报告。但是,详尽注释大型组织学图像数据库是昂贵的,因为医学专家是一种稀缺的资源。在本文中,我们应用了Yalniz等人提出的半监督教师知识蒸馏技术。 (2019)量化结直肠癌的预后特征的任务。我们通过将这种方法扩展到一系列学生来获得准确的提高,在该链中,每个学生的预测都用于培训下一个学生,即学生成为老师。使用链方法,仅标记为0.5%的数据(未标记池中的剩余99.5%),我们匹配100%标记数据的培训的准确性。在标记数据的百分比较低的情况下,可以看到类似的准确性提高,即使最初的标记训练集选择不良,也可以恢复准确性。总之,这种方法显示出减轻注释负担的希望,从而增加了自动化数字病理系统的负担能力。
Deep learning shows great potential for the domain of digital pathology. An automated digital pathology system could serve as a second reader, perform initial triage in large screening studies, or assist in reporting. However, it is expensive to exhaustively annotate large histology image databases, since medical specialists are a scarce resource. In this paper, we apply the semi-supervised teacher-student knowledge distillation technique proposed by Yalniz et al. (2019) to the task of quantifying prognostic features in colorectal cancer. We obtain accuracy improvements through extending this approach to a chain of students, where each student's predictions are used to train the next student i.e. the student becomes the teacher. Using the chain approach, and only 0.5% labelled data (the remaining 99.5% in the unlabelled pool), we match the accuracy of training on 100% labelled data. At lower percentages of labelled data, similar gains in accuracy are seen, allowing some recovery of accuracy even from a poor initial choice of labelled training set. In conclusion, this approach shows promise for reducing the annotation burden, thus increasing the affordability of automated digital pathology systems.