论文标题
深度学习胃肠道内窥镜检查和疾病实例的检测和分割
Deep learning for detection and segmentation of artefact and disease instances in gastrointestinal endoscopy
论文作者
论文摘要
内窥镜计算机视觉挑战(ENDOCV)是一项众包计划,旨在解决开发可靠的计算机辅助检测和诊断内窥镜系统的显着问题,并提出了用于技术临床翻译的途径。虽然内窥镜检查是一种广泛使用的诊断和治疗工具,用于空心孔,但内窥镜医生经常面临一些核心挑战,主要是:1)存在多级伪像的存在,这些人工制品的存在会阻碍其视觉解释,而2)难以识别微妙的前癌前体和癌症异常。人工制品通常会影响应用于胃肠道器官的深度学习方法的鲁棒性,因为它们可以与感兴趣的组织混淆。 EndoCV2020挑战旨在解决这些汇款中的研究问题。在本文中,我们介绍了由前17个团队开发的方法的摘要,并提供了由参与者设计的最新方法和方法的客观比较:i)i)伪影检测和细分(EAD2020),以及II)疾病检测和分割(EDD2020)。 EAD2020和EDD2020子挑战者均编译了多中心,多器官,多级和多模式临床内窥镜数据集。还评估了检测算法的样本外概括能力。尽管大多数团队都专注于准确的改进,但只有少数方法可信赖临床可用性。表现最好的团队通过探索数据扩展,数据融合和最佳类阈值技术来解决级别,原点,模式和事件的阶级失衡以及变化的解决方案。
The Endoscopy Computer Vision Challenge (EndoCV) is a crowd-sourcing initiative to address eminent problems in developing reliable computer aided detection and diagnosis endoscopy systems and suggest a pathway for clinical translation of technologies. Whilst endoscopy is a widely used diagnostic and treatment tool for hollow-organs, there are several core challenges often faced by endoscopists, mainly: 1) presence of multi-class artefacts that hinder their visual interpretation, and 2) difficulty in identifying subtle precancerous precursors and cancer abnormalities. Artefacts often affect the robustness of deep learning methods applied to the gastrointestinal tract organs as they can be confused with tissue of interest. EndoCV2020 challenges are designed to address research questions in these remits. In this paper, we present a summary of methods developed by the top 17 teams and provide an objective comparison of state-of-the-art methods and methods designed by the participants for two sub-challenges: i) artefact detection and segmentation (EAD2020), and ii) disease detection and segmentation (EDD2020). Multi-center, multi-organ, multi-class, and multi-modal clinical endoscopy datasets were compiled for both EAD2020 and EDD2020 sub-challenges. The out-of-sample generalization ability of detection algorithms was also evaluated. Whilst most teams focused on accuracy improvements, only a few methods hold credibility for clinical usability. The best performing teams provided solutions to tackle class imbalance, and variabilities in size, origin, modality and occurrences by exploring data augmentation, data fusion, and optimal class thresholding techniques.