论文标题

基于CT的COVID-19分类:深度多任务学习改善了关节识别和严重性量化

CT-based COVID-19 Triage: Deep Multitask Learning Improves Joint Identification and Severity Quantification

论文作者

Goncharov, Mikhail, Pisov, Maxim, Shevtsov, Alexey, Shirokikh, Boris, Kurmukov, Anvar, Blokhin, Ivan, Chernina, Valeria, Solovev, Alexander, Gombolevskiy, Victor, Morozov, Sergey, Belyaev, Mikhail

论文摘要

当前的COVID-19大流行超载超载医疗保健系统,包括放射学部门。尽管开发了几种深度学习方法来协助CT分析,但没有人将研究分类直接视为计算机科学问题。我们描述了两个基本设置:识别Covid-19,以优先考虑对潜在感染患者的研究,以尽早分离它们;严重性量化以突出严重患者的研究,并将其引导到医院或提供紧急医疗服务。我们将这些任务形式化为二进制分类和受影响肺百分比的估计。尽管分别研究了类似的问题,但我们表明现有方法仅为这些设置之一提供合理的质量。我们采用多任务方法来巩固这两种分类方法,并提出卷积神经网络,以将所有可用标签组合在单个模型中。与最流行的多任务方法相反,我们将分类层添加到U-NET最空间详细的上部,而不是底部,较不详细的潜在表示。我们对大约2000个公开可用的CT研究培训了模型,并通过经过精心设计的集合进行对其进行测试,该集合由32例Covid-19研究,30例细菌性肺炎,31例健康患者和30例患有其他肺部病理学的患者模仿典型的患者流动,以模仿门诊医院的典型患者。提出的多任务模型优于基于潜在的模型,并实现ROC AUC评分,范围从0.87+-01(细菌肺炎)到0.97+-01(健康对照组),用于鉴定COVID-19和0.97+-01 Spearman相关性,以实现严重量化的率。我们发布所有代码并创建一个公共排序板,其他社区成员可以在我们的测试数据集上测试其模型。

The current COVID-19 pandemic overloads healthcare systems, including radiology departments. Though several deep learning approaches were developed to assist in CT analysis, nobody considered study triage directly as a computer science problem. We describe two basic setups: Identification of COVID-19 to prioritize studies of potentially infected patients to isolate them as early as possible; Severity quantification to highlight studies of severe patients and direct them to a hospital or provide emergency medical care. We formalize these tasks as binary classification and estimation of affected lung percentage. Though similar problems were well-studied separately, we show that existing methods provide reasonable quality only for one of these setups. We employ a multitask approach to consolidate both triage approaches and propose a convolutional neural network to combine all available labels within a single model. In contrast with the most popular multitask approaches, we add classification layers to the most spatially detailed upper part of U-Net instead of the bottom, less detailed latent representation. We train our model on approximately 2000 publicly available CT studies and test it with a carefully designed set consisting of 32 COVID-19 studies, 30 cases with bacterial pneumonia, 31 healthy patients, and 30 patients with other lung pathologies to emulate a typical patient flow in an out-patient hospital. The proposed multitask model outperforms the latent-based one and achieves ROC AUC scores ranging from 0.87+-01 (bacterial pneumonia) to 0.97+-01 (healthy controls) for Identification of COVID-19 and 0.97+-01 Spearman Correlation for Severity quantification. We release all the code and create a public leaderboard, where other community members can test their models on our test dataset.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源