论文标题
基于CT的COVID-19分类:深度多任务学习改善了关节识别和严重性量化
CT-based COVID-19 Triage: Deep Multitask Learning Improves Joint Identification and Severity Quantification
论文作者
论文摘要
当前的COVID-19大流行超载超载医疗保健系统,包括放射学部门。尽管开发了几种深度学习方法来协助CT分析,但没有人将研究分类直接视为计算机科学问题。我们描述了两个基本设置:识别Covid-19,以优先考虑对潜在感染患者的研究,以尽早分离它们;严重性量化以突出严重患者的研究,并将其引导到医院或提供紧急医疗服务。我们将这些任务形式化为二进制分类和受影响肺百分比的估计。尽管分别研究了类似的问题,但我们表明现有方法仅为这些设置之一提供合理的质量。我们采用多任务方法来巩固这两种分类方法,并提出卷积神经网络,以将所有可用标签组合在单个模型中。与最流行的多任务方法相反,我们将分类层添加到U-NET最空间详细的上部,而不是底部,较不详细的潜在表示。我们对大约2000个公开可用的CT研究培训了模型,并通过经过精心设计的集合进行对其进行测试,该集合由32例Covid-19研究,30例细菌性肺炎,31例健康患者和30例患有其他肺部病理学的患者模仿典型的患者流动,以模仿门诊医院的典型患者。提出的多任务模型优于基于潜在的模型,并实现ROC AUC评分,范围从0.87+-01(细菌肺炎)到0.97+-01(健康对照组),用于鉴定COVID-19和0.97+-01 Spearman相关性,以实现严重量化的率。我们发布所有代码并创建一个公共排序板,其他社区成员可以在我们的测试数据集上测试其模型。
The current COVID-19 pandemic overloads healthcare systems, including radiology departments. Though several deep learning approaches were developed to assist in CT analysis, nobody considered study triage directly as a computer science problem. We describe two basic setups: Identification of COVID-19 to prioritize studies of potentially infected patients to isolate them as early as possible; Severity quantification to highlight studies of severe patients and direct them to a hospital or provide emergency medical care. We formalize these tasks as binary classification and estimation of affected lung percentage. Though similar problems were well-studied separately, we show that existing methods provide reasonable quality only for one of these setups. We employ a multitask approach to consolidate both triage approaches and propose a convolutional neural network to combine all available labels within a single model. In contrast with the most popular multitask approaches, we add classification layers to the most spatially detailed upper part of U-Net instead of the bottom, less detailed latent representation. We train our model on approximately 2000 publicly available CT studies and test it with a carefully designed set consisting of 32 COVID-19 studies, 30 cases with bacterial pneumonia, 31 healthy patients, and 30 patients with other lung pathologies to emulate a typical patient flow in an out-patient hospital. The proposed multitask model outperforms the latent-based one and achieves ROC AUC scores ranging from 0.87+-01 (bacterial pneumonia) to 0.97+-01 (healthy controls) for Identification of COVID-19 and 0.97+-01 Spearman Correlation for Severity quantification. We release all the code and create a public leaderboard, where other community members can test their models on our test dataset.