论文标题

基于动态融合的联合学习,用于COVID-19检测

Dynamic Fusion based Federated Learning for COVID-19 Detection

论文作者

Zhang, Weishan, Zhou, Tao, Lu, Qinghua, Wang, Xiao, Zhu, Chunsheng, Sun, Haoyun, Wang, Zhipeng, Lo, Sin Kit, Wang, Fei-Yue

论文摘要

使用机器学习的医学诊断图像分析(例如,CT扫描或X射线)是检测COVID-19感染的有效而准确的方法。但是,由于患者隐私的关注,通常不允许在医疗机构之间共享诊断图像。这导致数据集的问题不足以训练图像分类模型。 Federated Learning是一种新兴的隐私机器学习范式,它基于由客户培训的本地模型的收到的更新而无需交换客户的本地数据而产生无偏见的全球模型。然而,联合学习的默认设置引入了传输模型更新的巨大沟通成本,并且在客户的数据异质性大量存在时几乎无法确保模型性能。为了提高沟通效率和模型性能,在本文中,我们提出了一种基于动态融合的新型联合学习方法,用于医学诊断图像分析,以检测COVID-19的感染。首先,我们为基于动态融合的联合学习系统设计了一个体系结构,以分析医学诊断图像。此外,我们提出了一种动态的融合方法,可以根据本地模型性能动态决定参与客户,并安排基于参与客户的培训时间的模型融合。此外,我们总结了用于COVID-19检测的医学诊断图像数据集类别,可以由机器学习社区使用该数据集进行图像分析。评估结果表明,所提出的方法是可行的,并且在模型性能,沟通效率和容错性方面的默认设置比联合学习的默认设置更好。

Medical diagnostic image analysis (e.g., CT scan or X-Ray) using machine learning is an efficient and accurate way to detect COVID-19 infections. However, sharing diagnostic images across medical institutions is usually not allowed due to the concern of patients' privacy. This causes the issue of insufficient datasets for training the image classification model. Federated learning is an emerging privacy-preserving machine learning paradigm that produces an unbiased global model based on the received updates of local models trained by clients without exchanging clients' local data. Nevertheless, the default setting of federated learning introduces huge communication cost of transferring model updates and can hardly ensure model performance when data heterogeneity of clients heavily exists. To improve communication efficiency and model performance, in this paper, we propose a novel dynamic fusion-based federated learning approach for medical diagnostic image analysis to detect COVID-19 infections. First, we design an architecture for dynamic fusion-based federated learning systems to analyse medical diagnostic images. Further, we present a dynamic fusion method to dynamically decide the participating clients according to their local model performance and schedule the model fusion-based on participating clients' training time. In addition, we summarise a category of medical diagnostic image datasets for COVID-19 detection, which can be used by the machine learning community for image analysis. The evaluation results show that the proposed approach is feasible and performs better than the default setting of federated learning in terms of model performance, communication efficiency and fault tolerance.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源