用嘈杂的标签进行沟通效率的强大联合学习

论文标题

用嘈杂的标签进行沟通效率的强大联合学习

Communication-Efficient Robust Federated Learning with Noisy Labels

论文作者

Li, Junyi, Pei, Jian, Huang, Heng

论文摘要

联合学习（FL）是在分布式的数据上进行的有希望的保护隐私的机器学习范式。在FL中，每个用户在本地保存数据。这样可以保护用户隐私，但也使服务器难以验证数据质量，尤其是在正确标记数据的情况下。用损坏的标签培训对联邦学习任务有害；但是，在标签噪声的情况下，很少关注FL。在本文中，我们专注于这个问题，并提出一种基于学习的重新加权方法，以减轻FL中嘈杂标签的效果。更确切地说，我们为每个培训样本调整了一个重量，以使学习模型在验证集上具有最佳的概括性能。更正式的是，该过程可以作为联合双重优化问题进行配合。二重性优化问题是一种优化问题，具有两个纠缠问题的级别。非分布的二聚体问题最近通过新的有效算法见证了显着的进展。但是，解决联合学习设置下的二重性优化问题的评价不足。我们确定高级评估中的高沟通成本是主要的瓶颈。因此，我们建议\ textIt {comm-fedbio}解决一般联合的二聚体优化问题；更具体地说，我们提出了两个沟通效率的子例程，以估计高度级别。还提供了所提出算法的收敛分析。最后，我们应用提出的算法来解决嘈杂的标签问题。与各种基线相比，我们的方法在几个现实世界数据集上表现出了出色的性能。

Federated learning (FL) is a promising privacy-preserving machine learning paradigm over distributed located data. In FL, the data is kept locally by each user. This protects the user privacy, but also makes the server difficult to verify data quality, especially if the data are correctly labeled. Training with corrupted labels is harmful to the federated learning task; however, little attention has been paid to FL in the case of label noise. In this paper, we focus on this problem and propose a learning-based reweighting approach to mitigate the effect of noisy labels in FL. More precisely, we tuned a weight for each training sample such that the learned model has optimal generalization performance over a validation set. More formally, the process can be formulated as a Federated Bilevel Optimization problem. Bilevel optimization problem is a type of optimization problem with two levels of entangled problems. The non-distributed bilevel problems have witnessed notable progress recently with new efficient algorithms. However, solving bilevel optimization problems under the Federated Learning setting is under-investigated. We identify that the high communication cost in hypergradient evaluation is the major bottleneck. So we propose \textit{Comm-FedBiO} to solve the general Federated Bilevel Optimization problems; more specifically, we propose two communication-efficient subroutines to estimate the hypergradient. Convergence analysis of the proposed algorithms is also provided. Finally, we apply the proposed algorithms to solve the noisy label problem. Our approach has shown superior performance on several real-world datasets compared to various baselines.

下载PDF全文

下载文献需遵守相关版权规定

论文标题