安全和容忍的分散学习

论文标题

安全和容忍的分散学习

Secure and Fault Tolerant Decentralized Learning

论文作者

Prakash, Saurav, Hashemi, Hanieh, Wang, Yongqin, Annavaram, Murali, Avestimehr, Salman

论文摘要

联合学习（FL）是一个有希望的范式，用于训练全球模型，而不是分布在多个数据所有者的数据的情况下，而无需集中客户的原始数据。但是，本地模型更新的共享还可以揭示客户端本地数据集的信息。 META等公司最近部署了FL服务器内的可信执行环境（TEE）。但是，安全汇总可能会遭受易于出错的本地更新，而客户在培训期间由于设备故障而导致的培训中变得错误。此外，跨客户的数据异质性使减轻故障挑战性，因为即使是普通客户的更新也不相同。因此，将大多数先前的容错方法视为与其他大多数更新不同的本地更新的情况，其性能差。我们建议不同的fl，以使模型聚合安全以及对故障的鲁棒。在Diversefl中，任何本地模型更新与其相关指南更新有所不同的客户端都被标记为有故障。为了实现我们的新型每客户标准以缓解故障，Diversefl在FL服务器中创建了一个基于TEE的安全飞地，除了执行安全汇总以执行全球模型更新步骤外，还可以在培训前仅在培训前从每个客户端获得一次本地数据样本，并在培训过程中计算每个参与客户的更新。因此，Diversefl提供了防止隐私泄漏以及对错误客户的鲁棒性的安全性。在实验中，不同的FLLEFL始终在绝对测试准确性方面取得了显着提高，而不是先前的断层缓解基准。 Diversefl还与OraclesGD紧密相关，服务器仅结合了普通客户端的更新。我们还分析了在非IID数据和标准凸度假设下的多样性融合率。

Federated learning (FL) is a promising paradigm for training a global model over data distributed across multiple data owners without centralizing clients' raw data. However, sharing of local model updates can also reveal information of clients' local datasets. Trusted execution environments (TEEs) within the FL server have been recently deployed by companies like Meta for secure aggregation. However, secure aggregation can suffer from error-prone local updates sent by clients that become faulty during training due to underlying device malfunctions. Also, data heterogeneity across clients makes fault mitigation challenging, as even updates from normal clients are dissimilar. Thus, most of the prior fault tolerant methods, which treat any local update differing from the majority of other updates as faulty, perform poorly. We propose DiverseFL to make model aggregation secure as well as robust to faults. In DiverseFL, any client whose local model update diverges from its associated guiding update is tagged as being faulty. To implement our novel per-client criteria for fault mitigation, DiverseFL creates a TEE-based secure enclave within the FL server, which in addition to performing secure aggregation for carrying out the global model update step, securely receives a small representative sample of local data from each client only once before training, and computes guiding updates for each participating client during training. Thus, DiverseFL provides security against privacy leakage as well as robustness against faulty clients. In experiments, DiverseFL consistently achieves significant improvements in absolute test accuracy over prior fault mitigation benchmarks. DiverseFL also performs closely to OracleSGD, where server combines updates only from the normal clients. We also analyze the convergence rate of DiverseFL under non-IID data and standard convexity assumptions.

下载PDF全文

下载文献需遵守相关版权规定

论文标题