论文标题
保存隐私的联合特级树
Federated Extra-Trees with Privacy Preserving
论文作者
论文摘要
通常观察到数据散布在各处,难以集中。数据隐私和安全性也成为一个敏感的话题。诸如欧盟一般数据保护法规(GDPR)之类的法律和法规旨在保护公众的数据隐私。但是,机器学习需要大量数据才能提高性能,并且当前情况使部署现实生活中的AI应用程序处于极为困难的情况。为了应对这些挑战,在本文中,我们提出了一种新颖的保护隐私的联合机器学习模型,名为Federated Fried-Trees,该模型在联合树模型中应用了当地的差异隐私。开发了安全的多机构学习系统,以通过在不交换任何原始数据的情况下共同处理建模,从而提供出色的性能。我们通过在公共数据集上进行广泛的实验来验证工作的准确性,并通过模拟现实世界情景来验证效率和鲁棒性。总体而言,我们提出了一种可扩展,可扩展和实用的解决方案来解决数据岛问题。
It is commonly observed that the data are scattered everywhere and difficult to be centralized. The data privacy and security also become a sensitive topic. The laws and regulations such as the European Union's General Data Protection Regulation (GDPR) are designed to protect the public's data privacy. However, machine learning requires a large amount of data for better performance, and the current circumstances put deploying real-life AI applications in an extremely difficult situation. To tackle these challenges, in this paper we propose a novel privacy-preserving federated machine learning model, named Federated Extra-Trees, which applies local differential privacy in the federated trees model. A secure multi-institutional machine learning system was developed to provide superior performance by processing the modeling jointly on different clients without exchanging any raw data. We have validated the accuracy of our work by conducting extensive experiments on public datasets and the efficiency and robustness were also verified by simulating the real-world scenarios. Overall, we presented an extensible, scalable and practical solution to handle the data island problem.