论文标题
通过知识转移的联合半监督域的适应
Federated Semi-Supervised Domain Adaptation via Knowledge Transfer
论文作者
论文摘要
鉴于机器学习环境快速变化和昂贵的数据标记,当来自源域的标记数据与来自目标域的部分标记的数据在统计上不同时,必须进行半监督域的适应(SSDA)。大多数先前的SSDA研究都在集中进行,需要访问源和目标数据。但是,如今许多字段中的数据是由分布式终端设备生成的。由于隐私问题,数据可能是本地存储的,无法共享,从而导致现有SSDA研究的无效性。本文提出了一种创新的方法,可以通过由联邦半监督域适应(FSSDA)命名的多个分布式和机密数据集实现SSDA。 FSSDA基于战略设计的知识蒸馏技术将SSDA与联合学习集成在一起,该技术通过并行执行源和目标培训来提高效率。此外,FSSDA通过正确选择关键参数(即模仿参数)来控制跨域传输的知识量。此外,提出的FSSDA可以有效地推广到多源域适应方案。进行了广泛的实验,以证明FSSDA设计的有效性和效率。
Given the rapidly changing machine learning environments and expensive data labeling, semi-supervised domain adaptation (SSDA) is imperative when the labeled data from the source domain is statistically different from the partially labeled data from the target domain. Most prior SSDA research is centrally performed, requiring access to both source and target data. However, data in many fields nowadays is generated by distributed end devices. Due to privacy concerns, the data might be locally stored and cannot be shared, resulting in the ineffectiveness of existing SSDA research. This paper proposes an innovative approach to achieve SSDA over multiple distributed and confidential datasets, named by Federated Semi-Supervised Domain Adaptation (FSSDA). FSSDA integrates SSDA with federated learning based on strategically designed knowledge distillation techniques, whose efficiency is improved by performing source and target training in parallel. Moreover, FSSDA controls the amount of knowledge transferred across domains by properly selecting a key parameter, i.e., the imitation parameter. Further, the proposed FSSDA can be effectively generalized to multi-source domain adaptation scenarios. Extensive experiments are conducted to demonstrate the effectiveness and efficiency of FSSDA design.