论文标题
FEDOCR:场景文本识别的沟通效率的联合学习
FedOCR: Communication-Efficient Federated Learning for Scene Text Recognition
论文作者
论文摘要
尽管现场文本识别技术已被广泛用于商业应用中,但该研究社区很少考虑数据隐私。大多数现有算法都采用了一组共享或集中式培训数据。但是,在实践中,数据可以分布在不同的本地设备上,这些设备由于隐私限制而无法共享。在本文中,我们研究了如何利用分散的数据集来训练强大的场景文本识别器,同时使其保持在本地设备上。据我们所知,我们提出了第一个利用联合学习的框架来进行场景文本识别,该识别是通过分散的数据集进行协作培训的。因此,我们将其命名为Fedocr。为了使FedCor相当适合在最终设备上部署,我们进行了两个改进,包括使用轻巧的型号和哈希技术。我们认为,就联邦学习的沟通效率而言,这两者对Fedocr至关重要。分散数据集的模拟表明,拟议中的Fedocr为经过集中数据培训的模型取得了竞争成果,其通信成本和更高级别的隐私性具有较少的成本。
While scene text recognition techniques have been widely used in commercial applications, data privacy has rarely been taken into account by this research community. Most existing algorithms have assumed a set of shared or centralized training data. However, in practice, data may be distributed on different local devices that can not be centralized to share due to the privacy restrictions. In this paper, we study how to make use of decentralized datasets for training a robust scene text recognizer while keeping them stay on local devices. To the best of our knowledge, we propose the first framework leveraging federated learning for scene text recognition, which is trained with decentralized datasets collaboratively. Hence we name it FedOCR. To make FedCOR fairly suitable to be deployed on end devices, we make two improvements including using lightweight models and hashing techniques. We argue that both are crucial for FedOCR in terms of the communication efficiency of federated learning. The simulations on decentralized datasets show that the proposed FedOCR achieves competitive results to the models that are trained with centralized data, with fewer communication costs and higher-level privacy-preserving.