论文标题
平衡随机森林的合作培训,以适应开放式域名
Collaborative Training of Balanced Random Forests for Open Set Domain Adaptation
论文作者
论文摘要
在本文中,我们介绍了平衡的随机森林的协作培训算法,该算法以及用于域名适应任务的卷积神经网络。在实际情况下,大多数域适应算法都面临着嘈杂,训练数据不足和开放式设置分类所面临的挑战。在这种情况下,常规方法遭受过度拟合和未能成功将源知识转移到目标域的情况。为了解决这些问题,提出了以下两种技术。首先,我们介绍了具有卷积神经网络的优化决策树构建方法,其中每个节点上的数据分为相等的大小,同时最大化信息增益。由于均匀的约束,它在深层特征上产生了平衡的决策树,这有助于增强的歧视能力和减少过度拟合的问题。其次,为了解决域未对准问题,我们提出了域的一致性损失,这会损失源和目标域数据的不平坦分裂。通过协作优化标记的源数据的信息增益以及未标记的目标数据分布的熵,所提出的COBRF算法的性能明显优于最先进的方法。
In this paper, we introduce a collaborative training algorithm of balanced random forests with convolutional neural networks for domain adaptation tasks. In real scenarios, most domain adaptation algorithms face the challenges from noisy, insufficient training data and open set categorization. In such cases, conventional methods suffer from overfitting and fail to successfully transfer the knowledge of the source to the target domain. To address these issues, the following two techniques are proposed. First, we introduce the optimized decision tree construction method with convolutional neural networks, in which the data at each node are split into equal sizes while maximizing the information gain. It generates balanced decision trees on deep features because of the even-split constraint, which contributes to enhanced discrimination power and reduced overfitting problem. Second, to tackle the domain misalignment problem, we propose the domain alignment loss which penalizes uneven splits of the source and target domain data. By collaboratively optimizing the information gain of the labeled source data as well as the entropy of unlabeled target data distributions, the proposed CoBRF algorithm achieves significantly better performance than the state-of-the-art methods.