用于文本情感域适应的课程自行车gan，多个来源

论文标题

用于文本情感域适应的课程自行车gan，多个来源

Curriculum CycleGAN for Textual Sentiment Domain Adaptation with Multiple Sources

论文作者

Zhao, Sicheng, Xiao, Yang, Guo, Jiang, Yue, Xiangyu, Yang, Jufeng, Krishna, Ravi, Xu, Pengfei, Keutzer, Kurt

论文摘要

对社交网络中用户生成的评论或评论的情感分析可以帮助企业分析客户的反馈并采取相应的措施以进行改进。为了减轻目标域上的大规模注释，域适应性（DA）通过从其他标记的源域中学习可转移的模型来提供替代解决方案。现有的多源域适应（MDA）方法要么无法在目标域中提取与情感相关的某些歧视性特征，要么忽略了不同来源的相关性以及即使在同一来源中的不同子域之间的相关性以及在不同训练阶段的不同子域之间的分布差异。在本文中，我们提出了一个新颖的实例级MDA框架，称为课程循环一致的生成对抗网络（C-Cyclegan），以解决上述问题。 Specifically, C-CycleGAN consists of three components: (1) pre-trained text encoder which encodes textual input from different domains into a continuous representation space, (2) intermediate domain generator with curriculum instance-level adaptation which bridges the gap across source and target domains, and (3) task classifier trained on the intermediate domain for final sentiment classification. C-Cyclegan以实例级的源样本传输到一个中间域，该域更接近目标域，并保留了情感语义，而不会丢失判别特征。此外，我们的动态实例级加权机制可以在每个训练阶段将最佳权重分配给不同的源样本。我们在三个基准数据集上进行了广泛的实验，并对最先进的DA方法取得了可观的收益。我们的源代码在以下网址发布：https：//github.com/warushrush/curriculum-cyclegan。

Sentiment analysis of user-generated reviews or comments on products and services in social networks can help enterprises to analyze the feedback from customers and take corresponding actions for improvement. To mitigate large-scale annotations on the target domain, domain adaptation (DA) provides an alternate solution by learning a transferable model from other labeled source domains. Existing multi-source domain adaptation (MDA) methods either fail to extract some discriminative features in the target domain that are related to sentiment, neglect the correlations of different sources and the distribution difference among different sub-domains even in the same source, or cannot reflect the varying optimal weighting during different training stages. In this paper, we propose a novel instance-level MDA framework, named curriculum cycle-consistent generative adversarial network (C-CycleGAN), to address the above issues. Specifically, C-CycleGAN consists of three components: (1) pre-trained text encoder which encodes textual input from different domains into a continuous representation space, (2) intermediate domain generator with curriculum instance-level adaptation which bridges the gap across source and target domains, and (3) task classifier trained on the intermediate domain for final sentiment classification. C-CycleGAN transfers source samples at instance-level to an intermediate domain that is closer to the target domain with sentiment semantics preserved and without losing discriminative features. Further, our dynamic instance-level weighting mechanisms can assign the optimal weights to different source samples in each training stage. We conduct extensive experiments on three benchmark datasets and achieve substantial gains over state-of-the-art DA approaches. Our source code is released at: https://github.com/WArushrush/Curriculum-CycleGAN.

下载PDF全文

下载文献需遵守相关版权规定

论文标题