无监督和可解释的域名适应紧急服务的快速过滤推文

论文标题

无监督和可解释的域名适应紧急服务的快速过滤推文

Unsupervised and Interpretable Domain Adaptation to Rapidly Filter Tweets for Emergency Services

论文作者

Krishnan, Jitin, Purohit, Hemant, Rangwala, Huzefa

论文摘要

在灾难事件发生期间，由于其在正在进行的危机的数据集中，从社交网络数据中过滤相关信息的实用性很大。在本文中，我们假设通过多任务学习的无监督域适应可能是一个有用的框架，可以利用过去危机事件中的数据来在新危机突然发作期间训练有效的信息过滤模型。我们提出了一种新的方法，可以使用TREC事件流的公开数据集对正在进行的危机期间进行相关推文进行分类。具体而言，我们构建了一个定制的多任务架构，该体系结构具有危机分析的多域歧视器：多任务域对抗性注意网络。该模型由每个任务的专用注意层组成，以提供模型的解释性。对现实词的应用至关重要。随着深层网络在稀疏数据集的困扰中，我们表明可以通过共享用于多任务学习和域对抗培训的基础层来改进这一点。通过选择目标事件作为测试集和其余的培训来评估危机事件的域适应性。我们的结果表明，多任务模型的表现优于其单个任务。为了对可解释性的定性评估，我们表明，通过在分类过程中展示一条认为很重要的推文中的单词，可以将注意力层用作解释模型预测并赋予紧急服务以探索模型问责的指南。最后，我们通过为COVID-19大流行提供了用例来展示我们的工作的实际含义。

During the onset of a disaster event, filtering relevant information from the social web data is challenging due to its sparse availability and practical limitations in labeling datasets of an ongoing crisis. In this paper, we hypothesize that unsupervised domain adaptation through multi-task learning can be a useful framework to leverage data from past crisis events for training efficient information filtering models during the sudden onset of a new crisis. We present a novel method to classify relevant tweets during an ongoing crisis without seeing any new examples, using the publicly available dataset of TREC incident streams. Specifically, we construct a customized multi-task architecture with a multi-domain discriminator for crisis analytics: multi-task domain adversarial attention network. This model consists of dedicated attention layers for each task to provide model interpretability; critical for real-word applications. As deep networks struggle with sparse datasets, we show that this can be improved by sharing a base layer for multi-task learning and domain adversarial training. Evaluation of domain adaptation for crisis events is performed by choosing a target event as the test set and training on the rest. Our results show that the multi-task model outperformed its single task counterpart. For the qualitative evaluation of interpretability, we show that the attention layer can be used as a guide to explain the model predictions and empower emergency services for exploring accountability of the model, by showcasing the words in a tweet that are deemed important in the classification process. Finally, we show a practical implication of our work by providing a use-case for the COVID-19 pandemic.

下载PDF全文

下载文献需遵守相关版权规定

论文标题