SRQA：FACTOID问题回答的合成阅读器

论文标题

SRQA：FACTOID问题回答的合成阅读器

SRQA: Synthetic Reader for Factoid Question Answering

论文作者

Wang, Jiuniu, Xu, Wenjia, Fu, Xingyu, Wei, Yang, Jin, Li, Chen, Ziyan, Xu, Guangluan, Wu, Yirong

论文摘要

问题回答系统可以回答具有深层神经网络的各个领域和形式的问题，但是在面对多种证据时，它仍然缺乏有效的方式。我们介绍了一个名为SRQA的新模型，这意味着合成的读取器用于FACTOID问题回答。该模型从三个方面的多文章方案中增强了答案系统：模型结构，优化目标和训练方法，对应于多层关注（MA），交叉证据（CE）和对抗性培训（AT）。首先，我们提出了一个多层注意网络，以更好地表示证据。多层注意机制在每个层中的问题与通过之间进行相互作用，使每个层中证据的令牌表示都考虑到了问题的要求。其次，我们设计了一个交叉证据策略，以在更多证据中选择答案跨度。我们改善了优化目标，将所有答案的位置视为培训目标，这使该模型在多个证据之间进行推理。第三，除了我们的模型中的嵌入一词外，对对抗训练还用于高级变量。还提出了一种新的归一化方法，用于对抗扰动，以便我们可以将扰动共同添加到几个目标变量。作为一种有效的正则化方法，对抗训练增强了模型处理嘈杂数据的能力。结合了这三种策略，我们增强了我们的模型的上下文表示和定位能力，这可以合成从几种证据中提取答案跨度。我们在WebQA数据集上执行SRQA，实验表明我们的模型的表现优于最先进的模型（我们模型的最佳模糊分数高达78.56％，提高了约2％）。

The question answering system can answer questions from various fields and forms with deep neural networks, but it still lacks effective ways when facing multiple evidences. We introduce a new model called SRQA, which means Synthetic Reader for Factoid Question Answering. This model enhances the question answering system in the multi-document scenario from three aspects: model structure, optimization goal, and training method, corresponding to Multilayer Attention (MA), Cross Evidence (CE), and Adversarial Training (AT) respectively. First, we propose a multilayer attention network to obtain a better representation of the evidences. The multilayer attention mechanism conducts interaction between the question and the passage within each layer, making the token representation of evidences in each layer takes the requirement of the question into account. Second, we design a cross evidence strategy to choose the answer span within more evidences. We improve the optimization goal, considering all the answers' locations in multiple evidences as training targets, which leads the model to reason among multiple evidences. Third, adversarial training is employed to high-level variables besides the word embedding in our model. A new normalization method is also proposed for adversarial perturbations so that we can jointly add perturbations to several target variables. As an effective regularization method, adversarial training enhances the model's ability to process noisy data. Combining these three strategies, we enhance the contextual representation and locating ability of our model, which could synthetically extract the answer span from several evidences. We perform SRQA on the WebQA dataset, and experiments show that our model outperforms the state-of-the-art models (the best fuzzy score of our model is up to 78.56%, with an improvement of about 2%).

下载PDF全文

下载文献需遵守相关版权规定

论文标题