论文标题
建模多跳问答作为单序列预测
Modeling Multi-hop Question Answering as Single Sequence Prediction
论文作者
论文摘要
Fusion-In-In-decoder(FID)(Izacard和Grave,2020)是一种生成的问题回答(QA)模型,它利用预先训练的变压器来实现通道检索,并在单跳QA上推动了最新的状态。但是,多跳质量检查的复杂性阻碍了生成质量质量检查方法的有效性。在这项工作中,我们提出了一种简单的生成方法(Pathfid),该方法通过明确建模推理过程来解决多跳问题的答案,从而将任务扩展到仅仅是回答生成之外。通过线性化支撑段落的层次结构推理路径,其关键句子以及最终的Factoid答案,我们将问题作为单个序列预测任务抛弃。为了促进多个线索的复杂推理,我们通过编码交叉通用相互作用进一步扩展了多个输入文档的统一平面表示。我们的广泛实验表明,途径填充了两个多跳QA数据集上的性能提高:HotPotQA和IIRC。除了表现增长外,帕菲德(Pathfid)也更容易解释,这反过来产生了与基线FID模型相比,更忠实地基于支持段落和事实的答案。
Fusion-in-decoder (Fid) (Izacard and Grave, 2020) is a generative question answering (QA) model that leverages passage retrieval with a pre-trained transformer and pushed the state of the art on single-hop QA. However, the complexity of multi-hop QA hinders the effectiveness of the generative QA approach. In this work, we propose a simple generative approach (PathFid) that extends the task beyond just answer generation by explicitly modeling the reasoning process to resolve the answer for multi-hop questions. By linearizing the hierarchical reasoning path of supporting passages, their key sentences, and finally the factoid answer, we cast the problem as a single sequence prediction task. To facilitate complex reasoning with multiple clues, we further extend the unified flat representation of multiple input documents by encoding cross-passage interactions. Our extensive experiments demonstrate that PathFid leads to strong performance gains on two multi-hop QA datasets: HotpotQA and IIRC. Besides the performance gains, PathFid is more interpretable, which in turn yields answers that are more faithfully grounded to the supporting passages and facts compared to the baseline Fid model.