上下文在哪里？ - 对最近对话数据集的批评

论文标题

上下文在哪里？ - 对最近对话数据集的批评

Where is the context? -- A critique of recent dialogue datasets

论文作者

Mosig, Johannes E. M., Vlasov, Vladimir, Nichol, Alan

论文摘要

Multiwoz 2.1和TaskMaster-1（Taskmaster-1）等最新的对话数据集构成了当今对话模型的一些最具挑战性的任务，因此广泛用于系统评估。我们确定了上述数据集的几个问题，例如历史独立性，强大的知识基础依赖性和模棱两可的系统响应。最后，我们概述了未来数据集的关键Desiderata，我们认为这将更适合于对话人工智能的构建。

Recent dialogue datasets like MultiWOZ 2.1 and Taskmaster-1 constitute some of the most challenging tasks for present-day dialogue models and, therefore, are widely used for system evaluation. We identify several issues with the above-mentioned datasets, such as history independence, strong knowledge base dependence, and ambiguous system responses. Finally, we outline key desiderata for future datasets that we believe would be more suitable for the construction of conversational artificial intelligence.

下载PDF全文

下载文献需遵守相关版权规定

论文标题