Paper Title
PQuAD: A Persian Question Answering Dataset
Paper Authors
Paper Abstract
We present the Persian Question Answering Dataset (PQuAD), a crowdsourced reading comprehension dataset built on Persian Wikipedia articles. It includes 80,000 questions along with their answers, with 25% of the questions being adversarially unanswerable. We examine various properties of the dataset to show its diversity and its level of difficulty as a machine reading comprehension (MRC) benchmark. By releasing this dataset, we aim to ease research on Persian reading comprehension and the development of Persian question answering systems. Our experiments with different state-of-the-art pre-trained contextualized language models show 74.8% Exact Match (EM) and 87.6% F1-score, which can serve as baseline results for further research on Persian QA.
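The abstract reports Exact Match (EM) and F1 without defining them. As a rough illustration, the sketch below shows how these two metrics are commonly computed for SQuAD-style extractive QA, including the convention that an empty prediction is correct for an unanswerable question. It assumes that PQuAD follows this convention; the function names and the whitespace-based `normalize` helper are hypothetical, and the authors' actual evaluation may apply Persian-specific normalization that is not shown here.

```python
from collections import Counter


def normalize(text: str) -> list:
    # Simplified normalization (an assumption): lowercase, split on whitespace.
    # A real evaluation would likely also strip punctuation and apply
    # language-specific character normalization.
    return text.lower().split()


def exact_match(prediction: str, gold: str) -> float:
    # 1.0 if the normalized token sequences are identical, else 0.0.
    return float(normalize(prediction) == normalize(gold))


def f1_score(prediction: str, gold: str) -> float:
    pred_tokens = normalize(prediction)
    gold_tokens = normalize(gold)
    # Unanswerable case: the gold answer is empty, so the prediction is
    # correct only if it is empty as well.
    if not pred_tokens or not gold_tokens:
        return float(pred_tokens == gold_tokens)
    # Token overlap counted with multiplicity.
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)
```

Dataset-level scores are then typically the average of these per-question values, taking the best score over the available gold answers for each question.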