论文标题
Wikimulti:跨语性摘要的语料库
WikiMulti: a Corpus for Cross-Lingual Summarization
论文作者
论文摘要
跨语性摘要(CLS)是用一种特定语言以不同语言的源文档制作摘要的任务。我们介绍了Wikimulti-基于15种语言的Wikipedia文章,用于跨语义摘要的新数据集。作为一组进一步研究的基准,我们评估了数据集上现有的跨语性抽象摘要方法的性能。我们在此处公开提供数据集:https://github.com/tikhonovpavel/wikimulti
Cross-lingual summarization (CLS) is the task to produce a summary in one particular language for a source document in a different language. We introduce WikiMulti - a new dataset for cross-lingual summarization based on Wikipedia articles in 15 languages. As a set of baselines for further studies, we evaluate the performance of existing cross-lingual abstractive summarization methods on our dataset. We make our dataset publicly available here: https://github.com/tikhonovpavel/wikimulti