论文标题

Wikimulti:跨语性摘要的语料库

WikiMulti: a Corpus for Cross-Lingual Summarization

论文作者

Tikhonov, Pavel, Malykh, Valentin

论文摘要

跨语性摘要(CLS)是用一种特定语言以不同语言的源文档制作摘要的任务。我们介绍了Wikimulti-基于15种语言的Wikipedia文章,用于跨语义摘要的新数据集。作为一组进一步研究的基准,我们评估了数据集上现有的跨语性抽象摘要方法的性能。我们在此处公开提供数据集:https://github.com/tikhonovpavel/wikimulti

Cross-lingual summarization (CLS) is the task to produce a summary in one particular language for a source document in a different language. We introduce WikiMulti - a new dataset for cross-lingual summarization based on Wikipedia articles in 15 languages. As a set of baselines for further studies, we evaluate the performance of existing cross-lingual abstractive summarization methods on our dataset. We make our dataset publicly available here: https://github.com/tikhonovpavel/wikimulti

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源