论文标题

MROBUST04:TREC稳健2004基准的多语言版本

mRobust04: A Multilingual Version of the TREC Robust 2004 Benchmark

论文作者

Jeronymo, Vitor, Nascimento, Mauricio, Lotufo, Roberto, Nogueira, Rodrigo

论文摘要

Robust 2004是一个信息检索基准,其每个查询的大量判断使其成为可靠的评估数据集。在本文中,我们介绍了Mrobust04,这是一种多语言版本的robust04,使用Google Translate翻译为8种语言。我们还提供了该数据集上三个不同多语言检索器的结果。该数据集可从https://huggingface.co/datasets/unicamp-dl/mrobust获得

Robust 2004 is an information retrieval benchmark whose large number of judgments per query make it a reliable evaluation dataset. In this paper, we present mRobust04, a multilingual version of Robust04 that was translated to 8 languages using Google Translate. We also provide results of three different multilingual retrievers on this dataset. The dataset is available at https://huggingface.co/datasets/unicamp-dl/mrobust

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源