论文标题

恢复:COVID-19新闻信誉研究的多模式存储库

ReCOVery: A Multimodal Repository for COVID-19 News Credibility Research

论文作者

Zhou, Xinyi, Mulay, Apurva, Ferrara, Emilio, Zafarani, Reza

论文摘要

2019年12月,Covid-19的爆发于2019年12月在中国首次确定为1月的全球紧急情况,世界卫生组织(WHO)于2020年3月被宣布为全球紧急情况。除了这个大流行外,我们还经历了信誉较低的信息,例如虚假新闻和阴谋。在这项工作中,我们提出了Recovery,这是一个设计和构建的存储库,旨在促进有关Covid-19的此类信息的研究。我们首先广泛搜索和调查了约2,000名新闻发布者,从中有60个具有极高的[高或低]信誉。通过继承其发表的媒体的信誉,从1月至2020年5月发表的有关冠状病毒的2,029篇新闻文章将在存储库中收集,以及140,820条推文,这些推文揭示了这些新闻文章如何在Twitter社交网络上传播。该存储库提供有关冠状病毒的新闻文章的多模式信息,包括文本,视觉,时间和网络信息。获得新闻信誉的方式允许数据集可伸缩性和标签准确性之间的权衡。进行了广泛的实验以介绍数据统计和分布,并提供了预测新闻信誉的基线性能,以便可以比较未来的方法。我们的存储库可从http://coronavirus-fakenews.com获得。

First identified in Wuhan, China, in December 2019, the outbreak of COVID-19 has been declared as a global emergency in January, and a pandemic in March 2020 by the World Health Organization (WHO). Along with this pandemic, we are also experiencing an "infodemic" of information with low credibility such as fake news and conspiracies. In this work, we present ReCOVery, a repository designed and constructed to facilitate research on combating such information regarding COVID-19. We first broadly search and investigate ~2,000 news publishers, from which 60 are identified with extreme [high or low] levels of credibility. By inheriting the credibility of the media on which they were published, a total of 2,029 news articles on coronavirus, published from January to May 2020, are collected in the repository, along with 140,820 tweets that reveal how these news articles have spread on the Twitter social network. The repository provides multimodal information of news articles on coronavirus, including textual, visual, temporal, and network information. The way that news credibility is obtained allows a trade-off between dataset scalability and label accuracy. Extensive experiments are conducted to present data statistics and distributions, as well as to provide baseline performances for predicting news credibility so that future methods can be compared. Our repository is available at http://coronavirus-fakenews.com.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源