论文标题

Web Tape提取,检索和增强:一项调查

Web Table Extraction, Retrieval and Augmentation: A Survey

论文作者

Zhang, Shuo, Balog, Krisztian

论文摘要

表是组织和操纵数据的强大而流行的工具。可以在网络上找到大量表,这代表了宝贵的知识资源。这项调查的目的是综合并介绍有关网络表的二十年研究。特别是,我们将现有文献组织成六个主要信息访问任务的主要类别:表提取,表解释,表搜索,问答,知识基础增强和表增强。对于这些任务中的每一个,我们都会识别和描述开创性方法,呈现相关资源,并指出不同任务之间的相互依赖性。

Tables are a powerful and popular tool for organizing and manipulating data. A vast number of tables can be found on the Web, which represents a valuable knowledge resource. The objective of this survey is to synthesize and present two decades of research on web tables. In particular, we organize existing literature into six main categories of information access tasks: table extraction, table interpretation, table search, question answering, knowledge base augmentation, and table augmentation. For each of these tasks, we identify and describe seminal approaches, present relevant resources, and point out interdependencies among the different tasks.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源