神经实体链接：基于深度学习的模型调查

论文标题

神经实体链接：基于深度学习的模型调查

Neural Entity Linking: A Survey of Models Based on Deep Learning

论文作者

Sevgili, Ozge, Shelmanov, Artem, Arkhipov, Mikhail, Panchenko, Alexander, Biemann, Chris

论文摘要

这项调查介绍了自然语言处理中“深度学习革命”以来自2015年以来开发的最新神经实体（EL）系统的全面描述。它的目标是系统化神经实体链接系统的设计特征，并将其性能与普通基准的非凡经典方法进行比较。这项工作会提炼神经系统的通用体系结构，并讨论其组成部分，例如候选，提及封闭式编码和实体排名，总结了它们中的每一个的突出方法。这种一般体系结构的各种修改都由几个共同的主题进行分组：联合实体提及检测和歧义，全球链接的模型，独立于域独立的技术，包括零射击和远距离监督方法以及跨语言方法。由于许多神经模型利用实体和提及/上下文嵌入代表其含义，因此这项工作还概述了著名的实体嵌入技术。最后，调查涉及实体链接的应用，重点是最近出现的使用案例，以增强基于变压器体系结构的深度预训练的蒙版语言模型。

This survey presents a comprehensive description of recent neural entity linking (EL) systems developed since 2015 as a result of the "deep learning revolution" in natural language processing. Its goal is to systemize design features of neural entity linking systems and compare their performance to the remarkable classic methods on common benchmarks. This work distills a generic architecture of a neural EL system and discusses its components, such as candidate generation, mention-context encoding, and entity ranking, summarizing prominent methods for each of them. The vast variety of modifications of this general architecture are grouped by several common themes: joint entity mention detection and disambiguation, models for global linking, domain-independent techniques including zero-shot and distant supervision methods, and cross-lingual approaches. Since many neural models take advantage of entity and mention/context embeddings to represent their meaning, this work also overviews prominent entity embedding techniques. Finally, the survey touches on applications of entity linking, focusing on the recently emerged use-case of enhancing deep pre-trained masked language models based on the Transformer architecture.

下载PDF全文

下载文献需遵守相关版权规定

论文标题