用密度矩阵建模词汇歧义

论文标题

用密度矩阵建模词汇歧义

Modelling Lexical Ambiguity with Density Matrices

论文作者

Meyer, Francois, Lewis, Martha

论文摘要

单词可以具有多种感官。已经有人说，含义的组成分布模型很好地处理了被称为多义的含义变化的细性阴影，但并没有很好地处理词源无关或同义词的单词感官。从向量转移到密度矩阵使我们能够通过单词的不同感官编码概率分布，也可以在含义的组成分布模型中容纳。在本文中，我们介绍了三个新的神经模型，用于从语料库中学习密度矩阵，并测试它们在一系列组成数据集上歧视单词感官的能力。与特定组成方法配对时，我们的最佳模型优于现有的基于向量的构图模型以及强句编码器。

Words can have multiple senses. Compositional distributional models of meaning have been argued to deal well with finer shades of meaning variation known as polysemy, but are not so well equipped to handle word senses that are etymologically unrelated, or homonymy. Moving from vectors to density matrices allows us to encode a probability distribution over different senses of a word, and can also be accommodated within a compositional distributional model of meaning. In this paper we present three new neural models for learning density matrices from a corpus, and test their ability to discriminate between word senses on a range of compositional datasets. When paired with a particular composition method, our best model outperforms existing vector-based compositional models as well as strong sentence encoders.

下载PDF全文

下载文献需遵守相关版权规定

论文标题