AMR解析的图形分割和对齐的可区分放松

论文标题

AMR解析的图形分割和对齐的可区分放松

A Differentiable Relaxation of Graph Segmentation and Alignment for AMR Parsing

论文作者

Lyu, Chunchuan, Cohen, Shay B., Titov, Ivan

论文摘要

抽象含义表示（AMR）是一种宽覆盖的语义形式主义，代表句子含义为有向的无环图。要训练大多数AMR解析器，需要将图形分割为子图，并将每个这样的子图与句子中的一个单词对齐；这通常是在预处理中依靠手工制作的规则进行的。相反，我们将对齐和分割视为模型中的潜在变量，并将其作为端到端培训的一部分。由于对结构化的潜在变量的边缘化是不可行的，因此我们使用变异自动编码框架。为了确保端到端可区分优化，我们引入了分割和对齐问题的可区分放松。我们观察到，使用“贪婪”分割启发式启发式，诱导分割会产生可观的收益。我们方法的性能还涉及一个依赖\ citet {lyu-titov-2018-amr}的分割规则的模型，该模型是手工制作以处理单个AMR构造的。

Abstract Meaning Representations (AMR) are a broad-coverage semantic formalism which represents sentence meaning as a directed acyclic graph. To train most AMR parsers, one needs to segment the graph into subgraphs and align each such subgraph to a word in a sentence; this is normally done at preprocessing, relying on hand-crafted rules. In contrast, we treat both alignment and segmentation as latent variables in our model and induce them as part of end-to-end training. As marginalizing over the structured latent variables is infeasible, we use the variational autoencoding framework. To ensure end-to-end differentiable optimization, we introduce a differentiable relaxation of the segmentation and alignment problems. We observe that inducing segmentation yields substantial gains over using a `greedy' segmentation heuristic. The performance of our method also approaches that of a model that relies on the segmentation rules of \citet{lyu-titov-2018-amr}, which were hand-crafted to handle individual AMR constructions.

下载PDF全文

下载文献需遵守相关版权规定

论文标题