论文标题

通用依赖关系v2:一个不断发展的多语言树库集合

Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection

论文作者

Nivre, Joakim, de Marneffe, Marie-Catherine, Ginter, Filip, Hajič, Jan, Manning, Christopher D., Pyysalo, Sampo, Schuster, Sebastian, Tyers, Francis, Zeman, Daniel

论文摘要

通用依赖性是在基于依赖关系的词汇框架内为许多语言创建跨语言一致的树库注释的开放社区努力。注释是由语言动机的单词细分组成的;形态学层,包括引理,通用的言论部分和标准化的形态特征;以及句法层,重点是谓词,参数和修饰符之间的句法关系。在本文中,我们描述了指南的版本2(UD V2),讨论从UD V1到UD V2的主要更改,并概述了目前可用于90种语言的Treebanks。

Universal Dependencies is an open community effort to create cross-linguistically consistent treebank annotation for many languages within a dependency-based lexicalist framework. The annotation consists in a linguistically motivated word segmentation; a morphological layer comprising lemmas, universal part-of-speech tags, and standardized morphological features; and a syntactic layer focusing on syntactic relations between predicates, arguments and modifiers. In this paper, we describe version 2 of the guidelines (UD v2), discuss the major changes from UD v1 to UD v2, and give an overview of the currently available treebanks for 90 languages.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源