论文标题
通用依赖关系v2:一个不断发展的多语言树库集合
Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection
论文作者
论文摘要
通用依赖性是在基于依赖关系的词汇框架内为许多语言创建跨语言一致的树库注释的开放社区努力。注释是由语言动机的单词细分组成的;形态学层,包括引理,通用的言论部分和标准化的形态特征;以及句法层,重点是谓词,参数和修饰符之间的句法关系。在本文中,我们描述了指南的版本2(UD V2),讨论从UD V1到UD V2的主要更改,并概述了目前可用于90种语言的Treebanks。
Universal Dependencies is an open community effort to create cross-linguistically consistent treebank annotation for many languages within a dependency-based lexicalist framework. The annotation consists in a linguistically motivated word segmentation; a morphological layer comprising lemmas, universal part-of-speech tags, and standardized morphological features; and a syntactic layer focusing on syntactic relations between predicates, arguments and modifiers. In this paper, we describe version 2 of the guidelines (UD v2), discuss the major changes from UD v1 to UD v2, and give an overview of the currently available treebanks for 90 languages.