论文标题

Pynsett:可编程关系提取器

Pynsett: A programmable relation extractor

论文作者

Cetoli, Alberto

论文摘要

本文通过将文本解析为语义图,建议对英语的可编程关系提取方法。一个人可以用简单的英语定义规则,以将图案作为图表表示。这些规则旨在捕获文档的语义内容,从而具有灵活性和临时实体。关系提取是一项复杂的任务,通常需要相当大的培训语料库。此处提出的方法是在有限的文档集合中提取专业本体论的理想选择。

This paper proposes a programmable relation extraction method for the English language by parsing texts into semantic graphs. A person can define rules in plain English that act as matching patterns onto the graph representation. These rules are designed to capture the semantic content of the documents, allowing for flexibility and ad-hoc entities. Relation extraction is a complex task that typically requires sizable training corpora. The method proposed here is ideal for extracting specialized ontologies in a limited collection of documents.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源