论文标题

紧张,方面和基于情绪的事件提取用于情况分析和危机管理

Tense, aspect and mood based event extraction for situation analysis and crisis management

论文作者

Hürriyetoğlu, Ali

论文摘要

如今,事件提取系统主要处理有关情况的时间和模态资格的相对较少的信息,主要在过去时处理自信句子。但是,具有更广泛的时态,方面和情绪范围的系统可以提供更好的分析,并且可以在更广泛的文本分析应用程序中使用。本文为土耳其语发展了这样的系统。这是通过扩展开源信息挖掘和分析(Optima)研究小组的事件提取软件来实现的,通过添加部分语法来实现适当的扩展,以提高TAM(时态,方面和情绪标记),副副词分析和Express的匹配功能,并在Corleone标准的corleone标准中构建适当的词典,从而改善TAM(时态,方面和情绪标记)。这些扩展基于IV的锚定关系理论(Temürcü,2007年,2011年),这是一个跨语言上适用的语义框架,用于分析时态,方面和情绪相关类别。结果是一个系统,除了提取基本事件结构外,还可以根据新闻报告中给出的句子,根据其时间,模态和自愿/iLcutionary值对句子进行分类。尽管重点是关于自然灾害,疾病爆发和人为灾难的新闻报道,但该方法可以适应其他语言,领域和流派。此事件提取和分类系统,随着进一步的发展,可以为预防环境和人道主义风险的自动浏览系统提供基础。

Nowadays event extraction systems mainly deal with a relatively small amount of information about temporal and modal qualifications of situations, primarily processing assertive sentences in the past tense. However, systems with a wider coverage of tense, aspect and mood can provide better analyses and can be used in a wider range of text analysis applications. This thesis develops such a system for Turkish language. This is accomplished by extending Open Source Information Mining and Analysis (OPTIMA) research group's event extraction software, by implementing appropriate extensions in the semantic representation format, by adding a partial grammar which improves the TAM (Tense, Aspect and Mood) marker, adverb analysis and matching functions of ExPRESS, and by constructing an appropriate lexicon in the standard of CORLEONE. These extensions are based on iv the theory of anchoring relations (Temürcü, 2007, 2011) which is a crosslinguistically applicable semantic framework for analyzing tense, aspect and mood related categories. The result is a system which can, in addition to extracting basic event structures, classify sentences given in news reports according to their temporal, modal and volitional/illocutionary values. Although the focus is on news reports of natural disasters, disease outbreaks and man-made disasters in Turkish language, the approach can be adapted to other languages, domains and genres. This event extraction and classification system, with further developments, can provide a basis for automated browsing systems for preventing environmental and humanitarian risk.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源