论文标题

来自普通话和英语的参考生产计算建模的经验教训

Lessons from Computational Modelling of Reference Production in Mandarin and English

论文作者

Chen, Guanyi, van Deemter, Kees

论文摘要

参考表达生成(REG)算法提供了参考表达式产生的计算模型。在较早的工作中,引入了普通话中的参考表达式(RES)。在本文中,我们注释该语料库,评估经典的Reg算法,并将结果与​​对英语转介表达式评估评估的较早结果进行比较。接下来,我们对语料库进行了深入的分析,重点介绍了由普通话语法引起的问题。我们讨论了在调查过程中揭示的先前REG评估的缺点,并重点介绍了一些令人惊讶的结果。也许最引人注目的是,我们发现比以前的研究所建议的不仅是普通话,而且在英语中也比以前的研究所建议的要高得多。

Referring expression generation (REG) algorithms offer computational models of the production of referring expressions. In earlier work, a corpus of referring expressions (REs) in Mandarin was introduced. In the present paper, we annotate this corpus, evaluate classic REG algorithms on it, and compare the results with earlier results on the evaluation of REG for English referring expressions. Next, we offer an in-depth analysis of the corpus, focusing on issues that arise from the grammar of Mandarin. We discuss shortcomings of previous REG evaluations that came to light during our investigation and we highlight some surprising results. Perhaps most strikingly, we found a much higher proportion of under-specified expressions than previous studies had suggested, not just in Mandarin but in English as well.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源