论文标题
意大利个人著作的作者归因的数据集和模型
Datasets and Models for Authorship Attribution on Italian Personal Writings
论文作者
论文摘要
现有关于作者归因(AA)的研究重点关注的是有很多数据(例如小说),主要是英语。我们通过两个新型数据集中的意大利文本对AA进行AA,并分析类型,主题,性别和长度之间的相互作用。结果表明,即使数据很少,AV也是可行的,但是更多的证据有帮助。性别和主题可以是指示性线索,如果不控制,它们可能会超越个人风格的更具体的方面。
Existing research on Authorship Attribution (AA) focuses on texts for which a lot of data is available (e.g novels), mainly in English. We approach AA via Authorship Verification on short Italian texts in two novel datasets, and analyze the interaction between genre, topic, gender and length. Results show that AV is feasible even with little data, but more evidence helps. Gender and topic can be indicative clues, and if not controlled for, they might overtake more specific aspects of personal style.