论文标题

nuig-shubhanker@dravidian-codemix-fire2020:使用XLNET的代码混合Dravidian文本的情感分析

NUIG-Shubhanker@Dravidian-CodeMix-FIRE2020: Sentiment Analysis of Code-Mixed Dravidian text using XLNet

论文作者

Banerjee, Shubhanker, Jayapal, Arun, Thavareesan, Sajeetha

论文摘要

社交媒体已经渗透到多语言社会中,但是大多数人都使用英语作为交流的首选语言。因此,他们在对话中将他们的文化语言与英语混合在一起,从而使多种语言数据混合在一起看起来很自然,称呼此代码混音数据,如今的World.Downstream NLP任务在使用此类数据的downstream NLP任务中,由于其在多种语言中传播的语义性质很具有挑战性。 Malayalam-English数据集。

Social media has penetrated into multilingual societies, however most of them use English to be a preferred language for communication. So it looks natural for them to mix their cultural language with English during conversations resulting in abundance of multilingual data, call this code-mixed data, available in todays' world.Downstream NLP tasks using such data is challenging due to the semantic nature of it being spread across multiple languages.One such Natural Language Processing task is sentiment analysis, for this we use an auto-regressive XLNet model to perform sentiment analysis on code-mixed Tamil-English and Malayalam-English datasets.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源