用于文本分类的新型BGCAPSUEN网络

论文标题

用于文本分类的新型BGCAPSUEN网络

A Novel BGCapsule Network for Text Classification

论文作者

Gangwar, Akhilesh Kumar, Ravi, Vadlamani

论文摘要

即使对于现代深度学习网络，几项文本分类任务，例如情感分析，新闻分类，多标签分类和意见分类也是挑战性问题。最近，提出了胶囊网络（CAPSNET）进行图像分类。已经表明，在卷积神经网络（CNN）上，瓶颈具有多个优势，而其在文本领域的有效性却较少。在本文中，我们提出了一种新型的混合体系结构，即bgcapsule，它是一个胶囊模型，该模型是多个文本分类任务的双向门控复发单元（BIGRU）的集合。我们采用了双向GRU的合奏，用于主要胶囊层之前的特征提取层。混合体系结构在执行了基本的预处理步骤后，由五层组成：基于手套的嵌入层，基于BigRu的集合层，主胶囊层，一个扁平的层和完全连接的relu层，然后是完全连接的软max层。为了评估BGCAPSUEL的有效性，我们对五个基准数据集进行了广泛的实验（从10,000个记录到700,000个记录），包括电影审查（MR IMDB 2005），AG News DataSet，DBPedia Ontology Dataset，Yelp Review Yelp Review full DataSet和Yelp审查Polary Polartity Polary Dataset。这些基准涵盖了几个文本分类任务，例如新闻分类，情感分析，多类分类，多标签分类和意见分类。我们发现，与现有方法相比，我们提出的架构（BGCAPSUEL）在没有任何外部语言知识（例如积极的情感关键字和负面情感关键字）的情况下实现了更好的准确性。此外，与其他现有技术相比，BGCAPSUE的收敛速度更快。

Several text classification tasks such as sentiment analysis, news categorization, multi-label classification and opinion classification are challenging problems even for modern deep learning networks. Recently, Capsule Networks (CapsNets) are proposed for image classification. It has been shown that CapsNets have several advantages over Convolutional Neural Networks (CNNs), while their validity in the domain of text has been less explored. In this paper, we propose a novel hybrid architecture viz., BGCapsule, which is a Capsule model preceded by an ensemble of Bidirectional Gated Recurrent Units (BiGRU) for several text classification tasks. We employed an ensemble of Bidirectional GRUs for feature extraction layer preceding the primary capsule layer. The hybrid architecture, after performing basic pre-processing steps, consists of five layers: an embedding layer based on GloVe, a BiGRU based ensemble layer, a primary capsule layer, a flatten layer and fully connected ReLU layer followed by a fully connected softmax layer. In order to evaluate the effectiveness of BGCapsule, we conducted extensive experiments on five benchmark datasets (ranging from 10,000 records to 700,000 records) including Movie Review (MR Imdb 2005), AG News dataset, Dbpedia ontology dataset, Yelp Review Full dataset and Yelp review polarity dataset. These benchmarks cover several text classification tasks such as news categorization, sentiment analysis, multiclass classification, multi-label classification and opinion classification. We found that our proposed architecture (BGCapsule) achieves better accuracy compared to the existing methods without the help of any external linguistic knowledge such as positive sentiment keywords and negative sentiment keywords. Further, BGCapsule converged faster compared to other extant techniques.

下载PDF全文

下载文献需遵守相关版权规定

论文标题