聊天机器人与人工智能的互动：与文本分类的T5和语言变压器合奏的人类数据增强

论文标题

聊天机器人与人工智能的互动：与文本分类的T5和语言变压器合奏的人类数据增强

Chatbot Interaction with Artificial Intelligence: Human Data Augmentation with T5 and Language Transformer Ensemble for Text Classification

论文作者

Bird, Jordan J., Ekárt, Anikó, Faria, Diego R.

论文摘要

在这项工作中，我们将聊天机器人与人工智能（CI-AI）框架介绍，以此作为培训深度学习聊天机器人进行任务分类的方法。智能系统通过人工释义增强了人为的数据，以生成大量的培训数据，以进一步的经典，注意力和基于语言的学习方法进行自然语言处理。要求人类对任务识别的命令和问题进行解释，以进一步执行机器。命令和问题分为培训和验证集。总共记录了483个响应。其次，训练集由T5模型解释，以便使用进一步的数据进行增强。七种基于最先进的变压器的文本分类算法（Bert，Distilbert，Roberta，Distilroberta，XLM，XLM-Roberta和XLNet）在对两个时期的训练数据进行微调后，都针对这两组均进行了基准测试。我们发现，当T5模型增强培训数据时，所有模型均得到改善，平均分类准确性提高了4.01％。最好的结果是对T5增强数据进行培训的罗伯塔模型，该数据达到了98.96％的分类精度。最后，我们发现，通过输出标签预测的逻辑回归，五个表现最佳的变压器模型的合奏导致人类响应数据集的准确度为99.59％。一个高度表现的模型使智能系统可以通过类似聊天机器人的界面来解释社交交互级别的人类命令（例如，“机器人，我们可以进行对话吗？”），并允许非技术用户更好地对AI访问AI。

In this work, we present the Chatbot Interaction with Artificial Intelligence (CI-AI) framework as an approach to the training of deep learning chatbots for task classification. The intelligent system augments human-sourced data via artificial paraphrasing in order to generate a large set of training data for further classical, attention, and language transformation-based learning approaches for Natural Language Processing. Human beings are asked to paraphrase commands and questions for task identification for further execution of a machine. The commands and questions are split into training and validation sets. A total of 483 responses were recorded. Secondly, the training set is paraphrased by the T5 model in order to augment it with further data. Seven state-of-the-art transformer-based text classification algorithms (BERT, DistilBERT, RoBERTa, DistilRoBERTa, XLM, XLM-RoBERTa, and XLNet) are benchmarked for both sets after fine-tuning on the training data for two epochs. We find that all models are improved when training data is augmented by the T5 model, with an average increase of classification accuracy by 4.01%. The best result was the RoBERTa model trained on T5 augmented data which achieved 98.96% classification accuracy. Finally, we found that an ensemble of the five best-performing transformer models via Logistic Regression of output label predictions led to an accuracy of 99.59% on the dataset of human responses. A highly-performing model allows the intelligent system to interpret human commands at the social-interaction level through a chatbot-like interface (e.g. "Robot, can we have a conversation?") and allows for better accessibility to AI by non-technical users.

下载PDF全文

下载文献需遵守相关版权规定

论文标题