论文标题

人类人类客户支持口语对话中的实时呼叫者意图检测

Real-time Caller Intent Detection In Human-Human Customer Support Spoken Conversations

论文作者

Rawat, Mrinal, Barres, Victor

论文摘要

在人类人类客户支持口语互动期间的代理协助需要根据呼叫者的意图触发工作流(通话的原因)。预测的及时性对于良好的用户体验至关重要。目的是使系统在代理商能够检测到它时检测呼叫者的意图(意图边界)。一些方法着重于预测离线输出,即,一旦ASR系统处理了完整的口语输入(例如,整个对话转弯)。每当在转弯中早些时候可以检测到意图时,这会引入预测中的不良延迟。关于语音助手的最新工作已在逐字级别使用增量的实时预测来检测命令结束之前的意图。但是,人指导和机器指导的语音具有非常不同的特征。在这项工作中,我们建议将一种在语音辅助方面开发的方法应用于在线实时呼叫者在人类口语互动中的意图检测问题。我们使用双重体系结构,其中两个LSTM共同训练:一个预测意图边界(IB),然后预测IB处的意图类别。我们在私人数据集上进行实验,其中包括来自电信客户支持域的人类电话交谈的成绩单。我们报告结果分析了系统的准确性以及不同体系结构对整体准确性和预测潜伏期之间的权衡的影响。

Agent assistance during human-human customer support spoken interactions requires triggering workflows based on the caller's intent (reason for call). Timeliness of prediction is essential for a good user experience. The goal is for a system to detect the caller's intent at the time the agent would have been able to detect it (Intent Boundary). Some approaches focus on predicting the output offline, i.e. once the full spoken input (e.g. the whole conversational turn) has been processed by the ASR system. This introduces an undesirable latency in the prediction each time the intent could have been detected earlier in the turn. Recent work on voice assistants has used incremental real-time predictions at a word-by-word level to detect intent before the end of a command. Human-directed and machine-directed speech however have very different characteristics. In this work, we propose to apply a method developed in the context of voice-assistant to the problem of online real time caller's intent detection in human-human spoken interactions. We use a dual architecture in which two LSTMs are jointly trained: one predicting the Intent Boundary (IB) and then other predicting the intent class at the IB. We conduct our experiments on our private dataset comprising transcripts of human-human telephone conversations from the telecom customer support domain. We report results analyzing both the accuracy of our system as well as the impact of different architectures on the trade off between overall accuracy and prediction latency.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源