论文标题
您的应用在哪里使用户沮丧?
Where is Your App Frustrating Users?
论文作者
论文摘要
移动应用程序的用户评论为开发人员感知用户满意度提供了通信渠道。用户遇到问题的许多应用程序功能通常由诸如“上传图片”之类的关键短语表示,这些短语可以埋葬在评论文本中。缺乏对问题功能的细粒度观点可能会掩盖开发人员对应用程序挫败用户的理解,并推迟应用程序的改进。由于对评论的语义理解不足,现有的基于模式的方法提取目标短语的方法较低,因此只能总结评论的高级主题/方面。本文提出了一种语义意识,细粒度的应用程序审查分析方法(SIRA),以提取,群集和可视化应用程序的问题特征。 SIRA的主要组成部分是一种新颖的BERT+ATTR-CRF模型,用于精细元素有问题的特征提取,该模型结合了文本描述和评论属性,以更好地模拟评论的语义并提高传统Bert-CRF模型的性能。 Sira还根据其语义关系将提取的短语簇起来,并显示了摘要的可视化。我们对来自六个应用程序的3,426个评论的评估证实了Sira在有问题的特征提取和聚类中的有效性。我们进一步对Sira进行了一项实证研究,对318,534次对18个流行应用程序的评论进行了评论,以探索其潜在的应用并研究其在现实世界实践中的有用性。
User reviews of mobile apps provide a communication channel for developers to perceive user satisfaction. Many app features that users have problems with are usually expressed by key phrases such as "upload pictures", which could be buried in the review texts. The lack of fine-grained view about problematic features could obscure the developers' understanding of where the app is frustrating users, and postpone the improvement of the apps. Existing pattern-based approaches to extract target phrases suffer from low accuracy due to insufficient semantic understanding of the reviews, thus can only summarize the high-level topics/aspects of the reviews. This paper proposes a semantic-aware, fine-grained app review analysis approach (SIRA) to extract, cluster, and visualize the problematic features of apps. The main component of SIRA is a novel BERT+Attr-CRF model for fine-grained problematic feature extraction, which combines textual descriptions and review attributes to better model the semantics of reviews and boost the performance of the traditional BERT-CRF model. SIRA also clusters the extracted phrases based on their semantic relations and presents a visualization of the summaries. Our evaluation on 3,426 reviews from six apps confirms the effectiveness of SIRA in problematic feature extraction and clustering. We further conduct an empirical study with SIRA on 318,534 reviews of 18 popular apps to explore its potential application and examine its usefulness in real-world practice.