论文标题
机器学习软件系统中的质量问题
Quality issues in Machine Learning Software Systems
论文作者
论文摘要
上下文:在各个领域都观察到需求不断增加,以利用机器学习(ML)解决复杂问题。 ML模型作为软件组件实现,并部署在机器学习软件系统(MLSS)中。问题:非常需要确保MLSS的服务质量。此类系统的错误决定或不良决定会导致其他系统的故障,重大财务损失甚至对人类生命的威胁。 MLSSS的质量保证被认为是一项具有挑战性的任务,目前是一个热门研究主题。此外,重要的是要涵盖MLSS质量的所有各个方面。目的:本文旨在从从业者的角度研究MLSS中实际质量问题的特征。这项实证研究旨在确定与MLSS质量差有关的坏实践目录。方法:我们计划对从业人员/专家进行一系列访谈,认为访谈是在处理质量问题时检索其经验和实践的最佳方法。我们希望在此步骤中开发的问题目录还将帮助我们以后确定MLSS质量问题的严重性,根本原因以及可能的补救措施,从而使我们能够为ML模型和MLSS开发有效的质量保证工具。
Context: An increasing demand is observed in various domains to employ Machine Learning (ML) for solving complex problems. ML models are implemented as software components and deployed in Machine Learning Software Systems (MLSSs). Problem: There is a strong need for ensuring the serving quality of MLSSs. False or poor decisions of such systems can lead to malfunction of other systems, significant financial losses, or even threat to human life. The quality assurance of MLSSs is considered as a challenging task and currently is a hot research topic. Moreover, it is important to cover all various aspects of the quality in MLSSs. Objective: This paper aims to investigate the characteristics of real quality issues in MLSSs from the viewpoint of practitioners. This empirical study aims to identify a catalog of bad-practices related to poor quality in MLSSs. Method: We plan to conduct a set of interviews with practitioners/experts, believing that interviews are the best method to retrieve their experience and practices when dealing with quality issues. We expect that the catalog of issues developed at this step will also help us later to identify the severity, root causes, and possible remedy for quality issues of MLSSs, allowing us to develop efficient quality assurance tools for ML models and MLSSs.