雾什么时候清除？关于医疗应用机器学习的解释性：一项调查

论文标题

雾什么时候清除？关于医疗应用机器学习的解释性：一项调查

When will the mist clear? On the Interpretability of Machine Learning for Medical Applications: a survey

论文作者

Banegas-Luna, Antonio-Jesús, Peña-García, Jorge, Iftene, Adrian, Guadagni, Fiorella, Ferroni, Patrizia, Scarpato, Noemi, Zanzotto, Fabio Massimo, Bueno-Crespo, Andrés, Pérez-Sánchez, Horacio

论文摘要

人工智能正在提供惊人的结果，而药物是其最喜欢的游乐场之一。在几十年中，计算机可能能够制定诊断并选择正确的治疗方法，而机器人可能会进行手术操作，并且会话剂可以与患者作为虚拟教练进行互动。机器学习，尤其是深层神经网络是这场革命的背后。在这种情况下，重要决策将由独立的机器控制，这些机器已经从提供的数据中学到了预测模型。在医学上最具挑战性的目标之一是癌症诊断和疗法，但是，为了开始这项革命，需要对软件工具进行调整以满足新的要求。从这个意义上讲，学习工具正成为Python和Matlab库中的商品，仅举两个，但是要利用所有可能性，必须充分了解模型的解释以及哪些模型比其他模型更容易解释。在这项调查中，我们分析了当前适用于医学的机器学习模型，框架，数据库和其他相关工具（特别是用于癌症研究），并讨论了它们的可解释性，性能和必要的输入数据。从可用的证据来看，ANN，LR和SVM已被观察到是首选模型。此外，在GPU和面向张量的编程库的快速发展的支持下，CNN的重要性已变得重要。但是，很少考虑医生对结果的解释性，这是需要改进的因素。因此，我们认为这项研究是对该问题的及时贡献。

Artificial Intelligence is providing astonishing results, with medicine being one of its favourite playgrounds. In a few decades, computers may be capable of formulating diagnoses and choosing the correct treatment, while robots may perform surgical operations, and conversational agents could interact with patients as virtual coaches. Machine Learning and, in particular, Deep Neural Networks are behind this revolution. In this scenario, important decisions will be controlled by standalone machines that have learned predictive models from provided data. Among the most challenging targets of interest in medicine are cancer diagnosis and therapies but, to start this revolution, software tools need to be adapted to cover the new requirements. In this sense, learning tools are becoming a commodity in Python and Matlab libraries, just to name two, but to exploit all their possibilities, it is essential to fully understand how models are interpreted and which models are more interpretable than others. In this survey, we analyse current machine learning models, frameworks, databases and other related tools as applied to medicine - specifically, to cancer research - and we discuss their interpretability, performance and the necessary input data. From the evidence available, ANN, LR and SVM have been observed to be the preferred models. Besides, CNNs, supported by the rapid development of GPUs and tensor-oriented programming libraries, are gaining in importance. However, the interpretability of results by doctors is rarely considered which is a factor that needs to be improved. We therefore consider this study to be a timely contribution to the issue.

下载PDF全文

下载文献需遵守相关版权规定

论文标题