时间序列分类中事后可解释性方法的评估

论文标题

时间序列分类中事后可解释性方法的评估

Evaluation of post-hoc interpretability methods in time-series classification

论文作者

Turbé, Hugues, Bjelogrlic, Mina, Lovis, Christian, Mengaldo, Gianmarco

论文摘要

事后解释性方法是解释神经网络结果的关键工具。近年来已经出现了几种事后方法，但是当应用于给定的任务时，它们会产生不同的结果，从而提出了哪种方法最适合提供正确的事后解释性的问题。要了解每种方法的性能，对可解释性方法的定量评估至关重要。但是，目前可用的框架有几个缺点，这些缺点阻碍了事后解释性方法的采用，尤其是在高风险部门中。在这项工作中，我们提出了一个具有定量指标的框架，以评估现有的事后可解释性方法的性能，尤其是在时间序列分类中。我们表明，文献中确定的几个缺点是解决了人类判断，重新培训和遮挡样品时数据分布的转移的依赖。我们还设计一个具有已知判别特征和可调复杂性的合成数据集。提出的方法和定量指标可用于了解在实际应用中获得的可解释性方法的可靠性。反过来，它们可以嵌入到需要准确的可解释性结果的关键领域中的操作工作流中，例如监管策略。

Post-hoc interpretability methods are critical tools to explain neural-network results. Several post-hoc methods have emerged in recent years, but when applied to a given task, they produce different results, raising the question of which method is the most suitable to provide correct post-hoc interpretability. To understand the performance of each method, quantitative evaluation of interpretability methods is essential. However, currently available frameworks have several drawbacks which hinders the adoption of post-hoc interpretability methods, especially in high-risk sectors. In this work, we propose a framework with quantitative metrics to assess the performance of existing post-hoc interpretability methods in particular in time series classification. We show that several drawbacks identified in the literature are addressed, namely dependence on human judgement, retraining, and shift in the data distribution when occluding samples. We additionally design a synthetic dataset with known discriminative features and tunable complexity. The proposed methodology and quantitative metrics can be used to understand the reliability of interpretability methods results obtained in practical applications. In turn, they can be embedded within operational workflows in critical fields that require accurate interpretability results for e.g., regulatory policies.

下载PDF全文

下载文献需遵守相关版权规定

论文标题