论文标题
一个统治所有人的解释 - 整体一致的解释
One Explanation to Rule them All -- Ensemble Consistent Explanations
论文作者
论文摘要
透明度是现代基于AI的决策系统的主要要求。实现透明度的一种流行方法是通过解释。为单个决策系统提出了各种各样的解释。在实践中,通常情况下,有一个(即合奏)的决策集合(即合奏),而不是仅在复杂系统中,而不是单个决策。不幸的是,单个决策系统的解释方法不容易适用于合奏 - 即,与对所有观察到的现象的单个一致的解释相比,它们不一定是一致的,不一定是一致的,因此有用,更难以理解。我们提出了一个新颖的概念,以始终如一地解释本地的一系列决策,并通过单一的解释 - 我们介绍了一个正式的概念,以及使用反事实解释的特定实现。
Transparency is a major requirement of modern AI based decision making systems deployed in real world. A popular approach for achieving transparency is by means of explanations. A wide variety of different explanations have been proposed for single decision making systems. In practice it is often the case to have a set (i.e. ensemble) of decisions that are used instead of a single decision only, in particular in complex systems. Unfortunately, explanation methods for single decision making systems are not easily applicable to ensembles -- i.e. they would yield an ensemble of individual explanations which are not necessarily consistent, hence less useful and more difficult to understand than a single consistent explanation of all observed phenomena. We propose a novel concept for consistently explaining an ensemble of decisions locally with a single explanation -- we introduce a formal concept, as well as a specific implementation using counterfactual explanations.