One Explanation to Rule them All -- Ensemble Consistent Explanations

Paper title

One Explanation to Rule them All -- Ensemble Consistent Explanations

Authors

André Artelt, Stelios Vrachimis, Demetrios Eliades, Marios Polycarpou, Barbara Hammer

Abstract

Transparency is a major requirement of modern AI-based decision-making systems deployed in the real world. A popular approach to achieving transparency is by means of explanations, and a wide variety of explanations have been proposed for single decision-making systems. In practice, particularly in complex systems, a set (i.e. an ensemble) of decisions is often used instead of a single decision. Unfortunately, explanation methods for single decision-making systems are not easily applicable to ensembles: they would yield an ensemble of individual explanations that are not necessarily consistent, and hence less useful and more difficult to understand than a single consistent explanation of all observed phenomena. We propose a novel concept for consistently explaining an ensemble of decisions locally with a single explanation. We introduce a formal concept as well as a specific implementation using counterfactual explanations.
