论文标题
探索PATH选择在游戏理论归因算法中的影响
An exploration of the influence of path choice in game-theoretic attribution algorithms
论文作者
论文摘要
我们比较了基于原子理论(Shapley,1953年)和InfitInitesimal(Aumann and Shapley,1974)游戏的机器学习方法,这是在理论和实验研究中对模型和整合路径的选择如何影响所得特征属性的。要深入了解介入的沙普利价值(Sundararajan and Najmi,2019; Janzing等,2019; Chen等,2019)和广义综合梯度(GIG)(Merrill等人,2019年),我们注意到Insperientional Shapley等于$ n!应用Stoke的定理,我们表明,当模型由单个特征的可分离函数和两种功能产品的总和组成时,这两种方法的路径对称性会导致相同的归因。然后,我们执行一系列具有不同程度的数据丢失性的实验,以证明介入的沙普利多路径方法如何比Aumann-Shapley的单个直线路径产生的一致归因更少。我们认为这是因为介入的沙普利使用的多个途径远离训练数据歧管,因此更有可能通过模型几乎没有支持的区域。因此,在没有更有意义的路径选择的情况下,我们提倡直线路径,因为它几乎总是会更接近数据歧管。在直线路径归因算法中,演出非常强大,因为它仍然会产生由决策树建模的原子游戏的莎普利值。
We compare machine learning explainability methods based on the theory of atomic (Shapley, 1953) and infinitesimal (Aumann and Shapley, 1974) games, in a theoretical and experimental investigation into how the model and choice of integration path can influence the resulting feature attributions. To gain insight into differences in attributions resulting from interventional Shapley values (Sundararajan and Najmi, 2019; Janzing et al., 2019; Chen et al., 2019) and Generalized Integrated Gradients (GIG) (Merrill et al., 2019) we note interventional Shapley is equivalent to a multi-path integration along $n!$ paths where $n$ is the number of model input features. Applying Stoke's theorem we show that the path symmetry of these two methods results in the same attributions when the model is composed of a sum of separable functions of individual features and a sum of two-feature products. We then perform a series of experiments with varying degrees of data missingness to demonstrate how interventional Shapley's multi-path approach can yield less consistent attributions than the single straight-line path of Aumann-Shapley. We argue this is because the multiple paths employed by interventional Shapley extend away from the training data manifold and are therefore more likely to pass through regions where the model has little support. In the absence of a more meaningful path choice, we therefore advocate the straight-line path since it will almost always pass closer to the data manifold. Among straight-line path attribution algorithms, GIG is uniquely robust since it will still yield Shapley values for atomic games modeled by decision trees.