信用句子决策图中的可行推断

论文标题

信用句子决策图中的可行推断

Tractable Inference in Credal Sentential Decision Diagrams

论文作者

Mattei, Lilith, Antonucci, Alessandro, Mauá, Denis Deratani, Facchini, Alessandro, Llerena, Julissa Villanueva

论文摘要

概率句子决策图是逻辑电路，其中概率值注释了分离门的输入。它们允许在布尔变量集上定义的关节概率质量函数的紧凑表示，这也与电路定义的逻辑约束一致。这种模型中的概率通常是从一组观测值中学到的。当数据稀缺，不可靠或冲突时，这会导致过度自信和先前的依赖性推论。在这项工作中，我们开发了信用句子决策图，即其概率对应物的概括，允许用（所谓的信用）质量函数集替换本地概率。这些模型诱导了一组布尔变量集合的联合信用集，该集合将概率零分为与逻辑约束不一致的状态。这些模型得出了三种推论算法，这些算法可以计算：（i）观察到任意数量变量的下部和上层概率；（ii）给定观察结果的单个变量状态的下部和上部条件概率；（iii）是否所有与信用规范兼容的概率句子决策图是否具有对给定变量的一组变量的最可能的解释。这些推论是可以解决的，因为所有三种算法，基于自下而上的遍历，并在分离门上具有局部线性编程任务，都可以在多项式时间内相对于电路尺寸解决。对于第一个经验验证，我们考虑了一个基于嘈杂的七个段显示图像的简单应用程序。观察到信用模型可以正确区分简单和难以检测的实例，并且超过其他生成模型无法应对逻辑约束。

Probabilistic sentential decision diagrams are logic circuits where the inputs of disjunctive gates are annotated by probability values. They allow for a compact representation of joint probability mass functions defined over sets of Boolean variables, that are also consistent with the logical constraints defined by the circuit. The probabilities in such a model are usually learned from a set of observations. This leads to overconfident and prior-dependent inferences when data are scarce, unreliable or conflicting. In this work, we develop the credal sentential decision diagrams, a generalisation of their probabilistic counterpart that allows for replacing the local probabilities with (so-called credal) sets of mass functions. These models induce a joint credal set over the set of Boolean variables, that sharply assigns probability zero to states inconsistent with the logical constraints. Three inference algorithms are derived for these models, these allow to compute: (i) the lower and upper probabilities of an observation for an arbitrary number of variables; (ii) the lower and upper conditional probabilities for the state of a single variable given an observation; (iii) whether or not all the probabilistic sentential decision diagrams compatible with the credal specification have the same most probable explanation of a given set of variables given an observation of the other variables. These inferences are tractable, as all the three algorithms, based on bottom-up traversal with local linear programming tasks on the disjunctive gates, can be solved in polynomial time with respect to the circuit size. For a first empirical validation, we consider a simple application based on noisy seven-segment display images. The credal models are observed to properly distinguish between easy and hard-to-detect instances and outperform other generative models not able to cope with logical constraints.

下载PDF全文

下载文献需遵守相关版权规定

论文标题