评估外观引擎

论文标题

评估外观引擎

Evaluating the Apperception Engine

论文作者

Evans, Richard, Hernandez-Orallo, Jose, Welbl, Johannes, Kohli, Pushmeet, Sergot, Marek

论文摘要

Acterception引擎是无监督的学习系统。给定一系列感觉输入，它构建了符号因果理论，既解释了感觉序列又满足一组统一条件。统一条件坚持认为该理论的成分 - 对象，属性和法律必须整合到一个连贯的整体中。一旦构建了理论，就可以应用它来预测未来的传感器读数，提前读数或估算缺失的读数。在本文中，我们在各种领域中评估了外观引擎，包括蜂窝自动机，节奏和简单的托儿所，多模式结合问题，遮挡任务和序列诱导智能测试。在每个域中，我们测试了发动机预测未来传感器值，回顾早期传感器值并估算缺失的感觉数据的能力。该发动机在所有这些领域都表现良好，极大地超过了神经网基线和最先进的归纳逻辑编程系统。这些结果很重要，因为神经网通常难以解决结合问题（其中必须以某种方式将来自不同方式的信息组合为一个统一对象的不同方面），并且无法求解遮挡任务（其中有时可见，有时从视图中掩盖了对象）。我们特别注意到，在序列诱导智能测试中，我们的系统实现了人类水平的性能。这是值得注意的，因为我们的系统不是专门用于解决智能测试的定制系统，而是旨在理解任何感觉序列的通用系统。

The Apperception Engine is an unsupervised learning system. Given a sequence of sensory inputs, it constructs a symbolic causal theory that both explains the sensory sequence and also satisfies a set of unity conditions. The unity conditions insist that the constituents of the theory - objects, properties, and laws - must be integrated into a coherent whole. Once a theory has been constructed, it can be applied to predict future sensor readings, retrodict earlier readings, or impute missing readings. In this paper, we evaluate the Apperception Engine in a diverse variety of domains, including cellular automata, rhythms and simple nursery tunes, multi-modal binding problems, occlusion tasks, and sequence induction intelligence tests. In each domain, we test our engine's ability to predict future sensor values, retrodict earlier sensor values, and impute missing sensory data. The engine performs well in all these domains, significantly outperforming neural net baselines and state of the art inductive logic programming systems. These results are significant because neural nets typically struggle to solve the binding problem (where information from different modalities must somehow be combined together into different aspects of one unified object) and fail to solve occlusion tasks (in which objects are sometimes visible and sometimes obscured from view). We note in particular that in the sequence induction intelligence tests, our system achieved human-level performance. This is notable because our system is not a bespoke system designed specifically to solve intelligence tests, but a general-purpose system that was designed to make sense of any sensory sequence.

下载PDF全文

下载文献需遵守相关版权规定

论文标题