论文标题

结合常识性推理和知识获取,以指导机器人技术深入学习

Combining Commonsense Reasoning and Knowledge Acquisition to Guide Deep Learning in Robotics

论文作者

Sridharan, Mohan, Mota, Tiago

论文摘要

基于深层网络模型的算法已用于机器人技术和AI中的许多模式识别和决策任务。培训这些模型需要大量标记的数据集和相当大的计算资源,这些数据集在许多域中不容易获得。同样,很难探索这些模型的内部表示和推理机制。作为解决基本知识表示,推理和学习挑战的一步,本文所描述的结构从认知系统的研究中汲取了灵感。作为一个激励的例子,我们考虑了一个辅助机器人,试图通过推理对象的遮挡和对象配置在场景图像中的稳定性来减少混乱。在这种情况下,我们的体系结构会逐步学习并修改对象之间的空间关系的基础,并使用此基础从输入图像中提取空间信息。使用此信息和不完整的常识域知识的非单调逻辑推理用于做出有关稳定性和遮挡的决定。对于无法通过这种推理处理的图像,与手头任务相关的区域被自动识别并用于训练深层网络模型以做出所需的决策。用于训练深网的图像区域还用于逐步获得以前未知的状态约束,这些状态与现有知识合并以进行后续推理。使用模拟和现实世界图像进行的实验评估表明,与基于深层网络的基线相比,我们的体系结构可提高决策的可靠性,并减少培训数据驱动的深层网络模型所涉及的精力。

Algorithms based on deep network models are being used for many pattern recognition and decision-making tasks in robotics and AI. Training these models requires a large labeled dataset and considerable computational resources, which are not readily available in many domains. Also, it is difficult to explore the internal representations and reasoning mechanisms of these models. As a step towards addressing the underlying knowledge representation, reasoning, and learning challenges, the architecture described in this paper draws inspiration from research in cognitive systems. As a motivating example, we consider an assistive robot trying to reduce clutter in any given scene by reasoning about the occlusion of objects and stability of object configurations in an image of the scene. In this context, our architecture incrementally learns and revises a grounding of the spatial relations between objects and uses this grounding to extract spatial information from input images. Non-monotonic logical reasoning with this information and incomplete commonsense domain knowledge is used to make decisions about stability and occlusion. For images that cannot be processed by such reasoning, regions relevant to the tasks at hand are automatically identified and used to train deep network models to make the desired decisions. Image regions used to train the deep networks are also used to incrementally acquire previously unknown state constraints that are merged with the existing knowledge for subsequent reasoning. Experimental evaluation performed using simulated and real-world images indicates that in comparison with baselines based just on deep networks, our architecture improves reliability of decision making and reduces the effort involved in training data-driven deep network models.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源