机器语言的出现：使用神经网络迈向象征性智能

论文标题

机器语言的出现：使用神经网络迈向象征性智能

Emergence of Machine Language: Towards Symbolic Intelligence with Neural Networks

论文作者

Wang, Yuqi, Zhang, Xu-Yao, Liu, Cheng-Lin, Zhang, Zhaoxiang

论文摘要

表示是人工智能中的核心问题。人类使用离散的语言互相交流和学习，而机器使用连续的特征（例如深度神经网络中的向量，矩阵或张量）来表示认知模式。离散的符号是低维，脱钩的，具有强大的推理能力，而连续特征是高维，耦合的，并且具有令人难以置信的抽象功能。近年来，深度学习已经开发了连续代表到极端的想法，它使用数百万参数来实现高精度。尽管从统计角度来看，这是合理的，但它还有其他主要问题，例如缺乏可解释性，概括性差，并且很容易受到攻击。由于两个范式都有优势和劣势，因此更好的选择是寻求和解。在本文中，我们最初朝着这个方向迈进。具体而言，我们建议通过使用神经网络得出离散表示来结合象征主义和联系原理。这个过程与人类语言高度相似，人类语言是离散符号和神经系统的自然组合，其中大脑会处理连续信号并通过离散语言代表智能。为了模仿此功能，我们将方法表示为机器语言。通过设计互动环境和任务，我们证明了机器可以通过合作产生自发，灵活和语义的语言。此外，通过实验，我们表明，离散语言表示与连续特征表示相比，从可解释性，概括和鲁棒性的各个方面具有多个优点。

Representation is a core issue in artificial intelligence. Humans use discrete language to communicate and learn from each other, while machines use continuous features (like vector, matrix, or tensor in deep neural networks) to represent cognitive patterns. Discrete symbols are low-dimensional, decoupled, and have strong reasoning ability, while continuous features are high-dimensional, coupled, and have incredible abstracting capabilities. In recent years, deep learning has developed the idea of continuous representation to the extreme, using millions of parameters to achieve high accuracies. Although this is reasonable from the statistical perspective, it has other major problems like lacking interpretability, poor generalization, and is easy to be attacked. Since both paradigms have strengths and weaknesses, a better choice is to seek reconciliation. In this paper, we make an initial attempt towards this direction. Specifically, we propose to combine symbolism and connectionism principles by using neural networks to derive a discrete representation. This process is highly similar to human language, which is a natural combination of discrete symbols and neural systems, where the brain processes continuous signals and represents intelligence via discrete language. To mimic this functionality, we denote our approach as machine language. By designing an interactive environment and task, we demonstrated that machines could generate a spontaneous, flexible, and semantic language through cooperation. Moreover, through experiments we show that discrete language representation has several advantages compared with continuous feature representation, from the aspects of interpretability, generalization, and robustness.

下载PDF全文

下载文献需遵守相关版权规定

论文标题