快速而缓慢的决策交织

论文标题

快速而缓慢的决策交织

Interleaving Fast and Slow Decision Making

论文作者

Gulati, Aditya, Soni, Sarthak, Rao, Shrisha

论文摘要

Kahneman的“思维，快速和慢”范式提出，我们使用两种不同的思维方式 - 快速，直观的系统1用于某些任务，以及对其他任务的较慢但更分析的系统2。虽然使用这种两组思维风格的想法是在AI和机器人技术中越来越受欢迎，但我们的作品考虑如何交织两种决策样式，即如何使用系统1和系统2一起使用。 For this, we propose a novel and general framework which includes a new System 0 to oversee Systems 1 and 2. At every point when a decision needs to be made, System 0 evaluates the situation and quickly hands over the decision-making process to either System 1 or System 2. We evaluate such a framework on a modified version of the classic Pac-Man game, with an already-trained RL algorithm for System 1, a Monte-Carlo tree search for System 2, and several different possible strategies for系统0。正如预期的那样，系统1和2之间的任意切换不起作用，但某些策略效果很好。使用System 0，代理能够比仅使用System 1或System 2的代理更好。

The "Thinking, Fast and Slow" paradigm of Kahneman proposes that we use two different styles of thinking -- a fast and intuitive System 1 for certain tasks, along with a slower but more analytical System 2 for others. While the idea of using this two-system style of thinking is gaining popularity in AI and robotics, our work considers how to interleave the two styles of decision-making, i.e., how System 1 and System 2 should be used together. For this, we propose a novel and general framework which includes a new System 0 to oversee Systems 1 and 2. At every point when a decision needs to be made, System 0 evaluates the situation and quickly hands over the decision-making process to either System 1 or System 2. We evaluate such a framework on a modified version of the classic Pac-Man game, with an already-trained RL algorithm for System 1, a Monte-Carlo tree search for System 2, and several different possible strategies for System 0. As expected, arbitrary switches between Systems 1 and 2 do not work, but certain strategies do well. With System 0, an agent is able to perform better than one that uses only System 1 or System 2.

下载PDF全文

下载文献需遵守相关版权规定

论文标题