Safety-Aware Multi-Agent Apprenticeship Learning

Paper Title

Safety-Aware Multi-Agent Apprenticeship Learning

Author

Zhao, Junchen

Abstract

This project extends the technique presented in the paper "Safety-Aware Apprenticeship Learning" from a Single-Agent Learning framework to a Multi-Agent Learning framework, with the aim of improving the utility and efficiency of the existing Reinforcement Learning model. Our contributions are as follows: 1. We extend the Inverse Reinforcement Learning model from the Single-Agent scenario to the Multi-Agent scenario, considering the extraction of safe reward functions from expert behaviors in a Multi-Agent setting rather than a Single-Agent one. 2. We extend the Single-Agent Learning framework to a Multi-Agent Learning framework and design a novel learning framework based on this extension. 3. We empirically evaluate the performance of our extension to the Single-Agent Inverse Reinforcement Learning framework.
