决策算法评估的多模式关键方案生成

论文标题

决策算法评估的多模式关键方案生成

Multimodal Safety-Critical Scenarios Generation for Decision-Making Algorithms Evaluation

论文作者

Ding, Wenhao, Chen, Baiming, Li, Bo, Eun, Kim Ji, Zhao, Ding

论文摘要

现有的基于神经网络的自主系统被证明是针对对抗性攻击的脆弱性，因此对其鲁棒性进行了复杂的评估非常重要。但是，仅在基于已知攻击的最坏情况下评估鲁棒性并不全面，更不用说其中一些甚至很少发生在现实世界中。此外，关键数据的分布通常是多模式的，而大多数传统攻击和评估方法都集中在单一模态上。为了解决上述挑战，我们提出了一种基于流动的多模式安全 - 关键方案生成器，用于评估决策算法。提出的生成模型通过加权可能性最大化进行了优化，并集成了基于梯度的采样程序以提高采样效率。通过查询任务算法来生成安全 - 关键方案，生成的场景的对数可能与风险水平成比例。关于自动驾驶任务的实验证明了我们在测试效率和多模型建模能力方面的优势。我们通过生成的交通情况评估了六种强化学习算法，并就其稳健性提供了经验结论。

Existing neural network-based autonomous systems are shown to be vulnerable against adversarial attacks, therefore sophisticated evaluation on their robustness is of great importance. However, evaluating the robustness only under the worst-case scenarios based on known attacks is not comprehensive, not to mention that some of them even rarely occur in the real world. In addition, the distribution of safety-critical data is usually multimodal, while most traditional attacks and evaluation methods focus on a single modality. To solve the above challenges, we propose a flow-based multimodal safety-critical scenario generator for evaluating decisionmaking algorithms. The proposed generative model is optimized with weighted likelihood maximization and a gradient-based sampling procedure is integrated to improve the sampling efficiency. The safety-critical scenarios are generated by querying the task algorithms and the log-likelihood of the generated scenarios is in proportion to the risk level. Experiments on a self-driving task demonstrate our advantages in terms of testing efficiency and multimodal modeling capability. We evaluate six Reinforcement Learning algorithms with our generated traffic scenarios and provide empirical conclusions about their robustness.

下载PDF全文

下载文献需遵守相关版权规定

论文标题