自动驾驶汽车的道路交通法自适应决策

论文标题

自动驾驶汽车的道路交通法自适应决策

Road Traffic Law Adaptive Decision-making for Self-Driving Vehicles

论文作者

Liu, Jiaxin, Zhou, Wenhui, Wang, Hong, Cao, Zhong, Yu, Wenhao, Zhao, Chengxiang, Zhao, Ding, Yang, Diange, Li, Jun

论文摘要

自动驾驶汽车有自己的智能在开放道路上行驶。但是，车辆经理，例如政府或工业公司，仍然需要一种方法来告诉这些自动驾驶汽车，鼓励或禁止采取什么行为。与人类驾驶员不同，当前的自动驾驶车辆无法理解交通法，因此依靠程序员手动将相应的原理编写到驾驶系统中。适应一些临时交通法律的效率和难以提高，尤其是当车辆使用数据驱动的决策算法时。此外，当前的自动驾驶车辆系统很少考虑交通法则的修改。这项工作旨在设计道路交通法自适应决策方法。决策算法是基于强化学习设计的，其中通常在深层神经网络中隐式编码流量规则。主要思想是通过法律自动备份政策为自动驾驶车辆的交通法律提供适应性。在这项工作中，基于自然语言的交通法律首先通过线性时间逻辑方法转化为逻辑表达式。然后，该系统将尝试提前监视自动驾驶车辆是否可以通过设计长期RL动作空间来破坏交通法律。最后，基于样本的计划方法将在车辆破坏交通规则时重新计划轨迹。该方法在北京冬季奥林匹克车道的场景中得到了验证，并且是在卡拉模拟器中建造的超车案例。结果表明，通过采用这种方法，自动驾驶汽车可以有效地遵守新的发行或更新的交通法律。这种方法有助于由数字交通法律管理的自动驾驶车辆，这对于广泛采用自动驾驶所必需的必要。

Self-driving vehicles have their own intelligence to drive on open roads. However, vehicle managers, e.g., government or industrial companies, still need a way to tell these self-driving vehicles what behaviors are encouraged or forbidden. Unlike human drivers, current self-driving vehicles cannot understand the traffic laws, thus rely on the programmers manually writing the corresponding principles into the driving systems. It would be less efficient and hard to adapt some temporary traffic laws, especially when the vehicles use data-driven decision-making algorithms. Besides, current self-driving vehicle systems rarely take traffic law modification into consideration. This work aims to design a road traffic law adaptive decision-making method. The decision-making algorithm is designed based on reinforcement learning, in which the traffic rules are usually implicitly coded in deep neural networks. The main idea is to supply the adaptability to traffic laws of self-driving vehicles by a law-adaptive backup policy. In this work, the natural language-based traffic laws are first translated into a logical expression by the Linear Temporal Logic method. Then, the system will try to monitor in advance whether the self-driving vehicle may break the traffic laws by designing a long-term RL action space. Finally, a sample-based planning method will re-plan the trajectory when the vehicle may break the traffic rules. The method is validated in a Beijing Winter Olympic Lane scenario and an overtaking case, built in CARLA simulator. The results show that by adopting this method, the self-driving vehicles can comply with new issued or updated traffic laws effectively. This method helps self-driving vehicles governed by digital traffic laws, which is necessary for the wide adoption of autonomous driving.

下载PDF全文

下载文献需遵守相关版权规定

论文标题