未信号的道路交叉口的实时合作车辆协调

论文标题

未信号的道路交叉口的实时合作车辆协调

Real-time Cooperative Vehicle Coordination at Unsignalized Road Intersections

论文作者

Luo, Jiping, Zhang, Tingting, Hao, Rui, Li, Donglin, Chen, Chunsheng, Na, Zhenyu, Zhang, Qinyu

论文摘要

旨在改善连接和自动化车辆的驾驶安全性和交通吞吐量的未信号的道路交叉口的合作协调吸引了近年来的兴趣。但是，大多数现有的调查要么具有计算复杂性，要么无法利用道路基础设施的全部潜力。为此，我们首先提出了一个专用的交叉协调框架，其中所涉及的车辆将移交其控制当局并遵循集中协调员的指示。然后，将制定统一的合作轨迹优化问题，以最大程度地提高流量吞吐量，同时确保协调系统的驾驶安全性和长期稳定性。为了应对现实世界部署的关键计算挑战，我们将这个非凸的顺序决策问题重新将其重新制定为无模型的马尔可夫决策过程（MDP），并通过设计基于深度强化学习（DRL）框架的双重延迟的深层确定性策略梯度（TD3）策略来解决它。仿真和实践实验表明，所提出的策略可以在亚静态协调方案中实现近乎最佳的性能，并显着改善现实连续交通流量中的交通吞吐量。最显着的优势是，我们的策略可以减少计算时间的复杂性，并在道路车道增加时显示可扩展。

Cooperative coordination at unsignalized road intersections, which aims to improve the driving safety and traffic throughput for connected and automated vehicles, has attracted increasing interests in recent years. However, most existing investigations either suffer from computational complexity or cannot harness the full potential of the road infrastructure. To this end, we first present a dedicated intersection coordination framework, where the involved vehicles hand over their control authorities and follow instructions from a centralized coordinator. Then a unified cooperative trajectory optimization problem will be formulated to maximize the traffic throughput while ensuring the driving safety and long-term stability of the coordination system. To address the key computational challenges in the real-world deployment, we reformulate this non-convex sequential decision problem into a model-free Markov Decision Process (MDP) and tackle it by devising a Twin Delayed Deep Deterministic Policy Gradient (TD3)-based strategy in the deep reinforcement learning (DRL) framework. Simulation and practical experiments show that the proposed strategy could achieve near-optimal performance in sub-static coordination scenarios and significantly improve the traffic throughput in the realistic continuous traffic flow. The most remarkable advantage is that our strategy could reduce the time complexity of computation to milliseconds, and is shown scalable when the road lanes increase.

下载PDF全文

下载文献需遵守相关版权规定

论文标题