论文标题
概率人群gan:使用图形培训的多模式行人轨迹预测
Probabilistic Crowd GAN: Multimodal Pedestrian Trajectory Prediction using a Graph Vehicle-Pedestrian Attention Network
论文作者
论文摘要
理解和预测行人的意图对于使自动驾驶汽车和移动机器人驾驶人群至关重要。当我们考虑行人运动的不确定性和多模式以及人群成员之间的隐式相互作用(包括对车辆的任何反应)时,这个问题变得越来越复杂。我们的概率人群GAN的方法扩展了轨迹预测的最新工作,将复发性神经网络(RNN)与混合物密度网络(MDN)相结合到输出概率的多模式预测,从中可能发现并使用模态路径,并用于对抗性训练。我们还建议使用图形驾驶员注意力网络(GVAT),该网络对社交相互作用进行建模并允许输入共享车辆功能,这表明包含该模块会导致有和不存在车辆的情况下改善轨迹预测。通过对各种数据集的评估,我们证明了对轨迹预测的最新方法的改进,并说明了如何直接建模人群相互作用的真实多模式和不确定性质。
Understanding and predicting the intention of pedestrians is essential to enable autonomous vehicles and mobile robots to navigate crowds. This problem becomes increasingly complex when we consider the uncertainty and multimodality of pedestrian motion, as well as the implicit interactions between members of a crowd, including any response to a vehicle. Our approach, Probabilistic Crowd GAN, extends recent work in trajectory prediction, combining Recurrent Neural Networks (RNNs) with Mixture Density Networks (MDNs) to output probabilistic multimodal predictions, from which likely modal paths are found and used for adversarial training. We also propose the use of Graph Vehicle-Pedestrian Attention Network (GVAT), which models social interactions and allows input of a shared vehicle feature, showing that inclusion of this module leads to improved trajectory prediction both with and without the presence of a vehicle. Through evaluation on various datasets, we demonstrate improvements on the existing state of the art methods for trajectory prediction and illustrate how the true multimodal and uncertain nature of crowd interactions can be directly modelled.