论文标题
自动无人机赛车中有效神经网络培训的图像生成
Image Generation for Efficient Neural Network Training in Autonomous Drone Racing
论文作者
论文摘要
无人机赛车是一项休闲运动,其目标是在避免碰撞的同时,在最短的时间内通过一系列大门。在自主无人机赛车中,必须通过仅依靠计算机视觉方法来检测目标门来完全自动飞行,从而完成这项任务。由于背景对象和不同照明条件等挑战,基于颜色或几何形状的传统对象检测算法往往会失败。卷积神经网络在计算机视觉方面带来了令人印象深刻的进步,但需要大量的数据才能学习。收集这些数据是一个乏味的过程,因为必须手动飞行无人机,并且收集的数据可能会遭受传感器故障的损失。在这项工作中,提出了一种半合成数据集生成方法,使用了真正的背景图像和大门的随机3D渲染,以提供无限量的训练样本,而这些样本不会遭受这些缺陷的影响。使用检测结果,使用了视线引导算法来越过大门。在一些实验实时测试中,提出的框架成功证明了快速可靠的检测和导航。
Drone racing is a recreational sport in which the goal is to pass through a sequence of gates in a minimum amount of time while avoiding collisions. In autonomous drone racing, one must accomplish this task by flying fully autonomously in an unknown environment by relying only on computer vision methods for detecting the target gates. Due to the challenges such as background objects and varying lighting conditions, traditional object detection algorithms based on colour or geometry tend to fail. Convolutional neural networks offer impressive advances in computer vision but require an immense amount of data to learn. Collecting this data is a tedious process because the drone has to be flown manually, and the data collected can suffer from sensor failures. In this work, a semi-synthetic dataset generation method is proposed, using a combination of real background images and randomised 3D renders of the gates, to provide a limitless amount of training samples that do not suffer from those drawbacks. Using the detection results, a line-of-sight guidance algorithm is used to cross the gates. In several experimental real-time tests, the proposed framework successfully demonstrates fast and reliable detection and navigation.