论文标题
没有目标域真实图像的深度交通标志检测和识别
Deep Traffic Sign Detection and Recognition Without Target Domain Real Images
论文作者
论文摘要
深度学习已成功地应用于与自主驾驶有关的几个问题,通常依靠真正的目标域图像的大型数据库进行适当的培训。在自动驾驶的情况下,并非总是可以获取此类现实数据,有时它们的注释是不可行的。此外,在许多任务中,大多数基于学习的方法都难以应对。特别是,交通标志检测是一个具有挑战性的问题,在该问题中,这三个问题被完全看到。为了应对这些挑战,我们提出了一种新颖的数据库生成方法,该方法仅需要(i)任意自然图像,即不需要目标域中的真实图像,以及(ii)流量标志的模板。该方法不是旨在用实际数据克服培训,而是在没有真实数据时成为兼容的替代方案。毫不费力地生成的数据库可有效地培训来自多个国家的交通标志的深度探测器。在大型数据集中,使用完全合成数据集的培训几乎与培训的性能与真实的培训相匹配。与使用较小的真实图像的训练相比,使用合成图像的训练将精度提高了12.25%。当目标域数据可用时,提出的方法还可以改善检测器的性能。
Deep learning has been successfully applied to several problems related to autonomous driving, often relying on large databases of real target-domain images for proper training. The acquisition of such real-world data is not always possible in the self-driving context, and sometimes their annotation is not feasible. Moreover, in many tasks, there is an intrinsic data imbalance that most learning-based methods struggle to cope with. Particularly, traffic sign detection is a challenging problem in which these three issues are seen altogether. To address these challenges, we propose a novel database generation method that requires only (i) arbitrary natural images, i.e., requires no real image from the target-domain, and (ii) templates of the traffic signs. The method does not aim at overcoming the training with real data, but to be a compatible alternative when the real data is not available. The effortlessly generated database is shown to be effective for the training of a deep detector on traffic signs from multiple countries. On large data sets, training with a fully synthetic data set almost matches the performance of training with a real one. When compared to training with a smaller data set of real images, training with synthetic images increased the accuracy by 12.25%. The proposed method also improves the performance of the detector when target-domain data are available.