论文标题
更少的是:大型航空图像中有效的对象检测
Fewer is More: Efficient Object Detection in Large Aerial Images
论文作者
论文摘要
当前的大型航空图像的当前主流对象检测方法通常将大图像划分为贴片,然后在所有贴片上详尽地检测出感兴趣的对象,无论是否存在对象。该范式虽然有效,但效率很低,因为检测器必须经过所有斑块,从而严重阻碍了推理速度。本文提出了一个客观激活网络(OAN),以帮助检测器专注于更少的补丁,但获得了更有效的推断和更准确的结果,从而实现了一个简单有效的解决方案,以在大图像中进行对象检测。简而言之,OAN是一个光明的完全跨跨方向网络,用于判断每个贴片是否包含对象,可以轻松地集成到许多对象探测器中,并与它们端到端共同训练。我们用五个高级探测器广泛评估了我们的OAN。使用OAN,所有五个检测器都在三个大规模的空中图像数据集上获得了30.0%以上的加速,同时改进了一致的准确性。在极大的GaOfen-2图像(29200 $ \ times $ 27620像素)上,我们的OAN将检测速度提高了70.5%。此外,我们将OAN扩展到驾驶场景对象检测和4K视频对象检测,分别将检测速度提高112.1%和75.0%,而无需牺牲准确性。代码可在https://github.com/ranchosky/oan上找到。
Current mainstream object detection methods for large aerial images usually divide large images into patches and then exhaustively detect the objects of interest on all patches, no matter whether there exist objects or not. This paradigm, although effective, is inefficient because the detectors have to go through all patches, severely hindering the inference speed. This paper presents an Objectness Activation Network (OAN) to help detectors focus on fewer patches but achieve more efficient inference and more accurate results, enabling a simple and effective solution to object detection in large images. In brief, OAN is a light fully-convolutional network for judging whether each patch contains objects or not, which can be easily integrated into many object detectors and jointly trained with them end-to-end. We extensively evaluate our OAN with five advanced detectors. Using OAN, all five detectors acquire more than 30.0% speed-up on three large-scale aerial image datasets, meanwhile with consistent accuracy improvements. On extremely large Gaofen-2 images (29200$\times$27620 pixels), our OAN improves the detection speed by 70.5%. Moreover, we extend our OAN to driving-scene object detection and 4K video object detection, boosting the detection speed by 112.1% and 75.0%, respectively, without sacrificing the accuracy. Code is available at https://github.com/Ranchosky/OAN.