论文标题
内核反转金字塔调整大小网络,以提供有效的路面遇险识别
Kernel Inversed Pyramidal Resizing Network for Efficient Pavement Distress Recognition
论文作者
论文摘要
路面遇险识别(PDR)是路面检查中的重要一步,可以通过基于图像的自动化来驱动以加快流程并降低人工成本。路面图像通常是高分辨率的,苦恼与未分配区域的比例较低。高级方法通过将图像分为补丁并探索秤空间中的歧视特征来利用这些属性。但是,这些方法通常在图像调整大小和由于复杂的学习框架引起的效率低时会遭受信息丢失。在本文中,我们提出了一种新颖有效的PDR方法。引入了一个名为内核的锥体锥体调整网络(KIPRN)的光网络,以进行图像调整,并可以灵活地插入图像分类网络中,以作为利用分辨率和规模信息的预网络。在基普恩(Kiprn)中,锥体卷积和内核反卷积是专门设计的,用于跨不同特征粒度和尺度挖掘歧视性信息。挖掘的信息将传递到调整大小的图像中,以产生信息丰富的图像金字塔,以帮助PDR的图像分类网络。我们将方法应用于三个众所周知的卷积神经网络(CNN),并对名为CQU-BPDD的大规模路面图像数据集进行了评估。广泛的结果表明,KIPRN通常可以改善这些CNN模型的路面遇险识别,并表明KIPRN和EficitedNet-B3的简单组合在性能和效率方面显着胜过基于最先进的斑块方法。
Pavement Distress Recognition (PDR) is an important step in pavement inspection and can be powered by image-based automation to expedite the process and reduce labor costs. Pavement images are often in high-resolution with a low ratio of distressed to non-distressed areas. Advanced approaches leverage these properties via dividing images into patches and explore discriminative features in the scale space. However, these approaches usually suffer from information loss during image resizing and low efficiency due to complex learning frameworks. In this paper, we propose a novel and efficient method for PDR. A light network named the Kernel Inversed Pyramidal Resizing Network (KIPRN) is introduced for image resizing, and can be flexibly plugged into the image classification network as a pre-network to exploit resolution and scale information. In KIPRN, pyramidal convolution and kernel inversed convolution are specifically designed to mine discriminative information across different feature granularities and scales. The mined information is passed along to the resized images to yield an informative image pyramid to assist the image classification network for PDR. We applied our method to three well-known Convolutional Neural Networks (CNNs), and conducted an evaluation on a large-scale pavement image dataset named CQU-BPDD. Extensive results demonstrate that KIPRN can generally improve the pavement distress recognition of these CNN models and show that the simple combination of KIPRN and EfficientNet-B3 significantly outperforms the state-of-the-art patch-based method in both performance and efficiency.