深入研究Convnets的抗氧化

论文标题

深入研究Convnets的抗氧化

Delving Deeper into Anti-aliasing in ConvNets

论文作者

Zou, Xueyan, Xiao, Fanyi, Yu, Zhiding, Lee, Yong Jae

论文摘要

混叠是指高频信号在抽样后退化为完全不同的现象。在深度学习的背景下，它是一个问题，因为在深度体系结构中广泛采用了缩减采样层以减少参数和计算。标准解决方案是在下采样之前应用低通滤波器（例如，高斯模糊）。但是，在整个内容上应用相同的过滤器可能是次优的，因为特征地图的频率在空间位置和特征通道之间都可能有所不同。为了解决这个问题，我们提出了一个自适应内容感知的低通滤波层，该层可预测每个空间位置和输入特征图的通道组的单独滤波重量。我们研究了跨多个任务的拟议方法的有效性和概括，包括成像网分类，可可实例分割和城市景观语义分割。定性和定量结果表明，我们的方法有效地适应了不同的特征频率，以避免在保留有用的信息以识别的同时混音。代码可在https://maureenzou.github.io/ddac/上找到。

Aliasing refers to the phenomenon that high frequency signals degenerate into completely different ones after sampling. It arises as a problem in the context of deep learning as downsampling layers are widely adopted in deep architectures to reduce parameters and computation. The standard solution is to apply a low-pass filter (e.g., Gaussian blur) before downsampling. However, it can be suboptimal to apply the same filter across the entire content, as the frequency of feature maps can vary across both spatial locations and feature channels. To tackle this, we propose an adaptive content-aware low-pass filtering layer, which predicts separate filter weights for each spatial location and channel group of the input feature maps. We investigate the effectiveness and generalization of the proposed method across multiple tasks including ImageNet classification, COCO instance segmentation, and Cityscapes semantic segmentation. Qualitative and quantitative results demonstrate that our approach effectively adapts to the different feature frequencies to avoid aliasing while preserving useful information for recognition. Code is available at https://maureenzou.github.io/ddac/.

下载PDF全文

下载文献需遵守相关版权规定

论文标题