基于决策的通用对手攻击

论文标题

基于决策的通用对手攻击

Decision-based Universal Adversarial Attack

论文作者

Wu, Jing, Zhou, Mingyi, Liu, Shuaicheng, Liu, Yipeng, Zhu, Ce

论文摘要

单个扰动可能构成最自然的图像，该图像被分类器错误分类。在Black-Box设置中，当前的通用对抗攻击方法利用替代模型来生成扰动，然后将扰动应用于攻击模型。但是，这种转移通常会产生较低的结果。在这项研究中，我们直接在黑盒环境中工作，以产生通用的对抗扰动。此外，我们旨在设计一个对手，该对手产生一个基于正交矩阵的条纹等质地的扰动，因为顶部卷积层对条纹敏感。为此，我们提出了一个有效的基于决策的通用攻击（DUATTACK）。由于数据很少，因此提出的对手仅根据最终推断的标签计算扰动，但不仅在模型之间实现了良好的可传递性，而且还跨越了不同的视觉任务。通过与其他最新攻击的比较来验证Duattack的有效性。在包括Microsoft Azure在内的现实世界环境中，还证明了Duattack的效率。此外，几种代表性的防御方法正在与杜塔克（Duattack）挣扎，表明该方法的实用性。

A single perturbation can pose the most natural images to be misclassified by classifiers. In black-box setting, current universal adversarial attack methods utilize substitute models to generate the perturbation, then apply the perturbation to the attacked model. However, this transfer often produces inferior results. In this study, we directly work in the black-box setting to generate the universal adversarial perturbation. Besides, we aim to design an adversary generating a single perturbation having texture like stripes based on orthogonal matrix, as the top convolutional layers are sensitive to stripes. To this end, we propose an efficient Decision-based Universal Attack (DUAttack). With few data, the proposed adversary computes the perturbation based solely on the final inferred labels, but good transferability has been realized not only across models but also span different vision tasks. The effectiveness of DUAttack is validated through comparisons with other state-of-the-art attacks. The efficiency of DUAttack is also demonstrated on real world settings including the Microsoft Azure. In addition, several representative defense methods are struggling with DUAttack, indicating the practicability of the proposed method.

下载PDF全文

下载文献需遵守相关版权规定

论文标题