论文标题
Jacobian合奏提高了稳健性权衡,以对抗攻击
Jacobian Ensembles Improve Robustness Trade-offs to Adversarial Attacks
论文作者
论文摘要
深度神经网络已成为我们软件基础架构的组成部分,并已部署在许多广泛使用和安全关键的应用程序中。但是,它们整合到许多系统中也使它以通用对抗扰动(UAPS)的形式测试时间攻击的脆弱性。 UAP是一类扰动,当应用于任何输入时,导致模型错误分类。尽管持续努力捍卫模型免受这些对抗性攻击,但通常很难在模型的准确性和鲁棒性方面调和对抗性攻击的权衡。已证明雅各布式的正则化可以提高模型对UAP的鲁棒性,而模型集合已被广泛采用以提高预测性能和模型鲁棒性。在这项工作中,我们提出了一种新颖的方法,Jacobian合奏 - A Jacobian正则化和模型集合的组合,以显着提高对UAP的鲁棒性,同时维持或提高模型的准确性。我们的结果表明,Jacobian合奏达到了以前看不见的准确性和鲁棒性水平,从而大大改善了以前的方法,而这种方法倾向于仅偏向准确性或鲁棒性。
Deep neural networks have become an integral part of our software infrastructure and are being deployed in many widely-used and safety-critical applications. However, their integration into many systems also brings with it the vulnerability to test time attacks in the form of Universal Adversarial Perturbations (UAPs). UAPs are a class of perturbations that when applied to any input causes model misclassification. Although there is an ongoing effort to defend models against these adversarial attacks, it is often difficult to reconcile the trade-offs in model accuracy and robustness to adversarial attacks. Jacobian regularization has been shown to improve the robustness of models against UAPs, whilst model ensembles have been widely adopted to improve both predictive performance and model robustness. In this work, we propose a novel approach, Jacobian Ensembles-a combination of Jacobian regularization and model ensembles to significantly increase the robustness against UAPs whilst maintaining or improving model accuracy. Our results show that Jacobian Ensembles achieves previously unseen levels of accuracy and robustness, greatly improving over previous methods that tend to skew towards only either accuracy or robustness.