Paper Title
Are Gradients on Graph Structure Reliable in Gray-box Attacks?
Authors
Abstract
Graph edge perturbations aim to degrade the predictions of graph neural networks by modifying the graph structure. Previous gray-box attackers use gradients from a surrogate model to locate vulnerable edges to perturb. However, gradients on graph structure are unreliable, an issue that previous work has rarely studied. In this paper, we discuss and analyze the errors caused by the unreliability of the structural gradients. These errors arise from rough gradient usage due to the discreteness of the graph structure and from the unreliability of the meta-gradient on the graph structure. To address these problems, we propose a novel attack model with methods to reduce the errors inside the structural gradients. We propose edge discrete sampling to select edge perturbations, combined with hierarchical candidate selection to ensure computational efficiency. In addition, we propose semantic invariance and momentum gradient ensemble to address gradient fluctuation on semantic-augmented graphs and the instability of the surrogate model. Experiments conducted in untargeted gray-box poisoning scenarios demonstrate the improved performance of our approach.
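The "rough gradient usage" caused by discreteness can be illustrated with a toy example. The sketch below is not the paper's attack model: it assumes a hypothetical one-layer linear surrogate with logistic loss, scalar node features, and a directed adjacency matrix, all chosen only to keep the arithmetic visible. For each candidate edge flip it compares the first-order estimate (gradient times the change in the entry) with the actual change in loss after the flip.

```python
import math

def loss(adj, x, y):
    """Logistic loss of a toy surrogate: score_i = sum_j adj[i][j] * x[j]."""
    total = 0.0
    for i, row in enumerate(adj):
        score = sum(a * xj for a, xj in zip(row, x))
        total += math.log1p(math.exp(-y[i] * score))
    return total

def grad_wrt_adj(adj, x, y):
    """Analytic gradient of the loss with respect to every adjacency entry."""
    grad = []
    for i, row in enumerate(adj):
        score = sum(a * xj for a, xj in zip(row, x))
        # d/ds log(1 + exp(-y * s)) = -y / (1 + exp(y * s))
        coeff = -y[i] / (1.0 + math.exp(y[i] * score))
        grad.append([coeff * xj for xj in x])
    return grad

def compare_flips(adj, x, y):
    """Return (i, j, first_order_estimate, actual_change) for each off-diagonal flip."""
    base = loss(adj, x, y)
    grad = grad_wrt_adj(adj, x, y)
    n = len(adj)
    rows = []
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            delta = 1 - 2 * adj[i][j]          # +1 adds the edge, -1 removes it
            estimate = grad[i][j] * delta      # what the gradient predicts
            flipped = [r[:] for r in adj]
            flipped[i][j] += delta             # the discrete flip itself
            actual = loss(flipped, x, y) - base
            rows.append((i, j, estimate, actual))
    return rows

# Hypothetical toy data (not from the paper): 3 nodes, labels in {-1, +1}.
ADJ = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
X = [1.5, -2.0, 0.5]
Y = [1, -1, 1]

for i, j, estimate, actual in compare_flips(ADJ, X, Y):
    print(f"flip ({i},{j}): estimate={estimate:+.4f} actual={actual:+.4f}")
```

In this toy setting the loss is convex in each adjacency entry, so the first-order estimate never exceeds the true change, but the two still differ because a flip is a finite step rather than an infinitesimal one. For a multi-layer surrogate no such one-sided guarantee holds, so ranking edges by gradient alone may misorder them.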