论文标题
增强基于神经网络的欺诈探测器针对伪装欺诈者
Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters
论文作者
论文摘要
近年来,图形神经网络(GNN)已被广泛应用于欺诈检测问题,通过通过不同的关系汇总其邻居信息,揭示了节点的可疑性。但是,很少有事实注意到欺诈者的伪装行为,这可能会阻碍基于GNN的欺诈探测器在聚合过程中的性能。在本文中,我们根据最近的经验研究介绍了两种类型的伪装,即特征伪装和关系伪装。现有的GNN尚未解决这两种伪装,这导致他们在欺诈检测问题方面的表现不佳。另外,我们提出了一个名为耐伪装的GNN(CARE-GNN)的新模型,以通过针对迷彩的三个独特模块来增强GNN聚合过程。具体而言,我们首先设计了一种标签感知的相似性度量,以找到信息丰富的相邻节点。然后,我们利用强化学习(RL)找到要选择的邻居的最佳数量。最后,跨不同关系的选定邻居聚集在一起。对两个现实世界欺诈数据集进行的全面实验证明了RL算法的有效性。拟议的Care-GNN还胜过最先进的GNN和基于GNN的欺诈探测器。我们将所有基于GNN的欺诈探测器集成为OpenSource工具箱:https://github.com/safe-graph/dgfraud。 CARE-GNN代码和数据集可在https://github.com/yingtongdou/care-gnn上找到。
Graph Neural Networks (GNNs) have been widely applied to fraud detection problems in recent years, revealing the suspiciousness of nodes by aggregating their neighborhood information via different relations. However, few prior works have noticed the camouflage behavior of fraudsters, which could hamper the performance of GNN-based fraud detectors during the aggregation process. In this paper, we introduce two types of camouflages based on recent empirical studies, i.e., the feature camouflage and the relation camouflage. Existing GNNs have not addressed these two camouflages, which results in their poor performance in fraud detection problems. Alternatively, we propose a new model named CAmouflage-REsistant GNN (CARE-GNN), to enhance the GNN aggregation process with three unique modules against camouflages. Concretely, we first devise a label-aware similarity measure to find informative neighboring nodes. Then, we leverage reinforcement learning (RL) to find the optimal amounts of neighbors to be selected. Finally, the selected neighbors across different relations are aggregated together. Comprehensive experiments on two real-world fraud datasets demonstrate the effectiveness of the RL algorithm. The proposed CARE-GNN also outperforms state-of-the-art GNNs and GNN-based fraud detectors. We integrate all GNN-based fraud detectors as an opensource toolbox: https://github.com/safe-graph/DGFraud. The CARE-GNN code and datasets are available at https://github.com/YingtongDou/CARE-GNN.