论文标题
使用配置模型总结图
Summarizing graphs using the configuration model
论文作者
论文摘要
给定一个大图,我们如何在维护其关键属性(例如Spectral属性)的同时以更少的节点和边缘进行总结?尽管图表在许多实际应用中扮演着越来越重要的角色,但其大小的增长给图形分析带来了巨大的挑战。作为解决方案,旨在找到保留给定图的重要特性的紧凑表示的图形摘要已引起了很多关注,并且已经为其开发了许多算法。但是,大多数算法采用统一的重建方案,该方案基于一个不切实际的假设,即边缘均匀分布。在这项工作中,我们提出了一种新颖而现实的重建方案,该方案保留了节点的程度,并根据最小描述长度原理开发了一种有效的图形摘要算法,称为DPGS。从理论上讲,我们从光谱的角度分析了原始图和摘要图之间的差异,并在多个现实世界数据集上进行了广泛的实验。结果表明,DPG产生紧凑的表示,可保留原始图的基本特性。
Given a large graph, how can we summarize it with fewer nodes and edges while maintaining its key properties, such as spectral property? Although graphs play more and more important roles in many real-world applications, the growth of their size presents great challenges to graph analysis. As a solution, graph summarization, which aims to find a compact representation that preserves the important properties of a given graph, has received much attention, and numerous algorithms have been developed for it. However, most of the algorithms adopt the uniform reconstruction scheme, which is based on an unrealistic assumption that edges are uniformly distributed. In this work, we propose a novel and realistic reconstruction scheme, which preserves the degree of nodes, and we develop an efficient graph summarization algorithm called DPGS based on the Minimum Description Length principle. We theoretically analyze the difference between the original and summary graphs from a spectral perspective, and we perform extensive experiments on multiple real-world datasets. The results show that DPGS yields compact representation that preserves the essential properties of the original graph.