论文标题
有效的图形桁架估计
Efficient Estimation of Graph Trussness
论文作者
论文摘要
$ k $ - truss是一个边缘引起的子图$ h $,因此其每个边缘至少属于$ h $ $ h $的$ k-2 $三角形。这一概念大约在十年前引入了社交网络分析和安全性,这是一种具有三角形且比集团更严格的凝聚力子图的形式。图的\ emph {桁架}是最大$ k $,因此存在$ k $ -truss。 从实际和工程的角度研究了计算$ k $ trusses的问题。另一方面,尽管提出了有趣的挑战,但问题的理论方面受到了较少的关注。现有方法共享一个共同的设计,基于迭代地卸下边缘的最小支撑,其中边缘的支撑是包含它的三角形的数量。 本文的目的是研究图形桁架的算法方面。虽然可以证明计算图形桁架的时间复杂性以及计数/列出所有三角形的时间复杂性本质上是相同的,但我们提供了有效的算法来估算其值,在适当的条件下,其复杂性明显低于确切方法。特别是,我们在包含$ω(m \,polylog(n))$三角形的图表上提供了比确切方法更快的$(1 \pmε)$ - 近似算法,并且在图上没有相同的运行时间。对于后一种情况,我们还表明,与三角形数为$ O(m)$相比,运行时间更快的近似算法是不可能的,除非有众所周知的猜想在三角形柔性和布尔矩阵乘法方面是错误的。
A $k$-truss is an edge-induced subgraph $H$ such that each of its edges belongs to at least $k-2$ triangles of $H$. This notion has been introduced around ten years ago in social network analysis and security, as a form of cohesive subgraph that is rich of triangles and less stringent than the clique. The \emph{trussness} of a graph is the maximum $k$ such that a $k$-truss exists. The problem of computing $k$-trusses has been largely investigated from the practical and engineering point of view. On the other hand, the theoretical side of the problem has received much less attention, despite presenting interesting challenges. The existing methods share a common design, based on iteratively removing the edge with smallest support, where the support of an edge is the number of triangles containing it. The aim of this paper is studying algorithmic aspects of graph trussness. While it is possible to show that the time complexity of computing exactly the graph trussness and that of counting/listing all triangles is inherently the same, we provide efficient algorithms for estimating its value, under suitable conditions, with significantly lower complexity than the exact approach. In particular, we provide a $(1 \pm ε)$-approximation algorithm that is asymptotically faster than the exact approach, on graphs which contain $ω(m \, polylog(n))$ triangles, and has the same running time on graphs that do not. For the latter case, we also show that it is impossible to obtain an approximation algorithm with faster running time than the one of the exact approach when the number of triangles is $O(m)$, unless well known conjectures on triangle-freeness and Boolean matrix multiplication are false.