论文标题

异质图

Heterogeneous Graphlets

论文作者

Rossi, Ryan A., Ahmed, Nesreen K., Carranza, Aldo, Arbour, David, Rao, Anup, Kim, Sungchul, Koh, Eunyee

论文摘要

在本文中,我们将图形的概括引入了称为类型Graphlets的异质网络。在非正式的情况下,键入的图形是小型诱导子图。键入的Graphlets将图形概括为丰富的异质网络,因为它们明确捕获了此类网络中的高阶键入连接模式。为了解决此问题,我们描述了计算此类类型图形的出现的一般框架。所提出的算法利用了许多类型图形的多种组合关系。对于每个边缘,我们对一些键入的图形进行计数,并且通过这些计数以及组合关系,我们在O(1)恒定时间中获得了其他键入的图形的确切计数。值得注意的是,所提出的方法的最差时间复杂性与最著名的未型算法的时间复杂性相匹配。另外,该方法将自己带入有效的无锁和异步并行实现。虽然没有现有的类型图形方法,但仍有一些工作重点是计算一个称为彩色graphlet的不同且更简单的概念。该实验证实,我们提出的方法是比计算彩色图形更简单概念的方法快,更高的阶数。与这些在小型网络上花费数小时的方法不同,所提出的方法仅需几秒钟就具有数百万边缘的大型网络。值得注意的是,由于键入的Graphlet比彩色graphlet(和未构图的graphlets)更通用,因此可以将各种键入图形的计数组合在一起,以获得更简单的彩色图形概念的计数。提出的方法为打字图形提供了新的机会和应用。

In this paper, we introduce a generalization of graphlets to heterogeneous networks called typed graphlets. Informally, typed graphlets are small typed induced subgraphs. Typed graphlets generalize graphlets to rich heterogeneous networks as they explicitly capture the higher-order typed connectivity patterns in such networks. To address this problem, we describe a general framework for counting the occurrences of such typed graphlets. The proposed algorithms leverage a number of combinatorial relationships for different typed graphlets. For each edge, we count a few typed graphlets, and with these counts along with the combinatorial relationships, we obtain the exact counts of the other typed graphlets in o(1) constant time. Notably, the worst-case time complexity of the proposed approach matches the time complexity of the best known untyped algorithm. In addition, the approach lends itself to an efficient lock-free and asynchronous parallel implementation. While there are no existing methods for typed graphlets, there has been some work that focused on computing a different and much simpler notion called colored graphlet. The experiments confirm that our proposed approach is orders of magnitude faster and more space-efficient than methods for computing the simpler notion of colored graphlet. Unlike these methods that take hours on small networks, the proposed approach takes only seconds on large networks with millions of edges. Notably, since typed graphlet is more general than colored graphlet (and untyped graphlets), the counts of various typed graphlets can be combined to obtain the counts of the much simpler notion of colored graphlets. The proposed methods give rise to new opportunities and applications for typed graphlets.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源