论文标题
带有相似边缘的挖掘密集子图
Mining Dense Subgraphs with Similar Edges
论文作者
论文摘要
在图中搜索有趣的结构时,不仅要考虑图形连接性,还要考虑可用的元数据,例如节点和边缘标签或时间信息。在本文中,我们对使用这种元数据来定义边缘之间的相似性的设置感兴趣。我们考虑找到密度且边缘相对于给定相似性函数彼此相似的子图的问题。根据应用程序,此功能可以是例如边缘标签集之间的JACCARD相似性,或者在时间图中的边缘出现的时间相关性。我们制定了基于拉格朗日放松的优化问题,以搜索具有高成对边缘相似性的密集子图。我们设计了一种新颖的算法来通过参数mincut解决问题,并提供有效的搜索方案,以通过拉格朗日乘数的值进行迭代。我们的研究通过对现实世界数据集的评估进行了补充,该评估证明了拟议方法的有用性和效率。
When searching for interesting structures in graphs, it is often important to take into account not only the graph connectivity, but also the metadata available, such as node and edge labels, or temporal information. In this paper we are interested in settings where such metadata is used to define a similarity between edges. We consider the problem of finding subgraphs that are dense and whose edges are similar to each other with respect to a given similarity function. Depending on the application, this function can be, for example, the Jaccard similarity between the edge label sets, or the temporal correlation of the edge occurrences in a temporal graph. We formulate a Lagrangian relaxation-based optimization problem to search for dense subgraphs with high pairwise edge similarity. We design a novel algorithm to solve the problem through parametric MinCut, and provide an efficient search scheme to iterate through the values of the Lagrangian multipliers. Our study is complemented by an evaluation on real-world datasets, which demonstrates the usefulness and efficiency of the proposed approach.