论文标题
相关图匹配的测量方法的浓度
A Concentration of Measure Approach to Correlated Graph Matching
论文作者
论文摘要
图形匹配问题自然出现在Web隐私,图像处理和计算生物学等各种应用中。在本文中,在随机模型下考虑了图形匹配,其中一对具有成对相关边缘的随机生成的图形匹配,以便给定第一个图中顶点的标记,第二个图中的标签通过在其边缘之间利用相关性来恢复第二个图中的标签。在各种设置和图形模型下考虑了问题。在第一步中,研究了相关的Erdös-rényi(CER)图模型,其中所有边缘对的顶点具有相似的标签,这是基于相同的分布而独立于其他边缘生成的。引入了称为\ textit {典型性匹配方案}的匹配方案。该方案通过研究两个图的邻接矩阵的典型性。关于序列排列的典型性结果的新结果导致基于CER模型的参数成功匹配的必要条件。在下一步中,结果将扩展到基于随机块模型(SBM)生成的社区结构的图表。 SBM模型是CER模型的概括,其中图中的每个顶点都与社区标签相关联,这会影响其边缘统计。结果进一步扩展到了两个以上相关图的集合的匹配。最后,研究了种子图匹配的问题,其中第二个图中的标签子集在匹配之前是已知的。在这种情况下,除了获得成功匹配的必要条件外,还提出了多个匹配算法。
The graph matching problem emerges naturally in various applications such as web privacy, image processing and computational biology. In this paper, graph matching is considered under a stochastic model, where a pair of randomly generated graphs with pairwise correlated edges are to be matched such that given the labeling of the vertices in the first graph, the labels in the second graph are recovered by leveraging the correlation among their edges. The problem is considered under various settings and graph models. In the first step, the Correlated Erdös-Rényi (CER) graph model is studied, where all edge pairs whose vertices have similar labels are generated based on identical distributions and independently of other edges. A matching scheme called the \textit{typicality matching scheme} is introduced. The scheme operates by investigating the joint typicality of the adjacency matrices of the two graphs. New results on the typicality of permutations of sequences lead to necessary and sufficient conditions for successful matching based on the parameters of the CER model. In the next step, the results are extended to graphs with community structure generated based on the Stochastic Block Model (SBM). The SBM model is a generalization of the CER model where each vertex in the graph is associated with a community label, which affects its edge statistics. The results are further extended to matching of ensembles of more than two correlated graphs. Lastly, the problem of seeded graph matching is investigated where a subset of the labels in the second graph are known prior to matching. In this scenario, in addition to obtaining necessary and sufficient conditions for successful matching, a polytime matching algorithm is proposed.