出处图内核

论文标题

Provenance Graph Kernel

论文作者

Marzagão, David Kohan, Huynh, Trung Dong, Helal, Ayah, Baccas, Sean, Moreau, Luc

论文摘要

出处是描述实体，活动和代理商如何影响数据的记录；它通常表示为具有相关标签的图形，并在其节点和边缘上都表示。随着在广泛的应用程序域中的出处采用越来越多，用户越来越多地面临着大量的图形数据，这可能会挑战处理。另一方面，图形内核已成功地用于有效分析图。在本文中，我们介绍了一个名为permantagnion内核的新颖图形内核，该内核的灵感来自于出处数据和量身定制。它将出处图分解为植根于给定节点的树状图案，并将边缘和节点的标签视为距离根的一定距离。我们采用出处内核来对三个应用程序域的出处进行分类。我们的评估表明，与现有图形内核方法和出处网络分析方法相比，它们在分类准确性方面的表现很好，并且在计算时间更有效时产生了竞争结果。此外，出处内核使用的出处类型还有助于提高构建其构建的预测模型的解释性。

Provenance is a record that describes how entities, activities, and agents have influenced a piece of data; it is commonly represented as graphs with relevant labels on both their nodes and edges. With the growing adoption of provenance in a wide range of application domains, users are increasingly confronted with an abundance of graph data, which may prove challenging to process. Graph kernels, on the other hand, have been successfully used to efficiently analyse graphs. In this paper, we introduce a novel graph kernel called provenance kernel, which is inspired by and tailored for provenance data. It decomposes a provenance graph into tree-patterns rooted at a given node and considers the labels of edges and nodes up to a certain distance from the root. We employ provenance kernels to classify provenance graphs from three application domains. Our evaluation shows that they perform well in terms of classification accuracy and yield competitive results when compared against existing graph kernel methods and the provenance network analytics method while more efficient in computing time. Moreover, the provenance types used by provenance kernels also help improve the explainability of predictive models built on them.

下载PDF全文

下载文献需遵守相关版权规定

论文标题