将先验知识集成到混合计划中的社交网络集群中

论文标题

将先验知识集成到混合计划中的社交网络集群中

Integrating Prior Knowledge in Mixed Initiative Social Network Clustering

论文作者

Pister, Alexis, Buono, Paolo, Fekete, Jean-Daniel, Plaisant, Catherine, Valdivia, Paola

论文摘要

我们提出了一种新方法（称为PK群集），以帮助社会科学家在社交网络中创建有意义的集群。存在许多聚类算法，但大多数社会科学家都发现它们难以理解，工具没有提供任何选择算法的指导，或者考虑了科学家的先验知识，可以评估结果。我们的工作介绍了一种新的聚类方法和一个可视化该问题的视觉分析用户界面。 It is based on a process that 1) captures the prior knowledge of the scientists as a set of incomplete clusters, 2) runs multiple clustering algorithms (similarly to clustering ensemble methods), 3) visualizes the results of all the algorithms ranked and summarized by how well each algorithm matches the prior knowledge, 4) evaluates the consensus between user-selected algorithms, and 5) allows users to review details并迭代地更新获得的知识。我们使用初始功能原型描述了我们的方法，然后提供了两个使用的示例和社会科学家的早期反馈。我们认为，我们的聚类方法为迭代构建知识提供了一种新颖的建设性方法，同时避免了经常被随机选择的黑盒聚类算法的结果过度影响。

We propose a new approach -- called PK-clustering -- to help social scientists create meaningful clusters in social networks. Many clustering algorithms exist but most social scientists find them difficult to understand, and tools do not provide any guidance to choose algorithms, or to evaluate results taking into account the prior knowledge of the scientists. Our work introduces a new clustering approach and a visual analytics user interface that address this issue. It is based on a process that 1) captures the prior knowledge of the scientists as a set of incomplete clusters, 2) runs multiple clustering algorithms (similarly to clustering ensemble methods), 3) visualizes the results of all the algorithms ranked and summarized by how well each algorithm matches the prior knowledge, 4) evaluates the consensus between user-selected algorithms, and 5) allows users to review details and iteratively update the acquired knowledge. We describe our approach using an initial functional prototype, then provide two examples of use and early feedback from social scientists. We believe our clustering approach offers a novel constructive method to iteratively build knowledge while avoiding being overly influenced by the results of often randomly selected black-box clustering algorithms.

下载PDF全文

下载文献需遵守相关版权规定

论文标题