论文标题
光谱聚类与光滑的小簇
Spectral Clustering with Smooth Tiny Clusters
论文作者
论文摘要
光谱聚类是最突出的聚类方法之一。基于距离的相似性是光谱聚类最广泛使用的方法。但是,人们已经注意到,这不适合多尺度数据,因为距离对于具有不同密度的簇有很大差异。最新技术(ROSC和CAST)通过考虑对象的可及性相似性来解决这一限制。但是,我们观察到,在现实情况下,同一集群中的数据倾向于以平滑的方式显示,而以前的算法永远不会考虑到这一点。基于此观察结果,我们提出了一种新型的聚类算法,该算法首次介绍了数据的平滑度。我们首先将物体分为许多小簇。我们的关键想法是将小簇群集群集,其中心构成平滑的图。理论分析和实验结果表明,我们的聚类算法显着超过了艺术的状态。尽管在本文中,我们单独专注于多尺度情况,但数据平滑度的想法肯定可以扩展到任何聚类算法
Spectral clustering is one of the most prominent clustering approaches. The distance-based similarity is the most widely used method for spectral clustering. However, people have already noticed that this is not suitable for multi-scale data, as the distance varies a lot for clusters with different densities. State of the art(ROSC and CAST ) addresses this limitation by taking the reachability similarity of objects into account. However, we observe that in real-world scenarios, data in the same cluster tend to present in a smooth manner, and previous algorithms never take this into account. Based on this observation, we propose a novel clustering algorithm, which con-siders the smoothness of data for the first time. We first divide objects into a great many tiny clusters. Our key idea is to cluster tiny clusters, whose centers constitute smooth graphs. Theoretical analysis and experimental results show that our clustering algorithm significantly outperforms state of the art. Although in this paper, we singly focus on multi-scale situations, the idea of data smoothness can certainly be extended to any clustering algorithms