论文标题
HRCF:通过双曲几何正规化增强协作过滤
HRCF: Enhancing Collaborative Filtering via Hyperbolic Geometric Regularization
论文作者
论文摘要
在大规模推荐系统中,用户项目网络通常不扩展或成倍扩展。用于描述用户和项目的潜在特征(也称为嵌入)取决于嵌入空间适合数据分布的程度。双曲线空间提供了一个宽敞的房间,可以通过其负曲率和度量特性学习嵌入,可以很好地拟合类似树状结构的数据。最近,已经提出了几种双曲线方法来学习用户和项目的高质量表示。但是,它们中的大多数集中于通过设计适当的投影操作来开发双曲线相似,而未经探索双曲线空间的许多有利且令人兴奋的几何特性。例如,双曲线空间最著名的特性之一是,其容量空间随半径呈指数增长,这表明远离双曲线起源的区域更容易嵌入。关于双曲线空间的几何特性,我们提出了双曲线正规化的协作滤波(HRCF),并设计了几何感知的双曲线正规剂。具体而言,该提案通过根对准和来源感知惩罚来提高优化程序,这很简单却令人印象深刻。通过理论分析,我们进一步表明,我们的建议能够解决由双曲线聚集引起的过度平滑问题,并使模型具有更好的判别能力。我们进行了广泛的经验分析,将我们的建议与几个公共基准的大量基准进行了比较。经验结果表明,我们的方法实现了高度的竞争性能,并通过相当大的利润超过了领先的欧几里得和双曲线基线。
In large-scale recommender systems, the user-item networks are generally scale-free or expand exponentially. The latent features (also known as embeddings) used to describe the user and item are determined by how well the embedding space fits the data distribution. Hyperbolic space offers a spacious room to learn embeddings with its negative curvature and metric properties, which can well fit data with tree-like structures. Recently, several hyperbolic approaches have been proposed to learn high-quality representations for the users and items. However, most of them concentrate on developing the hyperbolic similitude by designing appropriate projection operations, whereas many advantageous and exciting geometric properties of hyperbolic space have not been explicitly explored. For example, one of the most notable properties of hyperbolic space is that its capacity space increases exponentially with the radius, which indicates the area far away from the hyperbolic origin is much more embeddable. Regarding the geometric properties of hyperbolic space, we bring up a Hyperbolic Regularization powered Collaborative Filtering(HRCF) and design a geometric-aware hyperbolic regularizer. Specifically, the proposal boosts optimization procedure via the root alignment and origin-aware penalty, which is simple yet impressively effective. Through theoretical analysis, we further show that our proposal is able to tackle the over-smoothing problem caused by hyperbolic aggregation and also brings the models a better discriminative ability. We conduct extensive empirical analysis, comparing our proposal against a large set of baselines on several public benchmarks. The empirical results show that our approach achieves highly competitive performance and surpasses both the leading Euclidean and hyperbolic baselines by considerable margins.