论文标题

大规模产品图构建用于电子商务推荐

Large Scale Product Graph Construction for Recommendation in E-commerce

论文作者

Yang, Xiaoyong, Zhu, Yadong, Zhang, Yi, Wang, Xiaobo, Yuan, Quan

论文摘要

建立一个每天为数十亿用户服务的推荐系统是一个具有挑战性的问题,因为该系统需要根据O(1)时间复杂性的实时用户行为来对每秒进行天文数量的预测。这种大规模推荐系统通常很大程度上依赖于产品的预先索引来加速推荐服务,因此在线用户等待时间是不可能的。一个重要的索引结构是产品产品指数,其中可以检索给定种子产品的排名列表。该索引可以看作是加权产品产品图。在本文中,我们介绍了新型技术,以有效地构建这种索引产品图。特别是,我们建议使用秋千算法来捕获产品之间的替代关系,该算法可以利用用户项目的单击Bi-Partistive图。然后,我们提出了用于建模互补产品关系的惊喜算法,该算法利用产品类别信息并通过聚类技术解决了用户共购买图的稀疏问题。基于这两种方法,我们可以构建淘宝推荐的基本产品图。通过离线和在线实验对这些方法进行了全面评估,结果证明了工作的有效性和效率。

Building a recommendation system that serves billions of users on daily basis is a challenging problem, as the system needs to make astronomical number of predictions per second based on real-time user behaviors with O(1) time complexity. Such kind of large scale recommendation systems usually rely heavily on pre-built index of products to speedup the recommendation service so that online user waiting time is un-noticeable. One important indexing structure is the product-product index, where one can retrieval a list of ranked products given a seed product. The index can be viewed as a weighted product-product graph. In this paper, we present our novel technologies to efficiently build such kind of indexed product graphs. In particular, we propose the Swing algorithm to capture the substitute relationships between products, which can utilize the substructures of user-item click bi-partitive graph. Then we propose the Surprise algorithm for the modeling of complementary product relationships, which utilizes product category information and solves the sparsity problem of user co-purchasing graph via clustering technique. Base on these two approaches, we can build the basis product graph for recommendation in Taobao. The approaches are evaluated comprehensively with both offline and online experiments, and the results demonstrate the effectiveness and efficiency of the work.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源