论文标题
迈向多语言食谱个性化和建议
Towards Multi-Language Recipe Personalisation and Recommendation
论文作者
论文摘要
多语言配方个性化和建议是学术和生产系统中信息检索的探索领域。我们目前理解中的现有差距也很多,即使在基本问题上,例如是否可以跨语言提供一致和高质量的食谱建议。在本文中,我们介绍了多语言配方建议设置和目前的基础结果,这将有助于确定该领域未来工作的潜在和绝对价值。我们的作品借鉴了数百万种食谱和阿拉伯语,英语,印尼,俄语和西班牙语的几项活动。我们代表食谱,结合了归一化成分,标准化技能和图像嵌入的组合,而无需人工干预。在建模中,我们采用了一种基于优化嵌入式双线性用户项目度量空间的经典方法,以最大程度地引起烹饪意图的相互作用。对于没有交互历史的用户,引入了基于内容和食谱亲和力的定制基于内容的冷启动模型。我们表明,我们的个性化方法是稳定的,很容易扩展到新语言。采用了强大的交叉验证运动,并始终拒绝基线模型和表征,这非常喜欢我们提出的那些。我们的结果以面向语言的(而不是以模型为导向的)方式提出,以强调这项工作的基于语言的目标。我们认为,这是第一个全面考虑多语言配方建议和个性化以及提供可扩展可靠模型的价值和潜力的大型工作。
Multi-language recipe personalisation and recommendation is an under-explored field of information retrieval in academic and production systems. The existing gaps in our current understanding are numerous, even on fundamental questions such as whether consistent and high-quality recipe recommendation can be delivered across languages. In this paper, we introduce the multi-language recipe recommendation setting and present grounding results that will help to establish the potential and absolute value of future work in this area. Our work draws on several billion events from millions of recipes and users from Arabic, English, Indonesian, Russian, and Spanish. We represent recipes using a combination of normalised ingredients, standardised skills and image embeddings obtained without human intervention. In modelling, we take a classical approach based on optimising an embedded bi-linear user-item metric space towards the interactions that most strongly elicit cooking intent. For users without interaction histories, a bespoke content-based cold-start model that predicts context and recipe affinity is introduced. We show that our approach to personalisation is stable and easily scales to new languages. A robust cross-validation campaign is employed and consistently rejects baseline models and representations, strongly favouring those we propose. Our results are presented in a language-oriented (as opposed to model-oriented) fashion to emphasise the language-based goals of this work. We believe that this is the first large-scale work that comprehensively considers the value and potential of multi-language recipe recommendation and personalisation as well as delivering scalable and reliable models.