论文标题
学习搜索查询的颜色表示
Learning Colour Representations of Search Queries
论文作者
论文摘要
图像搜索引擎依靠适当设计的排名功能,这些功能捕获了内容语义的各个方面以及历史上的受欢迎程度。在这项工作中,我们考虑了颜色在相关性匹配过程中的作用。我们的工作是通过观察到的,即大量用户查询具有与之相关的固有颜色。虽然一些查询包含明确的颜色提及(例如“黑色汽车”和“黄色雏菊”),但其他查询具有隐性的颜色概念(例如“天空”和“草”)。此外,颜色的接地查询不是单一颜色的映射,而是颜色空间中的分布。例如,对“树”的搜索往往在绿色和棕色的颜色周围具有双峰分布。我们利用历史点击数据来生成搜索查询的颜色表示形式,并提出一个经常性的神经网络体系结构,以将看不见的查询编码到颜色空间中。我们还展示了如何从单击结果图像的子集中的印象日志中与跨模式相关性排名一起学习这种嵌入。我们证明,使用查询图像颜色距离功能可以改善排名绩效,这是通过用户的clicked and lacked images的偏好来衡量的。
Image search engines rely on appropriately designed ranking features that capture various aspects of the content semantics as well as the historic popularity. In this work, we consider the role of colour in this relevance matching process. Our work is motivated by the observation that a significant fraction of user queries have an inherent colour associated with them. While some queries contain explicit colour mentions (such as 'black car' and 'yellow daisies'), other queries have implicit notions of colour (such as 'sky' and 'grass'). Furthermore, grounding queries in colour is not a mapping to a single colour, but a distribution in colour space. For instance, a search for 'trees' tends to have a bimodal distribution around the colours green and brown. We leverage historical clickthrough data to produce a colour representation for search queries and propose a recurrent neural network architecture to encode unseen queries into colour space. We also show how this embedding can be learnt alongside a cross-modal relevance ranker from impression logs where a subset of the result images were clicked. We demonstrate that the use of a query-image colour distance feature leads to an improvement in the ranker performance as measured by users' preferences of clicked versus skipped images.