通过建模视觉内容来提高新广告的CTR预测

论文标题

通过建模视觉内容来提高新广告的CTR预测

Boost CTR Prediction for New Advertisements via Modeling Visual Content

论文作者

Yu, Tan, Jin, Zhipeng, Liu, Jie, Yang, Yi, Fei, Hongliang, Li, Ping

论文摘要

现有的广告点击率（CTR）预测模型主要取决于行为ID功能，这些功能是根据历史用户AD交互所学到的。然而，依靠历史用户行为的行为ID功能是不可行的，不用以前与用户进行交互。为了克服在建模新广告中的行为ID特征的局限性，我们利用广告中的视觉内容来提高CTR预测模型的性能。具体来说，我们根据其视觉内容将每个广告映射到一组视觉ID中。这些视觉ID进一步用于生成可视觉嵌入，以增强CTR预测模型。我们将视觉ID的学习分为有监督的量化问题。由于缺乏广告中商业图像的类标签，我们利用图像文本描述作为监督，以优化图像提取器以生成有效的视觉ID。同时，由于硬量化是不可差异的，因此我们软化量化操作以使其支持端到端网络培训。将每个图像映射到视觉ID中后，我们根据过去积累的历史用户AD交互学习每个视觉ID的嵌入。由于视觉ID嵌入仅取决于视觉内容，因此它概括为新广告。同时，嵌入视觉ID补充了AD行为ID嵌入。因此，它可以大大提高CTR预测模型的性能，该模型以前依赖于积累了丰富用户行为的新广告和广告的行为ID功能。将视觉ID嵌入在BAIDU在线广告的CTR预测模型中后，AD的平均CTR提高了1.46％，总费用增加了1.10％。

Existing advertisements click-through rate (CTR) prediction models are mainly dependent on behavior ID features, which are learned based on the historical user-ad interactions. Nevertheless, behavior ID features relying on historical user behaviors are not feasible to describe new ads without previous interactions with users. To overcome the limitations of behavior ID features in modeling new ads, we exploit the visual content in ads to boost the performance of CTR prediction models. Specifically, we map each ad into a set of visual IDs based on its visual content. These visual IDs are further used for generating the visual embedding for enhancing CTR prediction models. We formulate the learning of visual IDs into a supervised quantization problem. Due to a lack of class labels for commercial images in advertisements, we exploit image textual descriptions as the supervision to optimize the image extractor for generating effective visual IDs. Meanwhile, since the hard quantization is non-differentiable, we soften the quantization operation to make it support the end-to-end network training. After mapping each image into visual IDs, we learn the embedding for each visual ID based on the historical user-ad interactions accumulated in the past. Since the visual ID embedding depends only on the visual content, it generalizes well to new ads. Meanwhile, the visual ID embedding complements the ad behavior ID embedding. Thus, it can considerably boost the performance of the CTR prediction models previously relying on behavior ID features for both new ads and ads that have accumulated rich user behaviors. After incorporating the visual ID embedding in the CTR prediction model of Baidu online advertising, the average CTR of ads improves by 1.46%, and the total charge increases by 1.10%.

下载PDF全文

下载文献需遵守相关版权规定

论文标题