Paper Title
Dynamic Bidding Strategies with Multivariate Feedback Control for Multiple Goals in Display Advertising
Paper Authors
Abstract
Real-Time Bidding (RTB) display advertising is a method for purchasing display advertising inventory in auctions that occur within milliseconds. The performance of RTB campaigns is generally measured with a series of Key Performance Indicators (KPIs) - measurements used to ensure that the campaign is cost-effective and that it is purchasing valuable inventory. While an RTB campaign should ideally meet all KPIs, simultaneous improvement tends to be very challenging, as an improvement to any one KPI risks a detrimental effect on the others. Here we present an approach to simultaneously controlling multiple KPIs with a PID-based feedback-control system. This method generates a control score for each KPI, based on both the output of a PID controller module and a metric that quantifies the importance of each KPI for internal business needs. At regular intervals, this algorithm - Sequential Control - chooses the KPI with the greatest overall need for improvement. In this way, our algorithm is able to continually seek the greatest marginal improvements to its current state. Multiple methods of control can be associated with each KPI, and can be either triggered simultaneously or chosen stochastically, in order to avoid local optima. In both offline ad bidding simulations and testing on live traffic, our methods proved to be effective in simultaneously controlling multiple KPIs and bringing them toward their respective goals.
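The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the PID output and the importance metric are combined, so the product used here is an assumption, as are the relative-error definition, the gains, and every name (`PID`, `KPI`, `control_score`, `select_kpi`).

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class PID:
    """Textbook discrete PID controller (gains are illustrative assumptions)."""
    kp: float
    ki: float
    kd: float
    integral: float = 0.0
    prev_error: Optional[float] = None

    def update(self, error: float) -> float:
        # Accumulate the integral term and take a one-step difference
        # for the derivative term (zero on the first call).
        self.integral += error
        derivative = 0.0 if self.prev_error is None else error - self.prev_error
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


@dataclass
class KPI:
    """One campaign KPI with its goal, business weight, and own controller."""
    name: str
    target: float
    importance: float        # hypothetical business-importance weight
    higher_is_better: bool   # True for e.g. CTR, False for e.g. CPC
    pid: PID

    def control_score(self, observed: float) -> float:
        # Relative shortfall: positive when the KPI is missing its goal.
        gap = (self.target - observed) / self.target
        if not self.higher_is_better:
            gap = -gap
        # Assumption: the score is the PID output scaled by importance.
        return self.importance * self.pid.update(gap)


def select_kpi(kpis: List[KPI],
               observed: Dict[str, float]) -> Tuple[str, Dict[str, float]]:
    """Return the KPI with the largest control score, plus all scores."""
    scores = {k.name: k.control_score(observed[k.name]) for k in kpis}
    return max(scores, key=scores.get), scores


# Example interval: CTR is at half its goal, CPC is 10% over its goal.
kpis = [
    KPI("ctr", target=0.002, importance=1.0, higher_is_better=True,
        pid=PID(kp=1.0, ki=0.0, kd=0.0)),
    KPI("cpc", target=2.0, importance=2.0, higher_is_better=False,
        pid=PID(kp=1.0, ki=0.0, kd=0.0)),
]
chosen, scores = select_kpi(kpis, {"ctr": 0.001, "cpc": 2.2})
```

In a running system this selection would be repeated at each control interval. The chosen KPI would then trigger one or more of its associated control methods, possibly picked at random (for example with `random.choice`) as the abstract suggests for avoiding local optima.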