竞争AI：竞争反馈如何影响机器学习？

论文标题

竞争AI：竞争反馈如何影响机器学习？

Competing AI: How does competition feedback affect machine learning?

论文作者

Ginart, Antonio, Zhang, Eva, Kwon, Yongchan, Zou, James

论文摘要

本论文研究竞争如何影响机器学习（ML）预测因素。随着ML变得越来越普遍，公司通常会部署它以竞争客户。例如，像Yelp这样的数字平台使用ML来预测用户偏好并提出建议。用户通常会查询的服务，也许是因为它更准确地预测用户的偏好，也更有可能获得其他用户数据（例如，以Yelp审查的形式）。因此，竞争性预测因素会导致反馈循环，从而导致预测器的性能会影响其收到的训练数据，并随着时间的推移而偏见其预测。我们引入了竞争ML预测因子的灵活模型，该模型既可以快速实验和理论障碍。我们通过经验和数学分析表明，竞争会导致预测因素专门针对特定的亚人群，而在整个人群中的表现较差。我们进一步分析了预测器专业化对用户总体预测质量的影响。我们表明，在市场中拥有太少或太多的竞争预测因子可能会损害总体预测质量。使用流行的学习算法，例如神经网络和最近的邻居方法，在几个真实数据集上进行了实验来补充我们的理论。

This papers studies how competition affects machine learning (ML) predictors. As ML becomes more ubiquitous, it is often deployed by companies to compete over customers. For example, digital platforms like Yelp use ML to predict user preference and make recommendations. A service that is more often queried by users, perhaps because it more accurately anticipates user preferences, is also more likely to obtain additional user data (e.g. in the form of a Yelp review). Thus, competing predictors cause feedback loops whereby a predictor's performance impacts what training data it receives and biases its predictions over time. We introduce a flexible model of competing ML predictors that enables both rapid experimentation and theoretical tractability. We show with empirical and mathematical analysis that competition causes predictors to specialize for specific sub-populations at the cost of worse performance over the general population. We further analyze the impact of predictor specialization on the overall prediction quality experienced by users. We show that having too few or too many competing predictors in a market can hurt the overall prediction quality. Our theory is complemented by experiments on several real datasets using popular learning algorithms, such as neural networks and nearest neighbor methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题