论文标题
在线学习需求最大公平
Online Learning Demands in Max-min Fairness
论文作者
论文摘要
我们以高效,公平和防止策略的方式描述了在多个用户中分配稀缺资源的机制,但是当用户不知道其资源需求时。重复多个回合的机制,用户的要求在每个回合中可能会发生变化。在每回合结束时,用户提供有关收到的分配的反馈,使该机制可以随着时间的推移学习用户偏好。这种情况在组织中许多用户的计算群集的共同用法中很常见,在这些用户中,所有团队可能并不确切地知道执行工作所需的资源量。通过低估他们的要求,用户收到的收益将比需要的要少,因此无法实现他们的目标。通过夸大其词,它们可能会消除对组织中其他人可能有用的宝贵资源。我们通过适用于此环境的效率,公平性和防止策略的概念在公平划分的在线学习任务正式化,并在三种反馈类型的情况下研究此问题:当用户的观察确定性时,当它们是随机主义时,并且遵循参数模型,并且当它们是随机性和非参数时。我们得出受到实现这些必需品的经典最大公平程序启发的机制,并量化了通过渐近率实现的程度。我们通过对合成问题和网络服务任务的实验评估来证实这些见解。
We describe mechanisms for the allocation of a scarce resource among multiple users in a way that is efficient, fair, and strategy-proof, but when users do not know their resource requirements. The mechanism is repeated for multiple rounds and a user's requirements can change on each round. At the end of each round, users provide feedback about the allocation they received, enabling the mechanism to learn user preferences over time. Such situations are common in the shared usage of a compute cluster among many users in an organisation, where all teams may not precisely know the amount of resources needed to execute their jobs. By understating their requirements, users will receive less than they need and consequently not achieve their goals. By overstating them, they may siphon away precious resources that could be useful to others in the organisation. We formalise this task of online learning in fair division via notions of efficiency, fairness, and strategy-proofness applicable to this setting, and study this problem under three types of feedback: when the users' observations are deterministic, when they are stochastic and follow a parametric model, and when they are stochastic and nonparametric. We derive mechanisms inspired by the classical max-min fairness procedure that achieve these requisites, and quantify the extent to which they are achieved via asymptotic rates. We corroborate these insights with an experimental evaluation on synthetic problems and a web-serving task.