论文标题

明智的选择和选择单

Smart Choices and the Selection Monad

论文作者

Abadi, Martin, Plotkin, Gordon

论文摘要

从选择及其产生的成本和奖励方面描述系统提供了释放算法设计师和程序员的承诺。在实施中,可以通过优化技术以及越来越多的机器学习方法来实现选择。我们从编程语言的角度研究这种方法。我们定义两种支持决策摘要的小语言:一种具有选择和奖励,另一种是概率。我们提供操作和典型语义。 在第二语言的情况下,我们考虑了三种表示的语义,在可能的程序值和预期奖励之间存在不同程度的相关性。操作语义将标准构造的通常语义与对可能执行策略的空间进行优化。构成的典型语义依赖于选择单元来处理选择,并用辅助单子增强以处理其他效果,例如奖励或概率。 我们确定了在所有情况下两种语义重合的适当定理。我们还证明了基本类型的完全抽象,在概率的情况下,观察概念变化,与各种相关程度相对应。我们提出了选择的公理与奖励和概率相结合,在没有概率的情况下以奖励的情况下在基本类型上建立完整性。

Describing systems in terms of choices and their resulting costs and rewards offers the promise of freeing algorithm designers and programmers from specifying how those choices should be made; in implementations, the choices can be realized by optimization techniques and, increasingly, by machine-learning methods. We study this approach from a programming-language perspective. We define two small languages that support decision-making abstractions: one with choices and rewards, and the other additionally with probabilities. We give both operational and denotational semantics. In the case of the second language we consider three denotational semantics, with varying degrees of correlation between possible program values and expected rewards. The operational semantics combine the usual semantics of standard constructs with optimization over spaces of possible execution strategies. The denotational semantics, which are compositional, rely on the selection monad, to handle choice, augmented with an auxiliary monad to handle other effects, such as rewards or probability. We establish adequacy theorems that the two semantics coincide in all cases. We also prove full abstraction at base types, with varying notions of observation in the probabilistic case corresponding to the various degrees of correlation. We present axioms for choice combined with rewards and probability, establishing completeness at base types for the case of rewards without probability.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源