论文标题
优化器的主要措施
Majorizing Measures for the Optimizer
论文作者
论文摘要
由Fernique,Talagrand和其他许多人广泛发展的主要措施理论提供了控制随机过程行为的最通用框架之一。特别是,它可以应用于在预期的上超出的定量界限和许多过程的样品路径的连续性程度。 该理论的最高成就之一是Talagrand在主要措施方面对高斯过程的最高表征进行了紧密的替代表征。该定理的证据很困难,因此,付出了巨大的努力,即制定较短且易于理解的证据。造成这种困难的一个主要原因被认为是使措施本身的主要理论,而措施本身具有不透明和神秘的声誉。结果,该理论的最新处理(包括塔拉格兰本人)避免使用主要措施来使用纯粹的组合方法(通用链接),其中基于分区序列的对象提供了与所需的预期较高的上限相匹配的上限和下限。 在本文中,我们返回将措施作为研究的主要对象,并给出一个我们认为是自然和从优化的角度澄清的观点。作为我们的主要贡献,我们为基于两个部分的主要措施定理提供了算法证明:(1)我们使简单(但显然是新的)观察结果,即找到最佳的主要大型措施可以作为凸面程序施放。这还允许使用凸优化的现成方法有效地计算该度量。 (2)我们通过一系列步骤将基于树的上限和下限证书获得了该凸面程序的原始和双重解决方案。 [...]
The theory of majorizing measures, extensively developed by Fernique, Talagrand and many others, provides one of the most general frameworks for controlling the behavior of stochastic processes. In particular, it can be applied to derive quantitative bounds on the expected suprema and the degree of continuity of sample paths for many processes. One of the crowning achievements of the theory is Talagrand's tight alternative characterization of the suprema of Gaussian processes in terms of majorizing measures. The proof of this theorem was difficult, and thus considerable effort was put into the task of developing both shorter and easier to understand proofs. A major reason for this difficulty was considered to be theory of majorizing measures itself, which had the reputation of being opaque and mysterious. As a consequence, most recent treatments of the theory (including by Talagrand himself) have eschewed the use of majorizing measures in favor of a purely combinatorial approach (the generic chaining) where objects based on sequences of partitions provide roughly matching upper and lower bounds on the desired expected supremum. In this paper, we return to majorizing measures as a primary object of study, and give a viewpoint that we think is natural and clarifying from an optimization perspective. As our main contribution, we give an algorithmic proof of the majorizing measures theorem based on two parts: (1) We make the simple (but apparently new) observation that finding the best majorizing measure can be cast as a convex program. This also allows for efficiently computing the measure using off-the-shelf methods from convex optimization. (2) We obtain tree-based upper and lower bound certificates by rounding, in a series of steps, the primal and dual solutions to this convex program. [...]