论文标题
揭开正交的蒙特卡洛和其他
Demystifying Orthogonal Monte Carlo and Beyond
论文作者
论文摘要
正交蒙特卡洛(OMC)是一种非常有效的采样算法,在样品上施加了结构性几何条件(正交性),以减少方差。由于其简单性和优越的性能,与其准蒙特卡洛对应物相比,OMC用于广泛的机器学习应用中,从可扩展的内核方法到预测性的复发性神经网络,生成模型和增强学习。但是,对该方法的理论理解仍然非常有限。在本文中,我们对OMC背后的理论原理进行了新的启示,并采用了负依赖的随机变量理论来获得几个新的浓度结果。我们还提出了一种杠杆数理论技术和粒子算法的新型扩展,称为近乎正交的蒙特卡洛(NOMC)。我们表明,NOMC是第一个算法在从内核方法到概率度量空间中近似距离的应用中始终优于OMC。
Orthogonal Monte Carlo (OMC) is a very effective sampling algorithm imposing structural geometric conditions (orthogonality) on samples for variance reduction. Due to its simplicity and superior performance as compared to its Quasi Monte Carlo counterparts, OMC is used in a wide spectrum of challenging machine learning applications ranging from scalable kernel methods to predictive recurrent neural networks, generative models and reinforcement learning. However theoretical understanding of the method remains very limited. In this paper we shed new light on the theoretical principles behind OMC, applying theory of negatively dependent random variables to obtain several new concentration results. We also propose a novel extensions of the method leveraging number theory techniques and particle algorithms, called Near-Orthogonal Monte Carlo (NOMC). We show that NOMC is the first algorithm consistently outperforming OMC in applications ranging from kernel methods to approximating distances in probabilistic metric spaces.