Paper Title

Jensen-Shannon Information Based Characterization of the Generalization Error of Learning Algorithms

Paper Authors

Aminian, Gholamali, Toni, Laura, Rodrigues, Miguel R. D.

Paper Abstract

Generalization error bounds are critical to understanding the performance of machine learning models. In this work, we propose a new information-theoretic generalization error upper bound applicable to supervised learning scenarios. We show that our general bound specializes to various previously known bounds. We also show that, under some conditions, our general bound specializes to a new bound involving the Jensen-Shannon information between a random variable modelling the set of training samples and another random variable modelling the hypothesis. We further prove that, under some conditions, our bound can be tighter than mutual-information-based bounds.
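The abstract contrasts Jensen-Shannon information with the more common mutual information. A minimal sketch (plain NumPy, with illustrative discrete distributions chosen here, not taken from the paper) of the key structural difference: mutual information is built on the KL divergence, which can blow up, whereas the Jensen-Shannon divergence is always finite and bounded by log 2, which is one intuition for why a JS-based bound can be tighter.

```python
import numpy as np

def kl(p, q):
    # Kullback-Leibler divergence D(p || q) for discrete distributions.
    # Terms with p[i] == 0 contribute zero by convention.
    p, q = np.asarray(p, dtype=float), np.asarray(q, dtype=float)
    mask = p > 0
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

def js(p, q):
    # Jensen-Shannon divergence: symmetrized KL against the mixture
    # m = (p + q) / 2; always finite and bounded above by log 2.
    p, q = np.asarray(p, dtype=float), np.asarray(q, dtype=float)
    m = 0.5 * (p + q)
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

# Two nearly disjoint distributions: KL is large, JS stays below log 2.
p = [0.99, 0.01]
q = [0.01, 0.99]
print(kl(p, q), js(p, q), np.log(2))
```

In the information measures the paper works with, mutual information is the KL divergence between the joint distribution of (samples, hypothesis) and the product of its marginals, while the Jensen-Shannon information replaces that KL divergence with the JS divergence; the boundedness illustrated above carries over to the resulting generalization bound.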
