Paper Title
Analytic Mutual Information in Bayesian Neural Networks
Paper Authors
Paper Abstract
Bayesian neural networks have been used successfully to design and optimize robust neural network models in many application problems, including uncertainty quantification. Despite this recent success, the information-theoretic understanding of Bayesian neural networks is still at an early stage. Mutual information is one example of an uncertainty measure in a Bayesian neural network, used to quantify epistemic uncertainty. Still, no analytic formula is known to describe it, even though it is one of the fundamental information measures for understanding the Bayesian deep learning framework. In this paper, we derive an analytic formula for the mutual information between the model parameters and the predictive output by leveraging the notion of point process entropy. Then, as an application, we discuss parameter estimation for the Dirichlet distribution and show its practical use in uncertainty measures for active learning, demonstrating that our analytic formula can further improve active learning performance in practice.
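For context, the mutual information between the model parameters and the predictive output is commonly approximated by sampling: the entropy of the averaged prediction minus the average entropy of the individual predictions. The sketch below illustrates that standard Monte Carlo estimator only; it is not the analytic formula derived in the paper. The function names and the array layout (S parameter samples, N inputs, C classes) are assumptions made for illustration.

```python
import numpy as np


def entropy(p, axis=-1, eps=1e-12):
    """Shannon entropy in nats along `axis`; clipping avoids log(0)."""
    p = np.clip(p, eps, 1.0)
    return -np.sum(p * np.log(p), axis=axis)


def mc_mutual_information(probs):
    """Monte Carlo estimate of the mutual information per input.

    probs: array of shape (S, N, C) holding class probabilities from
           S sampled parameter settings (hypothetical layout), for
           N inputs and C classes.
    Returns an array of shape (N,).
    """
    probs = np.asarray(probs, dtype=float)
    mean_probs = probs.mean(axis=0)                  # (N, C) averaged prediction
    total_uncertainty = entropy(mean_probs)          # entropy of the average
    expected_entropy = entropy(probs).mean(axis=0)   # average of the entropies
    return total_uncertainty - expected_entropy


# Two sampled models that disagree completely on a single binary input.
disagree = np.array([[[1.0, 0.0]], [[0.0, 1.0]]])
# Two sampled models that give identical predictions.
agree = np.array([[[0.7, 0.3]], [[0.7, 0.3]]])

mi_disagree = mc_mutual_information(disagree)
mi_agree = mc_mutual_information(agree)
```

When the sampled models disagree completely, the estimate approaches log 2 (the maximum for two classes); when they agree, it is close to zero. This is why the quantity is read as epistemic rather than aleatoric uncertainty.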