论文标题
人工神经网络的伪造主义叙述
A Falsificationist Account of Artificial Neural Networks
论文作者
论文摘要
机器学习在统计与计算机科学的交集处运行。这就提出了有关其基本方法的问题。尽管从数据学习和归纳过程之间的学习过程之间的紧密联系中,机器学习的伪造主义组成部分受到了较小的关注。在本文中,我们认为伪造的想法对于机器学习方法论至关重要。人们普遍认为,机器学习算法从过去的观察结果中推断出一般的预测规则。这类似于从数据样本中获得估计的统计程序。但是机器学习算法也可以描述为从整个功能选择一个预测规则。特别是,确定人工神经网络的权重的算法是通过经验风险最小化而运作的,并且拒绝缺乏经验充足性的预测规则。它还表现出隐性正则化的行为,将假设选择推向了更简单的预测规则。我们认为,将这两个方面共同构成了人工神经网络的伪造主义描述。
Machine learning operates at the intersection of statistics and computer science. This raises the question as to its underlying methodology. While much emphasis has been put on the close link between the process of learning from data and induction, the falsificationist component of machine learning has received minor attention. In this paper, we argue that the idea of falsification is central to the methodology of machine learning. It is commonly thought that machine learning algorithms infer general prediction rules from past observations. This is akin to a statistical procedure by which estimates are obtained from a sample of data. But machine learning algorithms can also be described as choosing one prediction rule from an entire class of functions. In particular, the algorithm that determines the weights of an artificial neural network operates by empirical risk minimization and rejects prediction rules that lack empirical adequacy. It also exhibits a behavior of implicit regularization that pushes hypothesis choice toward simpler prediction rules. We argue that taking both aspects together gives rise to a falsificationist account of artificial neural networks.