论文标题
在监督机器学习中进行决策的学习曲线:调查
Learning Curves for Decision Making in Supervised Machine Learning: A Survey
论文作者
论文摘要
学习曲线是在机器学习中采用的社会科学的概念,以评估有关某些资源的学习算法的性能,例如培训示例的数量或培训迭代的数量。学习曲线在几种机器学习环境中具有重要的应用,最著名的是在数据获取,模型培训的早期停止和模型选择中。例如,学习曲线可用于建模算法及其超参数配置的组合性能,从而在早期阶段提供对其潜在适合性的见解,并经常加快算法选择过程。已经提出了各种学习曲线模型将学习曲线用于决策。其中一些模型回答了一个二进制决策问题,即在一定预算中给定的算法是否会胜过一定的参考性能,而更复杂的模型可以预测算法的整个学习曲线。我们贡献了一个框架,该框架使用三个标准对学习曲线方法进行分类:他们解决的决策情况,他们回答的内在学习曲线问题以及他们使用的资源类型。我们调查文献中的论文并将其分类为该框架。
Learning curves are a concept from social sciences that has been adopted in the context of machine learning to assess the performance of a learning algorithm with respect to a certain resource, e.g., the number of training examples or the number of training iterations. Learning curves have important applications in several machine learning contexts, most notably in data acquisition, early stopping of model training, and model selection. For instance, learning curves can be used to model the performance of the combination of an algorithm and its hyperparameter configuration, providing insights into their potential suitability at an early stage and often expediting the algorithm selection process. Various learning curve models have been proposed to use learning curves for decision making. Some of these models answer the binary decision question of whether a given algorithm at a certain budget will outperform a certain reference performance, whereas more complex models predict the entire learning curve of an algorithm. We contribute a framework that categorises learning curve approaches using three criteria: the decision-making situation they address, the intrinsic learning curve question they answer and the type of resources they use. We survey papers from the literature and classify them into this framework.