论文标题
学习积极学习:一种强大的方法
Learning to Actively Learn: A Robust Approach
论文作者
论文摘要
这项工作提出了一个程序,用于设计针对特定自适应数据收集任务(如主动学习和纯探索多军匪徒)的算法。与依赖于度量集中和仔细分析以证明程序的正确性和样本复杂性的传统自适应算法的设计不同,我们的自适应算法是通过对等效性问题从信息理论下限得出的对等效性类别的对等效性类别中学到的。特别是,学到了一种自适应学习算法,该算法与每个等效类别学习的最佳自适应算法竞争。我们的过程仅作为输入仅提供的可用查询,一组假设,损失功能和总查询预算。这与现有的元学习工作形成鲜明对比的是,相对于明确的,用户定义的子集或先验分布而在问题上挑战并与在测试时遇到的实例不匹配的问题上学习了自适应算法。当总查询预算很小时,这项工作尤其集中在政权上,例如几十个,这比理论上得出的算法通常考虑的预算要小得多。我们执行合成实验,以证明培训程序的稳定性和有效性是合理的,然后评估来自真实数据的任务的方法,包括嘈杂的20个问题游戏和笑话建议任务。
This work proposes a procedure for designing algorithms for specific adaptive data collection tasks like active learning and pure-exploration multi-armed bandits. Unlike the design of traditional adaptive algorithms that rely on concentration of measure and careful analysis to justify the correctness and sample complexity of the procedure, our adaptive algorithm is learned via adversarial training over equivalence classes of problems derived from information theoretic lower bounds. In particular, a single adaptive learning algorithm is learned that competes with the best adaptive algorithm learned for each equivalence class. Our procedure takes as input just the available queries, set of hypotheses, loss function, and total query budget. This is in contrast to existing meta-learning work that learns an adaptive algorithm relative to an explicit, user-defined subset or prior distribution over problems which can be challenging to define and be mismatched to the instance encountered at test time. This work is particularly focused on the regime when the total query budget is very small, such as a few dozen, which is much smaller than those budgets typically considered by theoretically derived algorithms. We perform synthetic experiments to justify the stability and effectiveness of the training procedure, and then evaluate the method on tasks derived from real data including a noisy 20 Questions game and a joke recommendation task.