Paper title

A suite of diagnostic metrics for characterizing selection schemes

Authors

Jose Guadalupe Hernandez, Alexander Lalejini, Charles Ofria

Abstract

Benchmark suites are crucial for assessing the performance of evolutionary algorithms, but the constituent problems are often too complex to provide clear intuition about an algorithm's strengths and weaknesses. To address this gap, we introduce DOSSIER ("Diagnostic Overview of Selection Schemes In Evolutionary Runs"), a diagnostic suite initially composed of eight handcrafted metrics. These metrics are designed to empirically measure specific capacities for exploitation, exploration, and their interactions. We consider exploitation both with and without constraints, and we divide exploration into two aspects: diversity exploration (the ability to simultaneously explore multiple pathways) and valley-crossing exploration (the ability to cross wider and wider fitness valleys). We apply DOSSIER to six popular selection schemes: truncation, tournament, fitness sharing, lexicase, nondominated sorting, and novelty search. Our results confirm that simple schemes (e.g., tournament and truncation) emphasized exploitation. For more sophisticated schemes, however, our diagnostics revealed interesting dynamics. Lexicase selection performed moderately well across all diagnostics that did not incorporate valley crossing, but faltered dramatically whenever valleys were present, performing worse than even random search. Fitness sharing was the only scheme to effectively contend with valley crossing but it struggled with the other diagnostics. Our study highlights the utility of using diagnostics to gain nuanced insights into selection scheme characteristics, which can inform the design of new selection methods.
