论文标题
将密度功能理论放在机器学习加速材料中的测试中发现
Putting Density Functional Theory to the Test in Machine-Learning-Accelerated Materials Discovery
论文作者
论文摘要
随着机器学习(ML)的加速发现,已经开始提供克服计算材料设计的组合挑战所需的效率进步。然而,ML加速的发现都继承了从密度功能理论(DFT)得出的训练数据的偏见,并导致许多尝试失败的尝试计算。许多引人入胜的功能材料和催化过程涉及到敞开的壳过渡金属中心的应变化学键,开壳自由基和圆锥键或金属有机键。尽管有希望的目标,但这些材料对电子结构方法和共同发现的挑战带来了独特的挑战。从这个角度来看,我们描述了准确性,效率和方法所需的进步,超出了传统的基于DFT的ML工作流程中所需的进步。这些挑战已开始通过训练有素的ML模型来解决多种方法的结果或它们之间的差异,从而实现了定量灵敏度分析。为了使DFT在高通量屏幕中对给定数据点受到信任,它必须通过一系列测试。预测计算成功并检测强相关性存在的可能性的ML模型将实现快速诊断和适应策略。这些“决策引擎”代表了迈向自主工作流的第一步,避免需要专家确定基于DFT的材料发现的鲁棒性。
Accelerated discovery with machine learning (ML) has begun to provide the advances in efficiency needed to overcome the combinatorial challenge of computational materials design. Nevertheless, ML-accelerated discovery both inherits the biases of training data derived from density functional theory (DFT) and leads to many attempted calculations that are doomed to fail. Many compelling functional materials and catalytic processes involve strained chemical bonds, open-shell radicals and diradicals, or metal-organic bonds to open-shell transition-metal centers. Although promising targets, these materials present unique challenges for electronic structure methods and combinatorial challenges for their discovery. In this Perspective, we describe the advances needed in accuracy, efficiency, and approach beyond what is typical in conventional DFT-based ML workflows. These challenges have begun to be addressed through ML models trained to predict the results of multiple methods or the differences between them, enabling quantitative sensitivity analysis. For DFT to be trusted for a given data point in a high-throughput screen, it must pass a series of tests. ML models that predict the likelihood of calculation success and detect the presence of strong correlation will enable rapid diagnoses and adaptation strategies. These "decision engines" represent the first steps toward autonomous workflows that avoid the need for expert determination of the robustness of DFT-based materials discoveries.