论文标题
PNKH-B:一种预计的牛顿 - 克里洛夫(Newton-Krylov)方法,用于大规模约束优化
PNKH-B: A Projected Newton-Krylov Method for Large-Scale Bound-Constrained Optimization
论文作者
论文摘要
我们提出了PNKH-B,这是一种预测的Newton-Krylov方法,用于迭代解决具有约束约束的大规模优化问题。 PNKH-B针对功能和梯度评估昂贵的情况,而(近似)Hessian仅通过矩阵矢量产品可用。在大规模参数估计,机器学习和图像处理中通常情况是这种情况。在每次迭代中,PNKH-B都使用(近似)Hessian的低级别近似来确定搜索方向并构建投影线搜索中使用的度量。度量标准的关键特征是它与Krylov子空间上Hessian的低级别近似的一致性。这使PNKH-B类似于预测的可变度量方法。我们提出了一种有效解决二次投影问题的内点方法。由于内点方法有效利用了低级别的结构,因此其计算成本仅相对于变量数量线性缩放,并且仅增加了可忽略的计算时间。我们还尝试了PNKH-B的变体,该变体将活性集的估计值纳入Hessian近似值中。我们证明了在标准假设下的全球融合到固定点。使用三个由参数估计,机器学习和图像重建动机的数值实验,我们表明,在PNKH-B中使用Hessian指标的一致使用会导致快速收敛,尤其是在前几次迭代中。我们在https://github.com/emorymlip/pnkh-b上提供MATLAB实施。
We present PNKH-B, a projected Newton-Krylov method for iteratively solving large-scale optimization problems with bound constraints. PNKH-B is geared toward situations in which function and gradient evaluations are expensive, and the (approximate) Hessian is only available through matrix-vector products. This is commonly the case in large-scale parameter estimation, machine learning, and image processing. In each iteration, PNKH-B uses a low-rank approximation of the (approximate) Hessian to determine the search direction and construct the metric used in a projected line search. The key feature of the metric is its consistency with the low-rank approximation of the Hessian on the Krylov subspace. This renders PNKH-B similar to a projected variable metric method. We present an interior point method to solve the quadratic projection problem efficiently. Since the interior point method effectively exploits the low-rank structure, its computational cost only scales linearly with respect to the number of variables, and it only adds negligible computational time. We also experiment with variants of PNKH-B that incorporate estimates of the active set into the Hessian approximation. We prove the global convergence to a stationary point under standard assumptions. Using three numerical experiments motivated by parameter estimation, machine learning, and image reconstruction, we show that the consistent use of the Hessian metric in PNKH-B leads to fast convergence, particularly in the first few iterations. We provide our MATLAB implementation at https://github.com/EmoryMLIP/PNKH-B.