论文标题
通过多阶多构造深网的强大面部地标检测
Robust Facial Landmark Detection by Multi-order Multi-constraint Deep Networks
论文作者
论文摘要
最近,在面部标志性检测中广泛探索了热图回归,并获得了出色的性能。但是,大多数现有的基于热图回归的面部标志性检测方法忽略了探索高阶特征相关性,这对于学习更多代表性特征并增强形状约束非常重要。此外,最终预测的地标尚未添加明确的全球形状约束,从而导致准确性降低。为了解决这些问题,在本文中,我们提出了一个多阶多构造深网(MMDN),以实现更强大的特征相关性和形状约束学习。具体而言,提出了一个隐式多阶相关几何感知(IMCG)模型,以引入多阶空间相关性和多阶通道相关性,以实现更具歧视性表示。此外,开发了一种基于明确的概率边界自适应回归(EPBR)方法,以增强全局形状约束,并进一步搜索在预测的边界中的语义一致的地标,以进行强大的面部标志性检测。有趣的是,提出的MMDN可以生成更准确的边界自适应地标热图,并有效地增强了对具有较大姿势变化和重型遮挡的面孔的预测地标的形状约束。关于挑战基准数据集的实验结果证明了我们的MMDN优于最先进的面部标志检测方法。该代码已在https://github.com/junwan2014/mmmdn-master上公开获得。
Recently, heatmap regression has been widely explored in facial landmark detection and obtained remarkable performance. However, most of the existing heatmap regression-based facial landmark detection methods neglect to explore the high-order feature correlations, which is very important to learn more representative features and enhance shape constraints. Moreover, no explicit global shape constraints have been added to the final predicted landmarks, which leads to a reduction in accuracy. To address these issues, in this paper, we propose a Multi-order Multi-constraint Deep Network (MMDN) for more powerful feature correlations and shape constraints learning. Specifically, an Implicit Multi-order Correlating Geometry-aware (IMCG) model is proposed to introduce the multi-order spatial correlations and multi-order channel correlations for more discriminative representations. Furthermore, an Explicit Probability-based Boundary-adaptive Regression (EPBR) method is developed to enhance the global shape constraints and further search the semantically consistent landmarks in the predicted boundary for robust facial landmark detection. It's interesting to show that the proposed MMDN can generate more accurate boundary-adaptive landmark heatmaps and effectively enhance shape constraints to the predicted landmarks for faces with large pose variations and heavy occlusions. Experimental results on challenging benchmark datasets demonstrate the superiority of our MMDN over state-of-the-art facial landmark detection methods. The code has been publicly available at https://github.com/junwan2014/MMDN-master.