域自适应3D姿势增加，用于野生网状恢复

论文标题

域自适应3D姿势增加，用于野生网状恢复

Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh Recovery

论文作者

Weng, Zhenzhen, Wang, Kuan-Chieh, Kanazawa, Angjoo, Yeung, Serena

论文摘要

从单个图像中感知3D人体的能力具有多种应用，从娱乐和机器人技术到神经科学和医疗保健。人类网格恢复中的一个根本挑战是收集训练所需的地面真相3D网格目标，这需要负担重的运动捕获系统，并且通常仅限于室内实验室。结果，尽管在这些限制性设置中收集的基准数据集上取得了进展，但由于分配变化，模型无法推广到现实世界中的“野外”场景。我们提出了域自适应3D姿势增强（DAPA），这是一种数据增强方法，可增强模型在野外场景中的概括能力。 DAPA通过从目标数据集中使用地面真相2D关键点，通过从合成网格中获得直接监督，结合了基于合成数据集的方法的强度。我们定量地表明，使用DAPA的填充有效地改善了基准3DPW和Agora的结果。我们进一步证明了DAPA在一个充满挑战的数据集中，该数据集策划了现实世界中亲子互动的视频。

The ability to perceive 3D human bodies from a single image has a multitude of applications ranging from entertainment and robotics to neuroscience and healthcare. A fundamental challenge in human mesh recovery is in collecting the ground truth 3D mesh targets required for training, which requires burdensome motion capturing systems and is often limited to indoor laboratories. As a result, while progress is made on benchmark datasets collected in these restrictive settings, models fail to generalize to real-world "in-the-wild" scenarios due to distribution shifts. We propose Domain Adaptive 3D Pose Augmentation (DAPA), a data augmentation method that enhances the model's generalization ability in in-the-wild scenarios. DAPA combines the strength of methods based on synthetic datasets by getting direct supervision from the synthesized meshes, and domain adaptation methods by using ground truth 2D keypoints from the target dataset. We show quantitatively that finetuning with DAPA effectively improves results on benchmarks 3DPW and AGORA. We further demonstrate the utility of DAPA on a challenging dataset curated from videos of real-world parent-child interaction.

下载PDF全文

下载文献需遵守相关版权规定

论文标题