NLMVS-NET：深度非lambertian多视图立体声

论文标题

NLMVS-NET：深度非lambertian多视图立体声

nLMVS-Net: Deep Non-Lambertian Multi-View Stereo

论文作者

Yamashita, Kohei, Enyo, Yuto, Nobuhara, Shohei, Nishino, Ko

论文摘要

我们介绍了一种新型的多视图立体声（MVS）方法，该方法不仅可以同时恢复每个像素深度，而且可以恢复表面正常状态，以及在已知但自然照明下捕获的无纹理，复杂的非较稀有表面的反射。我们的关键想法是将MVs作为端到端学习网络，我们称为NLMVS-NET，该网络无缝地集成了辐射识别线索，以利用表面正常状态作为视图的表面特征，用于学习成本量构建和过滤。它首先通过新型的形状从阴影网络估算出每个视图的像素概率密度。这些人均表面正常密度和输入多视图图像将输入到一个新颖的成本量滤波网络中，该网络学会恢复每个像素深度和表面正常。反射率也通过与几何重建交替来明确估计。对新建立的合成和现实世界数据集进行了广泛的定量评估表明，NLMVS-NET可以在自然设置中稳健而准确地恢复复杂物体的形状和反射率。

We introduce a novel multi-view stereo (MVS) method that can simultaneously recover not just per-pixel depth but also surface normals, together with the reflectance of textureless, complex non-Lambertian surfaces captured under known but natural illumination. Our key idea is to formulate MVS as an end-to-end learnable network, which we refer to as nLMVS-Net, that seamlessly integrates radiometric cues to leverage surface normals as view-independent surface features for learned cost volume construction and filtering. It first estimates surface normals as pixel-wise probability densities for each view with a novel shape-from-shading network. These per-pixel surface normal densities and the input multi-view images are then input to a novel cost volume filtering network that learns to recover per-pixel depth and surface normal. The reflectance is also explicitly estimated by alternating with geometry reconstruction. Extensive quantitative evaluations on newly established synthetic and real-world datasets show that nLMVS-Net can robustly and accurately recover the shape and reflectance of complex objects in natural settings.

下载PDF全文

下载文献需遵守相关版权规定

论文标题