论文标题
可靠性预测的变分单眼深度估计
Variational Monocular Depth Estimation for Reliability Prediction
论文作者
论文摘要
对单眼深度估计的自我监督学习被广泛研究为监督学习方法的替代方法,这需要很多基础真理。以前的工作通过修改模型结构,添加目标并掩盖动态对象和遮挡区域,成功地提高了深度估计的准确性。但是,当在应用程序(例如自动驾驶汽车和机器人)中使用此类估计的深度图像时,我们必须统一地相信每个像素位置的估计深度。这可能会导致执行任务的致命错误,因为某些像素的估计深度可能会导致更大的错误。在本文中,我们从理论上制定了单眼深度估计的变分模型,以预测估计深度图像的可靠性。根据结果,我们可以排除可靠性较低的估计深度,也可以将其完善以供实际使用。使用KITTI基准和Make3D数据集在定量和定性上证明了所提出方法的有效性。
Self-supervised learning for monocular depth estimation is widely investigated as an alternative to supervised learning approach, that requires a lot of ground truths. Previous works have successfully improved the accuracy of depth estimation by modifying the model structure, adding objectives, and masking dynamic objects and occluded area. However, when using such estimated depth image in applications, such as autonomous vehicles, and robots, we have to uniformly believe the estimated depth at each pixel position. This could lead to fatal errors in performing the tasks, because estimated depth at some pixels may make a bigger mistake. In this paper, we theoretically formulate a variational model for the monocular depth estimation to predict the reliability of the estimated depth image. Based on the results, we can exclude the estimated depths with low reliability or refine them for actual use. The effectiveness of the proposed method is quantitatively and qualitatively demonstrated using the KITTI benchmark and Make3D dataset.