论文标题
几何映射图像特征
Geometrically Mappable Image Features
论文作者
论文摘要
基于视觉的代理在地图中的定位是机器人技术和计算机视觉中的重要问题。在这种情况下,由于机器学习的最新进展,通过学习可匹配的图像特征通过学习匹配的图像特征的本地化变得越来越流行。独特地描述图像视觉内容的功能具有广泛的应用,包括图像检索和理解。在这项工作中,我们提出了一种学习针对基于图像的基于图像的本地化的图像功能的方法。基于检索的本地化具有多个好处,例如易于维护和快速计算。但是,最新功能仅提供视觉相似性得分,这些分数并未明确揭示查询和检索到的图像之间的几何距离。知道此距离非常需要准确的定位,尤其是当参考图像稀疏分布在场景中时。因此,我们为学习图像特征提出了一种新颖的损失函数,这些功能既具有视觉代表性又具有几何相关性。这是通过指导学习过程来实现的,以使图像之间的特征和几何距离直接成比例。在我们的实验中,我们表明我们的功能不仅提供了更好的定位精度,而且还可以在没有参考图像的情况下估算查询顺序的轨迹。
Vision-based localization of an agent in a map is an important problem in robotics and computer vision. In that context, localization by learning matchable image features is gaining popularity due to recent advances in machine learning. Features that uniquely describe the visual contents of images have a wide range of applications, including image retrieval and understanding. In this work, we propose a method that learns image features targeted for image-retrieval-based localization. Retrieval-based localization has several benefits, such as easy maintenance and quick computation. However, the state-of-the-art features only provide visual similarity scores which do not explicitly reveal the geometric distance between query and retrieved images. Knowing this distance is highly desirable for accurate localization, especially when the reference images are sparsely distributed in the scene. Therefore, we propose a novel loss function for learning image features which are both visually representative and geometrically relatable. This is achieved by guiding the learning process such that the feature and geometric distances between images are directly proportional. In our experiments we show that our features not only offer significantly better localization accuracy, but also allow to estimate the trajectory of a query sequence in absence of the reference images.