在视频镶嵌应用中，有效的重叠帧的主动注释

论文标题

在视频镶嵌应用中，有效的重叠帧的主动注释

Active Annotation of Informative Overlapping Frames in Video Mosaicking Applications

论文作者

Peter, Loic, Tella-Amo, Marcel, Shakir, Dzhoshkun Ismail, Deprest, Jan, Ourselin, Sebastien, Iglesias, Juan Eugenio, Vercauteren, Tom

论文摘要

视频摩西需要对位于序列遥远时间点处的重叠帧进行注册，以确保重建场景的全局一致性。但是，当图像本身的注册很困难时，这种长期对完全自动化的注册是（i）挑战；（ii）由于大量候选对注册的对，对于长序列而言，计算上的昂贵。在本文中，我们引入了一个有效的框架，用于以序列的远程成对对应关系进行主动注释。我们的框架提出了一对图像，这些图像对Oracle代理（例如，人用户或可靠的匹配算法）提供了信息，它们在每个建议的一对上都提供了视觉对应关系。根据迭代策略，基于原则性注释奖励，以及两个互补和在线适应性的框架重叠模型来检索信息对。除了有效的马赛克构造外，我们的框架还提供了可用于评估或学习目的的副产品地面地标对应。我们通过合成序列的实验，用于航空成像的公开数据集以及在胎儿手术期间进行胎盘镶嵌的临床数据集，在自动和交互式场景中评估我们的方法。

Video mosaicking requires the registration of overlapping frames located at distant timepoints in the sequence to ensure global consistency of the reconstructed scene. However, fully automated registration of such long-range pairs is (i) challenging when the registration of images itself is difficult; and (ii) computationally expensive for long sequences due to the large number of candidate pairs for registration. In this paper, we introduce an efficient framework for the active annotation of long-range pairwise correspondences in a sequence. Our framework suggests pairs of images that are sought to be informative to an oracle agent (e.g., a human user, or a reliable matching algorithm) who provides visual correspondences on each suggested pair. Informative pairs are retrieved according to an iterative strategy based on a principled annotation reward coupled with two complementary and online adaptable models of frame overlap. In addition to the efficient construction of a mosaic, our framework provides, as a by-product, ground truth landmark correspondences which can be used for evaluation or learning purposes. We evaluate our approach in both automated and interactive scenarios via experiments on synthetic sequences, on a publicly available dataset for aerial imaging and on a clinical dataset for placenta mosaicking during fetal surgery.

下载PDF全文

下载文献需遵守相关版权规定

论文标题