论文标题
评估视频供稿的时间查询
Evaluating Temporal Queries Over Video Feeds
论文作者
论文摘要
计算机视觉和深度学习的最新进展使从流视频帧中有效提取了模式。因此,可以生成一系列对象及其关联的类以及通过对象跟踪得出的唯一对象标识符,从而提供唯一的对象,因为它们在跨帧中捕获时提供了唯一的对象。在本文中,我们启动了涉及对象及其在视频提要中的共发生的时间查询的研究。例如,识别视频片段的查询在此期间相同的两辆红色汽车和相同的两个人共同出现了五分钟,这对于从执法到安全和安全性的许多应用都引起了人们的关注。我们采取了第一步,并以某种方式定义了此类查询,即它们结合了视频捕获的某些物理方面,例如对象遮挡。我们提出了一个由三层组成的体系结构,即对象检测/跟踪,中间数据生成和查询评估。我们提出了两种技术,即MFS和SSG,以在中间数据生成层中组织所有检测到的对象,这些对象有效地给定查询,可以最大程度地减少在查询评估过程中必须考虑的对象和框架的数量。我们还引入了一种称为状态遍历(ST)的算法,该算法对SSG进行了传入的框架,并有效地修剪对象和与查询评估无关的框架,同时维护所有简短查询评估所需的所有状态。我们介绍了利用实际和合成数据的彻底实验评估的结果,该数据建立了MFS和SSG之间的权衡。我们在评估中强调各种感兴趣的参数,并证明拟议的查询评估方法与所提出的算法能够有效地评估视频供稿的时间查询,从而实现了幅度级绩效效益的顺序。
Recent advances in Computer Vision and Deep Learning made possible the efficient extraction of a schema from frames of streaming video. As such, a stream of objects and their associated classes along with unique object identifiers derived via object tracking can be generated, providing unique objects as they are captured across frames. In this paper we initiate a study of temporal queries involving objects and their co-occurrences in video feeds. For example, queries that identify video segments during which the same two red cars and the same two humans appear jointly for five minutes are of interest to many applications ranging from law enforcement to security and safety. We take the first step and define such queries in a way that they incorporate certain physical aspects of video capture such as object occlusion. We present an architecture consisting of three layers, namely object detection/tracking, intermediate data generation and query evaluation. We propose two techniques,MFS and SSG, to organize all detected objects in the intermediate data generation layer, which effectively, given the queries, minimizes the number of objects and frames that have to be considered during query evaluation. We also introduce an algorithm called State Traversal (ST) that processes incoming frames against the SSG and efficiently prunes objects and frames unrelated to query evaluation, while maintaining all states required for succinct query evaluation. We present the results of a thorough experimental evaluation utilizing both real and synthetic data establishing the trade-offs between MFS and SSG. We stress various parameters of interest in our evaluation and demonstrate that the proposed query evaluation methodology coupled with the proposed algorithms is capable to evaluate temporal queries over video feeds efficiently, achieving orders of magnitude performance benefits.