Paper Title
EVIMO2: An Event Camera Dataset for Motion Segmentation, Optical Flow, Structure from Motion, and Visual Inertial Odometry in Indoor Scenes with Monocular or Stereo Algorithms
Paper Authors
Paper Abstract
A new event camera dataset, EVIMO2, is introduced that improves on the popular EVIMO dataset by providing more data, from better cameras, in more complex scenarios. As with its predecessor, EVIMO2 provides labels in the form of per-pixel ground truth depth and segmentation as well as camera and object poses. All sequences use data from physical cameras, and many sequences feature multiple independently moving objects. Typically, such labeled data is unavailable in physical event camera datasets. Thus, EVIMO2 will serve as a challenging benchmark for existing algorithms and a rich training set for the development of new algorithms. In particular, EVIMO2 is suited for supporting research in motion and object segmentation, optical flow, structure from motion, and visual (inertial) odometry in both monocular and stereo configurations. EVIMO2 consists of 41 minutes of data from three 640$\times$480 event cameras, one 2080$\times$1552 classical color camera, inertial measurements from two six-axis inertial measurement units, and millimeter-accurate object poses from a Vicon motion capture system. The dataset's 173 sequences are arranged into three categories: 3.75 minutes of independently moving household objects, 22.55 minutes of static scenes, and 14.85 minutes of basic motions in shallow scenes. Some sequences were recorded in low-light conditions where conventional cameras fail. Depth and segmentation are provided at 60 Hz for the event cameras and 30 Hz for the classical camera. The masks can be regenerated at rates of up to 200 Hz using open-source code. This technical report briefly describes EVIMO2. The full documentation is available online. Videos of individual sequences can be sampled on the download page.
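As a quick consistency check on the figures quoted in the abstract, the short Python sketch below sums the three category durations and converts the stated label rates into time intervals between consecutive labels. It uses only the numbers given above. The dictionary keys and function names are hypothetical and chosen for illustration; they are not part of any EVIMO2 tooling.

```python
# Durations (minutes) of the three sequence categories, as quoted in the abstract.
# The key names are illustrative assumptions, not official EVIMO2 category identifiers.
category_minutes = {
    "independently_moving_objects": 3.75,
    "static_scenes": 22.55,
    "basic_motions_shallow_scenes": 14.85,
}

# Label rates (Hz) quoted in the abstract; key names are again hypothetical.
label_rates_hz = {
    "event_camera": 60,
    "classical_camera": 30,
    "regenerated_masks_max": 200,
}


def total_minutes(durations):
    """Return the summed duration of all categories, in minutes."""
    return sum(durations.values())


def label_period_ms(rate_hz):
    """Return the time between consecutive labels, in milliseconds."""
    return 1000.0 / rate_hz


total = total_minutes(category_minutes)
periods = {name: label_period_ms(rate) for name, rate in label_rates_hz.items()}

print(f"total duration: {total:.2f} min")
for name, period in periods.items():
    print(f"{name}: one label every {period:.2f} ms")
```

The category durations add up to roughly 41.15 minutes, which agrees with the rounded total of 41 minutes stated in the abstract.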