论文标题
自适应网状视频稳定
Adaptively Meshed Video Stabilization
论文作者
论文摘要
视频稳定对于提高动荡视频的视觉质量至关重要。当前的视频稳定方法通常在背景中采用特征轨迹,以估算一个基于固定网格的一个全局变换矩阵或几个转换矩阵,并将摇动摇动框架变成稳定视图。但是,这些方法可能不会在复杂的场景中很好地建模摇摇欲坠的相机运动,例如包含大型前景物体或强烈视差的场景,并且可能导致稳定视频中显着的视觉伪像。为了解决上述问题,本文提出了一种适应性网格的方法,以根据其所有特征轨迹和自适应阻止策略稳定摇摇欲坠的视频。更具体地说,我们首先提取摇晃视频的特征轨迹,然后根据每个帧中特征轨迹的分布生成三角网格。然后计算出摇晃的框架之间的转换及其对网格所有三角形网格的稳定视图,以稳定摇晃的视频。由于通常可以从所有区域(包括背景区域和前景区域)中提取更多的特征轨迹,因此将获得更细的网格,并为摄像机运动估计和框架翘曲提供。我们通过解决两个阶段优化问题来估计每个帧基于网格的转换。此外,前景和背景特征轨迹不再区分,两者都有助于估算拟议的优化问题中的摄像头运动,这比以前的作品产生更好的估计性能,尤其是在具有较大前景对象或强烈视差的挑战性视频中。
Video stabilization is essential for improving visual quality of shaky videos. The current video stabilization methods usually take feature trajectories in the background to estimate one global transformation matrix or several transformation matrices based on a fixed mesh, and warp shaky frames into their stabilized views. However, these methods may not model the shaky camera motion well in complicated scenes, such as scenes containing large foreground objects or strong parallax, and may result in notable visual artifacts in the stabilized videos. To resolve the above issues, this paper proposes an adaptively meshed method to stabilize a shaky video based on all of its feature trajectories and an adaptive blocking strategy. More specifically, we first extract feature trajectories of the shaky video and then generate a triangle mesh according to the distribution of the feature trajectories in each frame. Then transformations between shaky frames and their stabilized views over all triangular grids of the mesh are calculated to stabilize the shaky video. Since more feature trajectories can usually be extracted from all regions, including both background and foreground regions, a finer mesh will be obtained and provided for camera motion estimation and frame warping. We estimate the mesh-based transformations of each frame by solving a two-stage optimization problem. Moreover, foreground and background feature trajectories are no longer distinguished and both contribute to the estimation of the camera motion in the proposed optimization problem, which yields better estimation performance than previous works, particularly in challenging videos with large foreground objects or strong parallax.