Paper Title

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow

Authors

Xingchao Liu, Chengyue Gong, Qiang Liu

Abstract

We present rectified flow, a surprisingly simple approach to learning (neural) ordinary differential equation (ODE) models to transport between two empirically observed distributions π_0 and π_1, hence providing a unified solution to generative modeling and domain transfer, among various other tasks involving distribution transport. The idea of rectified flow is to learn the ODE to follow the straight paths connecting the points drawn from π_0 and π_1 as much as possible. This is achieved by solving a straightforward nonlinear least squares optimization problem, which can be easily scaled to large models without introducing extra parameters beyond standard supervised learning. The straight paths are special and preferred because they are the shortest paths between two points, and can be simulated exactly without time discretization and hence yield computationally efficient models. We show that the procedure of learning a rectified flow from data, called rectification, turns an arbitrary coupling of π_0 and π_1 to a new deterministic coupling with provably non-increasing convex transport costs. In addition, recursively applying rectification allows us to obtain a sequence of flows with increasingly straight paths, which can be simulated accurately with coarse time discretization in the inference phase. In empirical studies, we show that rectified flow performs superbly on image generation, image-to-image translation, and domain adaptation. In particular, on image generation and translation, our method yields nearly straight flows that give high quality results even with a single Euler discretization step.
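As a rough illustration of the idea described above, the sketch below shows the nonlinear least squares objective (matching a velocity field to the straight-line direction between paired samples) and a coarse Euler sampler. This is not the authors' released code; `velocity_net`, and the assumption that data are flattened vectors of shape (batch, dim), are hypothetical.

```python
# Minimal sketch of rectified flow training and sampling, assuming
# flattened data tensors of shape (batch, dim) and a hypothetical
# velocity_net(x, t) -> velocity model. Not the authors' implementation.
import torch

def rectified_flow_loss(velocity_net, x0, x1):
    """Nonlinear least squares: fit v(x_t, t) to the straight-line
    velocity x1 - x0 along the interpolation x_t = t*x1 + (1-t)*x0."""
    t = torch.rand(x0.shape[0], 1)          # uniform time in [0, 1]
    x_t = t * x1 + (1.0 - t) * x0           # point on the straight path
    target = x1 - x0                        # constant straight-line velocity
    pred = velocity_net(x_t, t)
    return ((pred - target) ** 2).mean()

@torch.no_grad()
def euler_sample(velocity_net, x0, n_steps=1):
    """Simulate dx/dt = v(x, t) from t=0 to t=1 with coarse Euler steps;
    a single step can suffice once the learned flow is nearly straight."""
    x, dt = x0, 1.0 / n_steps
    for i in range(n_steps):
        t = torch.full((x.shape[0], 1), i * dt)
        x = x + dt * velocity_net(x, t)
    return x
```

Rectification would then amount to re-running the same training loss on pairs (x0, euler_sample(velocity_net, x0)) generated by the previously learned flow, yielding successively straighter flows.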
