论文标题

用随机插值构建标准化流量

Building Normalizing Flows with Stochastic Interpolants

论文作者

Albergo, Michael S., Vanden-Eijnden, Eric

论文摘要

提出了基于任何一对基础和目标概率密度之间连续时间归一化流量的生成模型。该流量的速度场是根据在有限的时间内在基础和目标之间插值的时间相关密度的概率电流推断出来的。与传统的基于最大似然原理的标准化推理方法不同,需要通过ODE求解器进行昂贵的反向传播,我们的插值方法会导致速度本身的简单二次损失,这是根据预期表达的,这很容易被经验估计。该流程可用于从基座或目标产生样品,并沿着插入术的任何时间估算可能性。另外,可以优化流量以最大程度地减少插值密度的路径长度,从而为构建最佳传输图的道路铺平了道路。在基座是高斯密度的情况下,我们还表明,我们的归一流流量的速度也可用于构建扩散模型以采样目标并估计其得分。但是,我们的方法表明,我们可以完全绕过这种扩散,并以更简单的方式在概率流的水平上起作用,这仅基于普通微分方程作为基于随机微分方程的方法,为方法开辟了途径。密度估计任务的基准测试表明,学习流可以以一小部分成本匹配并超过常规的连续流,并与CIFAR-10和Imagenet上图像生成的扩散良好,并进行了很好的比较。该方法缩放Ab-Initio Ode流到以前无法到达的图像分辨率上,最高为$ 128 \ times128 $。

A generative model based on a continuous-time normalizing flow between any pair of base and target probability densities is proposed. The velocity field of this flow is inferred from the probability current of a time-dependent density that interpolates between the base and the target in finite time. Unlike conventional normalizing flow inference methods based the maximum likelihood principle, which require costly backpropagation through ODE solvers, our interpolant approach leads to a simple quadratic loss for the velocity itself which is expressed in terms of expectations that are readily amenable to empirical estimation. The flow can be used to generate samples from either the base or target, and to estimate the likelihood at any time along the interpolant. In addition, the flow can be optimized to minimize the path length of the interpolant density, thereby paving the way for building optimal transport maps. In situations where the base is a Gaussian density, we also show that the velocity of our normalizing flow can also be used to construct a diffusion model to sample the target as well as estimate its score. However, our approach shows that we can bypass this diffusion completely and work at the level of the probability flow with greater simplicity, opening an avenue for methods based solely on ordinary differential equations as an alternative to those based on stochastic differential equations. Benchmarking on density estimation tasks illustrates that the learned flow can match and surpass conventional continuous flows at a fraction of the cost, and compares well with diffusions on image generation on CIFAR-10 and ImageNet $32\times32$. The method scales ab-initio ODE flows to previously unreachable image resolutions, demonstrated up to $128\times128$.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源