"the moon overrun by flowers, tempera" A grid of 256 samples, made in 4 minutes 34 seconds (20 steps PLMS).
Feb 9, 2022 · 5:33 AM UTC
The 4 minutes 34 seconds was on *one GPU* btw (an A100). The sampling takes four model outputs per timestep for the first three timesteps and one model output per timestep for all subsequent timesteps, so it did computation equivalent to 29 DDIM steps.
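As a quick check on that count, here is a minimal sketch of the arithmetic. The function name and parameters are made up for illustration and are not from any sampler codebase. It assumes one DDIM step costs one model output.

```python
def plms_model_evals(num_steps, warmup_steps=3, evals_per_warmup_step=4):
    """Count model outputs for a PLMS run as described above.

    Hypothetical helper: the first `warmup_steps` timesteps cost
    `evals_per_warmup_step` model outputs each, every later timestep costs one.
    """
    warmup = min(num_steps, warmup_steps)
    return warmup * evals_per_warmup_step + (num_steps - warmup)


# 3 warm-up steps * 4 outputs + 17 remaining steps * 1 output
print(plms_model_evals(20))  # 29
```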
It's using the fast sampling method from "Pseudo Numerical Methods for Diffusion Models on Manifolds" (openreview.net/forum?id=PlKW…), which consists of diffusion-specialized versions of higher-order ODE solvers.
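For a rough idea of what "higher order" means here: after the warm-up steps, the paper's multistep variant combines the last four noise predictions with the classical fourth-order Adams-Bashforth weights before taking a step. The sketch below shows only that combination. The names are illustrative, and the diffusion-specific transfer step that turns the combined prediction into the next sample is left out.

```python
# Classical 4th-order Adams-Bashforth weights, newest prediction first.
# They sum to 24, so the combination is divided by 24.
AB4_COEFFS = (55.0, -59.0, 37.0, -9.0)


def combine_eps(eps_history):
    """Blend the four most recent noise predictions (newest first).

    Illustrative helper, not the paper's or any library's API. Works on
    plain floats or on arrays that support scalar multiplication and addition.
    """
    if len(eps_history) != 4:
        raise ValueError("expected exactly four predictions, newest first")
    return sum(c * e for c, e in zip(AB4_COEFFS, eps_history)) / 24.0
```

Since only the newest of the four predictions has to be computed at each timestep, these steps cost one model output apiece.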