Playing with BigGAN again, I think I have worked out how to do Markov Chain Monte Carlo w/ a CLIP posterior in its extended latent space (incl. a good parameterization and prior for the class vectors...)
Oct 29, 2022 · 4:36 PM UTC
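In code, the setup looks something like the sketch below (not the exact code from this thread: the thread doesn't name the MCMC algorithm, so plain unadjusted Langevin / SGLD stands in for it, and the likelihood weight `CLIP_WEIGHT`, the step size, and the `pytorch_pretrained_biggan` + OpenAI `clip` packages are all assumptions here):

```python
# Sketch only. Assumes the pytorch-pretrained-biggan and openai-clip
# packages; SGLD and CLIP_WEIGHT are stand-ins, not from the thread.
import torch
import torch.nn.functional as F
import clip
from pytorch_pretrained_biggan import BigGAN

device = "cuda" if torch.cuda.is_available() else "cpu"
gan = BigGAN.from_pretrained("biggan-deep-256").to(device).eval()
clip_model, _ = clip.load("ViT-B/32", device=device)
clip_model = clip_model.float()  # avoid fp16/fp32 mismatch on GPU

text = clip.tokenize(["a photo of a red fox"]).to(device)
with torch.no_grad():
    text_feat = F.normalize(clip_model.encode_text(text), dim=-1)

# CLIP's standard image normalization constants.
MEAN = torch.tensor([0.48145466, 0.4578275, 0.40821073], device=device)
STD = torch.tensor([0.26862954, 0.26130258, 0.27577711], device=device)
CLIP_WEIGHT = 100.0  # assumed likelihood sharpness, not from the thread

def log_posterior(z, y):
    # N(0, I) priors on the noise vector z (128-d) and the class
    # parameters y (1000-d), up to an additive constant.
    log_prior = -0.5 * (z ** 2).sum() - 0.5 * (y ** 2).sum()
    # Scale y by 1/sqrt(1000) so the soft class mix has 2-norm ~ 1.
    img = gan(z[None], y[None] / 1000 ** 0.5, truncation=1.0)
    img = F.interpolate((img + 1) / 2, size=224, mode="bilinear",
                        align_corners=False)
    img = (img - MEAN[:, None, None]) / STD[:, None, None]
    img_feat = F.normalize(clip_model.encode_image(img), dim=-1)
    return log_prior + CLIP_WEIGHT * (img_feat * text_feat).sum()

z = torch.randn(128, device=device, requires_grad=True)
y = torch.randn(1000, device=device, requires_grad=True)  # prior sample
step = 1e-4
for _ in range(500):
    gz, gy = torch.autograd.grad(log_posterior(z, y), (z, y))
    with torch.no_grad():
        # Langevin update: half-step up the log-density plus noise.
        z += 0.5 * step * gz + step ** 0.5 * torch.randn_like(z)
        y += 0.5 * step * gy + step ** 0.5 * torch.randn_like(y)
```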
I think the most important thing about the class vectors is their norm. The model was trained on one-hots, which have 2-norm 1. So I just used a Gaussian prior for them. It doesn't seem to matter much that the model never saw negative class-vector entries during training.
To keep my parameters well-scaled, I made their prior distribution N(0, I) and sampled their initial values from it. Then I multiplied them by 1/sqrt(1000) (ImageNet has 1000 classes) before feeding them to the model: a standard-normal vector in 1000 dimensions has 2-norm concentrated around sqrt(1000), so the soft class mixes the model sees have 2-norm around 1.
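Quick sanity check of that arithmetic (the class index below is arbitrary):

```python
import torch

one_hot = torch.zeros(1000)
one_hot[0] = 1.0                        # one-hot class vectors have 2-norm 1
print(one_hot.norm())                   # tensor(1.)

# The squared norm of a 1000-d standard-normal draw is chi^2(1000),
# mean 1000, so dividing by sqrt(1000) puts the 2-norm near 1.
y = torch.randn(8, 1000)
print((y / 1000 ** 0.5).norm(dim=1))    # all close to 1
```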
This was successful at de-dogging the model! At first, following Big Sleep, I had used @advadnoun's initial distribution as the prior and sigmoided the class parameters before feeding them to the model. But with that parameterization the samples couldn't get away from dogs!
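For contrast, an illustration only (the thread doesn't give Big Sleep's exact initial distribution, so the standard-normal pre-sigmoid values here are an assumption): sigmoid confines every class entry to (0, 1), which both rules out negative entries and can leave the mix's 2-norm far from the one-hot norm of 1.

```python
import torch

raw = torch.randn(8, 1000)              # assumed pre-sigmoid values
print(torch.sigmoid(raw).norm(dim=1))   # ~17: entries hover around 0.5
print((raw / 1000 ** 0.5).norm(dim=1))  # ~1: matches the one-hot scale
print((raw / 1000 ** 0.5).min(dim=1).values)  # negative entries allowed
```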