haru · Oct 10, 2022 · 1:45 AM UTC

haru · Oct 10, 2022 · 1:45 AM UTC

haru

10 Oct 2022

just finetuned the SD 1.4 vae on a bunch of anime-styled images and finally it can reconstruct eyes and fingers muuuuch better! vae is available here: huggingface.co/hakurei/waifu…

Oct 10, 2022 · 1:45 AM UTC

322

haru · Oct 10, 2022 · 1:55 AM UTC

haru @haruu1367

10 Oct 2022

massive thanks to @mahdimc for providing the compute used for finetuning! here's some more examples with the original image on the left, SD 1.4 VAE reconstruction in the middle, and the finetuned VAE on the right:

maxine · Oct 10, 2022 · 3:53 AM UTC

maxine @aicrumb

10 Oct 2022

Replying to @haruu1367

extremely excited, i've wanted to do this for so long

haru · Oct 10, 2022 · 4:00 AM UTC

haru @haruu1367

10 Oct 2022

yeah im surprised no one has done it yet, it seems more efficient than finetuning SD on image embeddings

Jonathan Chang (e/acc) · Oct 10, 2022 · 2:12 AM UTC

Jonathan Chang (e/acc) @cccntu

10 Oct 2022

Replying to @haruu1367 @_akhaliq

Did you fine tune the encoder together with the decoder, before fine tuning the unet?

haru · Oct 11, 2022 · 11:35 AM UTC

haru @haruu1367

11 Oct 2022

the unet has not been touched at all, just the encoder and decoder. i'm still finetuning it more now with the encoder frozen. will post more results later