Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images
https://arxiv.org/abs/1711.00937
Generating Diverse High-Fidelity Images with VQ-VAE-2
MaskGIT: Masked Generative Image Transformer
UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes