CogView: Mastering Text-to-Image Generation via Transformers
pdf: arxiv.org/pdf/2105.13290.pdf
abs: arxiv.org/abs/2105.13290
demo: lab.aminer.cn/cogview/index.…
a 4-billion-parameter Transformer with VQ-VAE tokenizer, achieves a new SOTA FID on blurred MS COCO
May 28, 2021 · 12:52 AM UTC