CogView: Mastering Text-to-Image Generation via Transformers
pdf: arxiv.org/pdf/2105.13290.pdf
abs: arxiv.org/abs/2105.13290
demo: lab.aminer.cn/cogview/index.…

a 4-billion-parameter Transformer with VQ-VAE tokenizer, achieves a new SOTA FID on blurred MS COCO

May 28, 2021 · 12:52 AM UTC

Replying to @_akhaliq @ak92501
Is it just the demo that only supports simplified Chinese?