New text-to-image AI models: ruDALL-E. Example from ruDALL-E Malevich (XL): "a red car" (translated to Russian). Links in a comment. (i.redd.it)
submitted 3 months ago by Wiskkey
[–]Wiskkey[S] 7 points 3 months ago* (26 children)
Technical report (Russian).
Technical report (translated to English by Google Translate).
English language article that is similar to the technical report.
English language demo for ruDALL-E Malevich (XL).
English language ruDALL-E home page.
GitHub repo for ruDALL-E Malevich (XL).
Google Colab notebook ruDALLE-example-generation.
Google Colab notebook ruDALLE-example-generation-A100.
Google Colab notebook ruDALLE-image-prompts-A100.
Notebook at Kaggle.
From the 2nd link:
We trained two versions of the model of different sizes and gave them the names of the great Russian abstract artists - Wassily Kandinsky and Kazimir Malevich:
[1]. ruDALL-E Kandinsky (XXL) with 12 billion parameters;
[2]. ruDALL-E Malevich (XL) containing 1.3 billion parameters.
The base output appears to be at 256x256, but this version of Real-ESRGAN is apparently used to upscale the images in the demo.
Input for the demo apparently needs to be in Russian, and is not auto-translated. Here is a language translator.
[–]Wiskkey[S] 3 points 3 months ago (0 children)
Colab notebook "Optimized Image Prompts" from stomperhomp (reference).
Kaggle notebook "Optimized Image Prompts" from stomperhomp (reference)
[–]Wiskkey[S] 2 points 3 months ago* (0 children)
A notebook from ouhenio is at https://www.reddit.com/r/MachineLearning/comments/qlbye5/comment/hj1u215/.
[–]Wiskkey[S] 2 points 3 months ago (0 children)
"Optimized ruDALLE v2.0" by stomperhomp.
[–]Wiskkey[S] 2 points 3 months ago (1 child)
Colab notebook "ruDALLE-example-generation w/ Caching" (Twitter reference)
[–]theRIAA 2 points 3 months ago (0 children)
thinking woman statue logo. > думающая женщина статуя логотип.
1 hr on a P100; images_num = 18 on all batches, 144 images total.
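The figures above imply a rough per-image cost. This is only a back-of-the-envelope sketch: it assumes the "1hr" covers all 144 images, which the comment does not state explicitly.

```python
images_per_batch = 18      # images_num from the comment
total_images = 144         # total reported in the comment
total_seconds = 60 * 60    # "1hr", ASSUMED to cover all 144 images

# Number of batches needed to reach the reported total.
batches = total_images // images_per_batch
# Average wall-clock time per image under the assumption above.
seconds_per_image = total_seconds / total_images

print(f"{batches} batches, about {seconds_per_image:.0f} s per image")
```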
Colab Rudalle-image-prompts-dress-mannequins-v100.
Colab Philipuss's ruDALLE Notebook.
[–]Wiskkey[S] 2 points 2 months ago (0 children)
New Colab notebook mentioned in the GitHub repo: Malevich-3.5GB-vRAM-usage.
Colab notebook UnenBubb-ruDALLE-generation. Reddit reference.
Colab notebook "ruDALLE-Outpainting" from eyaler.
[–]Wiskkey[S] 2 points 1 month ago (0 children)
Colab Looking Glass v1.1 for finetuning.
[–]Wiskkey[S] 1 point 3 months ago (0 children)
Kinda-English ruDALL-E.
The demo site now has an English user interface here.
Optimized ruDALLE v2.0 (Kaggle).
Colab notebook Text2Image_v4.
[–]Wiskkey[S] 1 point 3 months ago* (0 children)
Colab notebook Dalle_finetune_16gb.
Colab notebook Eng_pipeline_ruDALLE-example-generation.
[–]Wiskkey[S] 1 point 2 months ago (0 children)
"Tired of Colab K80. So I create a repo of PaddlePaddle version of ruDalle. The weights were converted in order to use the free v100 inference. [...]"
4 new ruDALL-E notebooks (2 Google Colab, 2 Kaggle) from stomperhomp for arbitrary resolution image generation, and for finetuning.
Emojich: finetuned ruDALL-E on emojis.
[–]Wiskkey[S] 1 point 1 month ago (0 children)
Colab ruDALL-E Mass Batcher.
"ruDALL-E diffusion" from Jack000 uses a diffusion model to alter images generated by ruDALL-E.
"Malevich ruDALL-E with ESRGAN" from ray.krehn.
NeuralWardrobe. Twitter reference.
ruDALLE arbitrary resolution v2.0 from stomperhomp. Twitter reference.
[–]theRIAA 3 points 3 months ago* (0 children)
First two prompts I tried: "a sturdy red chair" and "an armchair in the shape of an avocado. an armchair imitating an avocado."
Pretty groundbreaking. topk=512: 4.6 min each on a P100.
This now seems good enough to be used as product design inspiration. It might prefer a different prompting style than the original DALL-E.
edit: here is a better translator that also allows ru_to_en:
    !pip install -U deep_translator

    import time
    from deep_translator import GoogleTranslator, MyMemoryTranslator

    # langs_dict = GoogleTranslator.get_supported_languages(as_dict=True)
    # print(langs_dict)

    text = 'text to translate'
    tService = GoogleTranslator  # GoogleTranslator or MyMemoryTranslator
    # English -> Russian
    translated = tService(source='en', target='ru').translate(text)
    time.sleep(1)  # short pause between the two requests
    # Russian -> English, to check what the Russian prompt actually says
    rev_translated = tService(source='ru', target='en').translate(translated)
    print(f'original: {text}\ntranslated: {translated}\nrev-tran: {rev_translated}')
    text = translated  # the Russian text becomes the prompt
Reverse translation is very useful to confirm the intention of your prompt. I used this a lot for CogView.
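A minimal sketch of that round-trip check, in case it helps. The function name `round_trip_check` and the tiny lookup tables are made up for illustration so the example runs offline; in practice you would pass in real translators, for example the `GoogleTranslator` calls from the snippet above.

```python
from typing import Callable, Dict, List, Union


def round_trip_check(
    prompt: str,
    to_ru: Callable[[str], str],
    to_en: Callable[[str], str],
) -> Dict[str, Union[str, List[str]]]:
    """Translate a prompt to Russian and back, and list words that did not survive."""
    russian = to_ru(prompt)
    back = to_en(russian)
    # Compare lowercase word sets, ignoring full stops.
    original_words = set(prompt.lower().replace(".", " ").split())
    back_words = set(back.lower().replace(".", " ").split())
    lost = sorted(original_words - back_words)
    return {"russian": russian, "back": back, "lost": lost}


# Hypothetical stand-in translators (plain dict lookups), NOT a real service.
fake_en_ru = {"a red car": "красная машина"}
fake_ru_en = {"красная машина": "red car"}

result = round_trip_check(
    "a red car",
    lambda s: fake_en_ru[s],
    lambda s: fake_ru_en[s],
)
print(result)
```

Words in `lost` are only a hint: articles like "a" routinely drop out because Russian has none, so it is the content words that are worth checking.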
удобное кресло в форме авокадо. rev-tran: comfortable armchair in the shape of an avocado. (512, 0.97, 3)