ā€œDALLĀ·E-Bot: Introducing Web-Scale Diffusion Models to Roboticsā€, Ivan Kapelyukh, Vitalis Vosylius, Edward Johns2022-10-05 (, )⁠:

We introduce the first work to explore web-scale diffusion models for robotics. DALLĀ·E-Bot enables a robot to rearrange objects in a scene, by first inferring a text description of those objects, then generating an image representing a natural, human-like arrangement of those objects, and finally physically arranging the objects according to that image.

The importance is that we achieve this zero-shot using DALLĀ·E, without needing any further data collection or training.

Encouraging real-world results with human studies show that this is an exciting direction for the future of web-scale robot learning algorithms.

We also propose a list of recommendations to the text-to-image community, to align further developments of these models with applications to robotics.

Videos are available at: https://www.robot-learning.uk/dall-e-bot.