TalkRL: The Reinforcement Learning Podcast: Aravind Srinivas 2: Aravind Srinivas, Research Scientist at OpenAI, Returns to Talk Decision Transformer, VideoGPT, Choosing Problems, and Explore vs Exploit in Research Careers
ODT: Online Decision Transformer
Attention Is All You Need
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Decision Transformer: Reinforcement Learning via Sequence Modeling
GPT-3 Creative Fiction § Prompts As Programming
Openai/gym: A Toolkit for Developing and Comparing Reinforcement Learning Algorithms.
https://kzl.github.io/assets/decision_transformer.pdf
https://github.com/kzl/decision-transformer
MuZero: Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Reinforcement Learning Upside Down: Don’t Predict Rewards—Just Map Them to Actions
Learning Relative Return Policies With Upside-Down Reinforcement Learning
A Very Unlikely Chess Game
Transformers Play Chess
The Value Equivalence Principle for Model-Based Reinforcement Learning
Shaking the foundations: delusions in sequence models for interaction and control
Trajectory Transformer: Reinforcement Learning as One Big Sequence Modeling Problem
GPT-2 Preference Learning for Music Generation § Decision Transformers: Preference Learning As Simple As Possible
rnn-metadata#inline-metadata-trick
[Transclude the forward-link's
context]
CTRL: A Conditional Transformer Language Model For Controllable Generation
Towards a Human-like Open-Domain Chatbot
Controllable Generation from Pre-trained Language Models via Inverse Prompting
https://architext.design/about/
DALL·E 1: Creating Images from Text: We’ve trained a neural network called DALL·E that creates images from text captions for a wide range of concepts expressible in natural language
CLIP: Connecting Text and Images: We’re introducing a neural network called CLIP which efficiently learns visual concepts from natural language supervision. CLIP can be applied to any visual classification benchmark by simply providing the names of the visual categories to be recognized, similar to the ‘zero-shot’ capabilities of GPT-2 and GPT-3
CogView: Mastering Text-to-Image Generation via Transformers
Choose-Your-Own-Adventure AI Dungeon Games