-
https://research.google/blog/training-generalist-agents-with-multi-game-decision-transformers/
-
Gato: A Generalist Agent
-
Attention Is All You Need
-
Decision Transformer: Reinforcement Learning via Sequence Modeling
-
https://sites.google.com/view/multi-game-transformers
-
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
-
https://arxiv.org/pdf/2205.15241.pdf#page=8
-
https://arxiv.org/pdf/2205.15241.pdf#page=7
-
โ โGPTโ directory
-
https://arxiv.org/pdf/2205.15241.pdf#page=17
-
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes
-
Decoupled Weight Decay Regularization
-
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
-
https://arxiv.org/pdf/2205.15241.pdf#page=21
-
An Optimistic Perspective on Offline Reinforcement Learning
-
https://diyhpl.us/~nmz787/pdf/Human-level_control_through_deep_reinforcement_learning.pdf
-
Off-Policy Deep Reinforcement Learning without Exploration
-
Scaling Laws for Neural Language Models
-
PaLM: Scaling Language Modeling with Pathways
-
PlaNet: Learning Latent Dynamics for Planning from Pixels
-
Dream to Control: Learning Behaviors by Latent Imagination
-
DreamerV2: Mastering Atari with Discrete World Models
-
Scaling Laws for Autoregressive Generative Modeling
-
From Motor Control to Team Play in Simulated Humanoid Football
-
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
-
GPT-3: Language Models are Few-Shot Learners
-
Podracer architectures for scalable Reinforcement Learning
-