Bibliography (7):

  1. https://x.com/aviral_kumar2/status/1887764754539614648

  2. Scaling laws for single-agent reinforcement learning

  3. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

  4. Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control

  5. Parallel Q-Learning (PQL): Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation

  6. Wikipedia Bibliography:

    1. Pareto front

    2. OpenAI