Bibliography (7):

  1. Deep Reinforcement Learning without Experience Replay, Target Networks, or Batch Updates

  2. https://mujoco.org/

  3. dm_control: Software and Tasks for Continuous Control

  4. The Arcade Learning Environment: An Evaluation Platform for General Agents