“‘Model-Free RL’ Tag”,2018-12-12
![]()
Bibliography for tag
reinforcement-learning/model-free, most recent first: 2 related tags, 288 annotations, & 25 links (parent).
- See Also
- Links
- “Deep Reinforcement Learning Without Experience Replay, Target Networks, or Batch Updates”, et al 2024
- “Centaur: a Foundation Model of Human Cognition”, et al 2024
- “SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning”, et al 2024
- “Training Language Models to Self-Correct via Reinforcement Learning”, et al 2024
- “Carpentopod: A Walking Table Project”, 2024
- “Mind Wandering During Implicit Learning Is Associated With Increased Periodic EEG Activity And Improved Extraction Of Hidden Probabilistic Patterns”, et al 2024
- “Alexa Is in Millions of Households—And Amazon Is Losing Billions”, 2024
- “Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo”, et al 2024
- “Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data”, et al 2024
- “CodeIt: Self-Improving Language Models With Prioritized Hindsight Replay”, et al 2024
- “ReST Meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent”, et al 2023
- “Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning”, et al 2023
- “Let Models Speak Ciphers: Multiagent Debate through Embeddings”, et al 2023
- “Predictive Auxiliary Objectives in Deep RL Mimic Learning in the Brain”, 2023
- “Small Batch Deep Reinforcement Learning”, Obando- et al 2023
- “Subwords As Skills: Tokenization for Sparse-Reward Reinforcement Learning”, et al 2023
- “Comparative Study of Model-Based and Model-Free Reinforcement Learning Control Performance in HVAC Systems”, 2023
- “What Are Dreams For? Converging Lines of Research Suggest That We Might Be Misunderstanding Something We Do Every Night of Our Lives”, 2023
- “Learning to Model the World With Language”, et al 2023
- “Low-Poly Image Generation Using Evolutionary Algorithms in Ruby”
- “Using Temperature to Analyze the Neural Basis of a Time-Based Decision”, et al 2023
- “Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks”, Chevalier- et al 2023
- “Twitching in Sensorimotor Development from Sleeping Rats to Robots”, et al 2023
- “Universal Mechanical Polycomputation in Granular Matter”, et al 2023
- “Lucy-SKG: Learning to Play Rocket League Efficiently Using Deep Reinforcement Learning”, et al 2023
- “Improving Language Models With Advantage-Based Offline Policy Gradients”, et al 2023
- “Reinforcement Learning in Newcomb-Like Environments”, et al 2023
- “WizardLM: Empowering Large Language Models to Follow Complex Instructions”, et al 2023
- “Bridging Discrete and Backpropagation: Straight-Through and Beyond”, et al 2023
- “Empirical Design in Reinforcement Learning”, et al 2023
- “A Circuit Mechanism Linking past and Future Learning through Shifts in Perception”, et al 2023
- “Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier”, et al 2023
- “Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula”, et al 2022
- “Melting Pot 2.0”, et al 2022
- “Token Turing Machines”, et al 2022
- “Legged Locomotion in Challenging Terrains Using Egocentric Vision”, et al 2022
- “Over-Communicate No More: Situated RL Agents Learn Concise Communication Protocols”, et al 2022
- “E3B: Exploration via Elliptical Episodic Bonuses”, et al 2022
- “Hyperbolic Deep Reinforcement Learning”, et al 2022
- “Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization”, et al 2022
- “Dynamic Prompt Learning via Policy Gradient for Semi-Structured Mathematical Reasoning”, et al 2022
- “Simplifying Model-Based RL: Learning Representations, Latent-Space Models, and Policies With One Objective (ALM)”, et al 2022
- “Human-Level Atari 200× Faster”, et al 2022
- “Nearest Neighbor Non-Autoregressive Text Generation”, et al 2022
- “A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning”, et al 2022
- “Learning to Generalize With Object-Centric Agents in the Open World Survival Game Crafter”, et al 2022
- “Improved Policy Optimization for Online Imitation Learning”, et al 2022
- “Offline RL for Natural Language Generation With Implicit Language Q Learning”, et al 2022
- “Fine-Grained Image Captioning With CLIP Reward”, et al 2022
- “Reward Bases: Instantaneous Reward Revaluation With Temporal Difference Learning”, et al 2022
- “Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi”, et al 2022
- “Quantifying and Alleviating Political Bias in Language Models”, et al 2022c
- “Machine Learning Helps Control Tokamak Plasmas”, 2022
- “Retrieval-Augmented Reinforcement Learning”, et al 2022
- “Policy Learning and Evaluation With Randomized Quasi-Monte Carlo”, et al 2022
- “A Data-Driven Approach for Learning to Control Computers”, et al 2022
- “Magnetic Control of Tokamak Plasmas through Deep Reinforcement Learning”, et al 2022
- “Why Should I Trust You, Bellman? The Bellman Error Is a Poor Replacement for Value Error”, et al 2022
- “Learning Dynamics and Generalization in Deep Reinforcement Learning”, et al 2022
- “Agile Locomotion via Model-Free Learning”, 2022
- “Amortized Noisy Channel Neural Machine Translation”, et al 2021
- “Deep Reinforcement Learning Policies Learn Shared Adversarial Features Across MDPs”, 2021
- “Simple but Effective: CLIP Embeddings for Embodied AI”, et al 2021
- “Offline Reinforcement Learning With Implicit Q-Learning (IQL)”, et al 2021
- “Recurrent Model-Free RL Is a Strong Baseline for Many POMDPs”, et al 2021
- “DroQ: Dropout Q-Functions for Doubly Efficient Reinforcement Learning”, et al 2021
- “Batch Size-Invariance for Policy Optimization”, et al 2021
- “MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research”, et al 2021
- “Bootstrapped Meta-Learning”, et al 2021
- “Megaverse: Simulating Embodied Agents at One Million Experiences per Second”, et al 2021
- “PES: Unbiased Gradient Estimation in Unrolled Computation Graphs With Persistent Evolution Strategies”, et al 2021
- “Multi-Task Curriculum Learning in a Complex, Visual, Hard-Exploration Domain: Minecraft”, et al 2021
- “On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning”, et al 2021
- “Constructions in Combinatorics via Neural Networks”, 2021
- “Muesli: Combining Improvements in Policy Optimization”, et al 2021
- “Podracer Architectures for Scalable Reinforcement Learning”, et al 2021
- “Counter-Strike Deathmatch With Large-Scale Behavioral Cloning”, 2021
- “ALD: Efficient Transformers in Reinforcement Learning Using Actor-Learner Distillation”, 2021
- “Replay in Deep Learning: Current Approaches and Missing Biological Elements”, et al 2021
- “Large Batch Simulation for Deep Reinforcement Learning”, et al 2021
- “The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games”, et al 2021
- “Reinforcement Learning for Datacenter Congestion Control”, et al 2021
- “Training Larger Networks for Deep Reinforcement Learning”, et al 2021
- “How RL Agents Behave When Their Actions Are Modified”, 2021
- “A✱ Search Without Expansions: Learning Heuristic Functions With Deep Q-Networks”, et al 2021
- “Randomized Ensembled Double Q-Learning (REDQ): Learning Fast Without a Model”, et al 2021
- “MLGO: a Machine Learning Guided Compiler Optimizations Framework”, et al 2021
- “Evolving Reinforcement Learning Algorithms”, Co- et al 2021
- “Using Deep Reinforcement Learning to Reveal How the Brain Encodes Abstract State-Space Representations in High-Dimensional Environments”, 2020
- “Autonomous Navigation of Stratospheric Balloons Using Reinforcement Learning”, et al 2020
- “A Unified Framework for Dopamine Signals across Timescales”, et al 2020
- “Offline Learning from Demonstrations and Unlabeled Experience”, et al 2020
- “Adversarial Vulnerabilities of Human Decision-Making”, et al 2020
- “D2RL: Deep Dense Architectures in Reinforcement Learning”, et al 2020
- “Human-Centric Dialog Training via Offline Reinforcement Learning”, et al 2020
- “Emergent Social Learning via Multi-Agent Reinforcement Learning”, et al 2020
- “Super-Human Performance in Gran Turismo Sport Using Deep Reinforcement Learning”, et al 2020
- “SPR: Data-Efficient Reinforcement Learning With Self-Predictive Representations”, et al 2020
- “Learning Breakout From RAM—Part 2”, philoxenic 2020
- “Learning Breakout From RAM—Part 1”, philoxenic 2020
- “Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks”, et al 2020
- “Improving GAN Training With Probability Ratio Clipping and Sample Reweighting”, et al 2020
- “Conservative Q-Learning for Offline Reinforcement Learning”, et al 2020
- “Controlling Overestimation Bias With Truncated Mixture of Continuous Distributional Quantile Critics (TQC)”, et al 2020
- “Evaluating the Rainbow DQN Agent in Hanabi With Unseen Partners”, et al 2020
- “Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels”, et al 2020
- “Chip Placement With Deep Reinforcement Learning”, et al 2020
- “CURL: Contrastive Unsupervised Representations for Reinforcement Learning”, et al 2020
- “Evolving Normalization-Activation Layers”, et al 2020
- “Benchmarking End-To-End Behavioral Cloning on Video Games”, et al 2020
- “Agent57: Outperforming the Atari Human Benchmark”, et al 2020
- “Deep Neuroethology of a Virtual Rodent”, et al 2020
- “Q✱ Approximation Schemes for Batch Reinforcement Learning: A Theoretical Comparison”, 2020
- “Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?”, et al 2020
- “Causal Evidence Supporting the Proposal That Dopamine Transients Function As Temporal Difference Prediction Errors”, et al 2020
- “A Distributional Code for Value in Dopamine-Based Reinforcement Learning”, et al 2020
- “Combining Q-Learning and Search With Amortized Value Estimates”, et al 2019
- “SEED RL: Scalable and Efficient Deep-RL With Accelerated Central Inference”, et al 2019
- “Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?”, et al 2019
- “QUARL: Quantized Reinforcement Learning (ActorQ)”, et al 2019
- “Deep Learning Enables Rapid Identification of Potent DDR1 Kinase Inhibitors”, et al 2019
- “Exponential Slowdown for Larger Populations: The (μ+1)-EA on Monotone Functions”, 2019
- “Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the Playing Field”, et al 2019
- “A View on Deep Reinforcement Learning in System Optimization”, Haj- et al 2019
- “Playing the Lottery With Rewards and Multiple Languages: Lottery Tickets in RL and NLP”, et al 2019
- “A General Dichotomy of Evolutionary Algorithms on Monotone Functions”, 2019
- “A Recipe for Training Neural Networks”, 2019
- “Universal Quantum Control through Deep Reinforcement Learning”, et al 2019
- “Reinforcement Learning for Recommender Systems: A Case Study on Youtube”, 2019
- “Benchmarking Classic and Learned Navigation in Complex 3D Environments”, et al 2019
- “AutoPhase: Compiler Phase-Ordering for High Level Synthesis With Deep Reinforcement Learning”, Haj- et al 2019
- “Anxiety, Depression, and Decision Making: A Computational Perspective”, 2019
- “Reinforcement Learning in Artificial and Biological Systems”, 2019
- “Designing Neural Networks through Neuroevolution”, et al 2019
- “IRLAS: Inverse Reinforcement Learning for Architecture Search”, et al 2018
- “Quantifying Generalization in Reinforcement Learning”, et al 2018
- “Top-K Off-Policy Correction for a REINFORCE Recommender System”, et al 2018
- “Relative Entropy Regularized Policy Iteration”, et al 2018
- “ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware”, et al 2018
- “Neural Probabilistic Motor Primitives for Humanoid Control”, et al 2018
- “InstaNAS: Instance-Aware Neural Architecture Search”, et al 2018
- “A Closer Look at Deep Policy Gradients”, et al 2018
- “One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets With RL”, et al 2018
- “Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow”, et al 2018
- “Learning to Perform Local Rewriting for Combinatorial Optimization”, 2018
- “R2D2: Recurrent Experience Replay in Distributed Reinforcement Learning”, et al 2018
- “Benchmarking Reinforcement Learning Algorithms on Real-World Robots”, et al 2018
- “Deterministic Implementations for Reproducibility in Deep Reinforcement Learning”, et al 2018
- “Multi-Task Deep Reinforcement Learning With PopArt”, et al 2018
- “Accelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction”, 2018
- “Searching Toward Pareto-Optimal Device-Aware Neural Architectures”, et al 2018
- “A Study of Reinforcement Learning for Neural Machine Translation”, et al 2018
- “Improving Abstraction in Text Summarization”, et al 2018
- “Learning to Optimize Join Queries With Deep Reinforcement Learning”, et al 2018
- “InfoNCE: Representation Learning With Contrastive Predictive Coding (CPC)”, et al 2018
- “Is Q-Learning Provably Efficient?”, et al 2018
- “Maximum a Posteriori Policy Optimization”, et al 2018
- “The Unusual Effectiveness of Averaging in GAN Training”, et al 2018
- “Resource-Efficient Neural Architect”, et al 2018
- “DVRL: Deep Variational Reinforcement Learning for POMDPs”, et al 2018
- “Playing Atari With Six Neurons”, et al 2018
- “Measuring the Intrinsic Dimension of Objective Landscapes”, et al 2018
- “DP4G: Distributed Distributional Deterministic Policy Gradients”, Barth- et al 2018
- “Optimizing Query Evaluations Using Reinforcement Learning for Web Search”, et al 2018
- “Delayed Impact of Fair Machine Learning”, et al 2018
- “Accelerated Methods for Deep Reinforcement Learning”, 2018
- “Learning Memory Access Patterns”, et al 2018
- “Investigating Human Priors for Playing Video Game”, et al 2018
- “ME-TRPO: Model-Ensemble Trust-Region Policy Optimization”, et al 2018
- “TD3: Addressing Function Approximation Error in Actor-Critic Methods”, et al 2018
- “Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration”, et al 2018
- “Unicorn: Continual Learning With a Universal, Off-Policy Agent”, et al 2018
- “ENAS: Efficient Neural Architecture Search via Parameter Sharing”, et al 2018
- “Regularized Evolution for Image Classifier Architecture Search”, et al 2018
- “IMPALA: Scalable Distributed Deep-RL With Importance Weighted Actor-Learner Architectures”, et al 2018
- “Interactive Grounded Language Acquisition and Generalization in a 2D World”, et al 2018
- “Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning With a Stochastic Actor”, et al 2018
- “Chapter 5: Monte Carlo Methods”, 2018 (page 133)
- “Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning”, et al 2017
- “The Case for Learned Index Structures”, et al 2017
- “AI Safety Gridworlds”, et al 2017
- “Classification With Costly Features Using Deep Reinforcement Learning”, et al 2017
- “Reinforcement Learning of Speech Recognition System Based on Policy Gradient and Hypothesis Selection”, 2017
- “Towards the Use of Deep Reinforcement Learning With Global Policy For Query-Based Extractive Summarization”, 2017
- “Swish: Searching for Activation Functions”, et al 2017
- “Gradient-Free Policy Architecture Search and Adaptation”, et al 2017
- “Rainbow: Combining Improvements in Deep Reinforcement Learning”, et al 2017
- “OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning”, et al 2017
- “Deep Reinforcement Learning That Matters”, et al 2017
- “Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents”, et al 2017
- “Seq2SQL: Generating Structured Queries from Natural Language Using Reinforcement Learning”, et al 2017
- “The Successor Representation in Human Reinforcement Learning”, et al 2017
- “Practical Block-Wise Neural Network Architecture Generation”, et al 2017
- “Learning Policies for Adaptive Tracking With Deep Feature Cascades”, et al 2017
- “Reinforced Video Captioning With Entailment Rewards”, 2017
- “A Distributional Perspective on Reinforcement Learning”, et al 2017
- “Tracking As Online Decision-Making: Learning a Policy from Streaming Videos With Reinforcement Learning”, III & 2017
- “Trial without Error: Towards Safe Reinforcement Learning via Human Intervention”, et al 2017
- “Efficient Architecture Search by Network Transformation”, et al 2017
- “Grammatical Error Correction With Neural Reinforcement Learning”, et al 2017
- “Noisy Networks for Exploration”, et al 2017
- “Gated-Attention Architectures for Task-Oriented Language Grounding”, et al 2017
- “The Persistence and Transience of Memory”, 2017
- “Deep Reinforcement Learning from Human Preferences § Appendix A.2: Atari”, et al 2017 (page 15 org openai)
- “Towards Synthesizing Complex Programs from Input-Output Examples”, et al 2017
- “IDK Cascades: Fast Deep Learning by Learning Not to Overthink”, et al 2017
- “Teaching Machines to Describe Images via Natural Language Feedback”, 2017
- “Learning Time/Memory-Efficient Deep Architectures With Budgeted Super Networks”, 2017
- “Objective-Reinforced Generative Adversarial Networks (ORGAN) for Sequence Generation Models”, et al 2017
- “Ask the Right Questions: Active Question Reformulation With Reinforcement Learning”, et al 2017
- “A Deep Reinforced Model for Abstractive Summarization”, et al 2017
- “Inferring and Executing Programs for Visual Reasoning”, et al 2017
- “Time-Contrastive Networks: Self-Supervised Learning from Video”, et al 2017
- “RAM: Dynamic Computational Time for Visual Attention”, et al 2017
- “Born to Learn: the Inspiration, Progress, and Future of Evolved Plastic Artificial Neural Networks (EPANNs)”, et al 2017
- “Improving Neural Machine Translation With Conditional Sequence Generative Adversarial Nets”, et al 2017
- “End-To-End Optimization of Goal-Driven and Visually Grounded Dialogue Systems”, et al 2017
- “Neural Episodic Control”, et al 2017
- “CoDeepNEAT: Evolving Deep Neural Networks”, et al 2017
- “Tuning Recurrent Neural Networks With Reinforcement Learning”, et al 2017
- “PathNet: Evolution Channels Gradient Descent in Super Neural Networks”, et al 2017
- “The Kelly Coin-Flipping Game: Exact Solutions”, et al 2017
- “Deep Reinforcement Learning: A Brief Survey”, et al 2017
- “Your TL;DR by an AI: A Deep Reinforced Model for Abstractive Summarization”, 2017
- “Loss Is Its Own Reward: Self-Supervision for Reinforcement Learning”, et al 2016
- “Self-Critical Sequence Training for Image Captioning”, et al 2016
- “Neural Combinatorial Optimization With Reinforcement Learning”, et al 2016
- “Reinforcement Learning With Unsupervised Auxiliary Tasks”, et al 2016
- “A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models”, et al 2016
- “Hybrid Computing Using a Neural Network With Dynamic External Memory”, et al 2016
- “Connecting Generative Adversarial Networks and Actor-Critic Methods”, 2016
- “Deep Reinforcement Learning for Mention-Ranking Coreference Models”, 2016
- “Deep Neural Networks for YouTube Recommendations”, et al 2016
- “The Malmo Platform for Artificial Intelligence Experimentation”, et al 2016
- “Progressive Neural Networks”, et al 2016
- “Learning to Optimize”, 2016
- “Deep Reinforcement Learning for Dialogue Generation”, et al 2016
- “ViZDoom: A Doom-Based AI Research Platform for Visual Reinforcement Learning”, et al 2016
- “Learning from the Memory of Atari 2600”, 2016
- “Improving Information Extraction by Acquiring External Evidence With Reinforcement Learning”, et al 2016
- “Asynchronous Methods for Deep Reinforcement Learning”, et al 2016
- “Dueling Network Architectures for Deep Reinforcement Learning”, et al 2015
- “Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning”, et al 2015
- “Prioritized Experience Replay”, et al 2015
- “Deep Reinforcement Learning With Double Q-Learning”, et al 2015
- “Gorila: Massively Parallel Methods for Deep Reinforcement Learning”, et al 2015
- “Reinforcement Learning Neural Turing Machines—Revised”, 2015
- “An Invitation to Imitation”, 2015
- “TRPO: Trust Region Policy Optimization”, et al 2015
- “DRAW: A Recurrent Neural Network For Image Generation”, et al 2015
- “Random Feedback Weights Support Learning in Deep Neural Networks”, et al 2014
- “Learning to Execute”, 2014
- “Does Temporal Discounting Explain Unhealthy Behavior? A Systematic Review and Reinforcement Learning Perspective”, et al 2014
- “Playing Atari With Deep Reinforcement Learning”, et al 2013
- “The Arcade Learning Environment: An Evaluation Platform for General Agents”, et al 2012
- “Artist Agent: A Reinforcement Learning Approach to Automatic Stroke Generation in Oriental Ink Painting”, et al 2012
- “Off-Policy Actor-Critic”, et al 2012
- “Neural Mechanisms of Speed-Accuracy Tradeoff”, 2012
- “DAgger: A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning”, et al 2010
- “Compositional Pattern Producing Networks: A Novel Abstraction of Development”, 2007
- “Midbrain Dopamine Neurons Encode a Quantitative Reward Prediction Error Signal”, 2005
- “It Takes Two Neurons To Ride a Bicycle”, 2004
- “Recent Developments in the Evolution of Morphologies and Controllers for Physically Simulated Creatures § A Re-Implementation of Sims’ Work Using the MathEngine Physics Engine”, 2001 (page 6)
- “Learning to Drive a Bicycle Using Reinforcement Learning and Shaping”, Randløv & 1998
- “6.6 Actor-Critic Methods”, 1998
- “Descriptor Predictive Control: Tracking Controllers for a Riderless Bicycle”, et al 1996
- “Control for an Autonomous Bicycle”, 1995
- “Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning”, 1992
- Proceedings of the First International Conference on Genetic Algorithms and Their Applications, 1985
- “Temporal Credit Assignment In Reinforcement Learning”, 1984
- “Experiments on the Mechanization of Game-Learning Part II. Rule-Based Learning and the Human Window [BOXES]”, 1982
- “Why the Law of Effect Will Not Go Away”, 1974
- “Experiments on the Mechanization of Game-Learning Part I. Characterization of the Model and Its Parameters [MENACE]”, 1963
- “A Matchbox Game-Learning Machine”, 1962
- “Some Studies in Machine Learning Using the Game of Checkers”, 1959
- “John Schulman’s Homepage”, 2024
- “LA Residents Complain about ‘Waze Craze’”
- “Sutton & Barto Book: Reinforcement Learning: An Introduction”, 2024
- “Evolving Stable Strategies”, 2024
- “Finding Nash Equilibria through Simulation”
- Trackmania I—The History of Machine Learning in Trackmania
- “The 37 Implementation Details of Proximal Policy Optimization”
- “Microsoft and Meta Join Google in Using AI to Help Run Their Data Centers”
- “Hedonic Loops and Taming RL”, 2024
- “Sony’s Racing Car AI Just Destroyed Its Human Competitors—By Being Nice (and Fast)”
- “Target-Driven Visual Navigation in Indoor Scenes Using Deep Reinforcement Learning [Video]”
- “Measuring the Intrinsic Dimension of Objective Landscapes [Video]”
- “Zyme—An Evolvable Language”
- Wikipedia
- Miscellaneous
- Bibliography