Decision Transformer: Reinforcement Learning via Sequence Modeling
Top-K Off-Policy Correction for a REINFORCE Recommender System
Reinforcement Learning for Recommender Systems: A Case Study on Youtube
note#advanced-chess-obituary
What If the City Ran Waze and You Had to Obey It? Could This Cure Congestion?
We Traveled to Holloman Air Force Base for a Glimpse of the Future of War-And the Future of Work
https://www.gutenberg.org/cache/epub/16550/pg16550-images.html#Cpage445
Traffic-Weary Homeowners and Waze Are at War, Again. Guess Who’s Winning?
‘Cut-Through’ Traffic Caused by Waze App Must Stop, L.A. Councilman Says
Probabilistic Integration: A Role in Statistical Computation?
Best-Arm Identification Algorithms for Multi-Armed Bandits in the Fixed Confidence Setting
On the Complexity of Best Arm Identification in Multi-Armed Bandit Models
The Power of Two Random Choices: A Survey of Techniques and Results
Hierarchical Object Detection With Deep Reinforcement Learning
Show, Attend and Tell: Neural Image Caption Generation With Visual Attention
Learning to Combine Foveal Glimpses With a Third-Order Boltzmann Machine
Neural Machine Translation by Jointly Learning to Align and Translate
Character-Level Language Modeling with Deeper Self-Attention
Multi-Scale Dense Networks for Resource Efficient Image Classification
IDK Cascades: Fast Deep Learning by Learning not to Overthink
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
Learning Policies for Adaptive Tracking with Deep Feature Cascades
Learning with Rethinking: Recurrently Improving Convolutional Neural Networks through Feedback
Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks
Deciding How to Decide: Dynamic Routing in Artificial Neural Networks
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions
Hybrid computing using a neural network with dynamic external memory
Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes
Simple statistical gradient-following algorithms for connectionist reinforcement learning
Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning
Ask the Right Questions: Active Question Reformulation with Reinforcement Learning
Learning to Organize Knowledge and Answer Questions with N-Gram Machines
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Machine Learning for Systems and Systems for Machine Learning
A View on Deep Reinforcement Learning in System Optimization
Optimizing Query Evaluations using Reinforcement Learning for Web Search
Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation
AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning
MLGO: a Machine Learning Guided Compiler Optimizations Framework
Universal quantum control through deep reinforcement learning
MuZero with Self-competition for Rate Control in VP9 Video Compression
Learning to Perform Local Rewriting for Combinatorial Optimization
Learning a Large Neighborhood Search Algorithm for Mixed Integer Programs
A Full-stack Accelerator Search Technique for Vision Applications
The Deep Learning Revolution and Its Implications for Computer Architecture and Chip Design
Tuning Recurrent Neural Networks with Reinforcement Learning
Reward Augmented Maximum Likelihood for Neural Structured Prediction
Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Mastering the game of Go with deep neural networks and tree search
AlphaGo Zero: Mastering the game of Go without human knowledge
Deep Reinforcement Learning for Mention-Ranking Coreference Models
Language as a Latent Variable: Discrete Generative Models for Sentence Compression
Neural Combinatorial Optimization with Reinforcement Learning
Improving Neural Machine Translation with Conditional Sequence Generative Adversarial Nets
End-to-end optimization of goal-driven and visually grounded dialogue systems
Objective-Reinforced Generative Adversarial Networks (ORGAN) for Sequence Generation Models
Deal or No Deal? End-To-End Learning for Negotiation Dialogues
Grammatical Error Correction with Neural Reinforcement Learning
Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning
Automatically Composing Representation Transformations as a Means for Generalization
A Study of Reinforcement Learning for Neural Machine Translation
Accelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction
Learning to Optimize Join Queries With Deep Reinforcement Learning
Better Rewards Yield Better Summaries: Learning to Summarise Without References
Connecting Generative Adversarial Networks and Actor-Critic Methods
Asynchronous Network Architecture for Semi-Supervised Learning
Learning to Discover Cross-Domain Relations with Generative Adversarial Networks
Unpaired Image-To-Image Translation Using Cycle-Consistent Adversarial Networks
Unsupervised Machine Translation Using Monolingual Corpora Only
Deep Reinforcement Learning for Accelerating the Convergence Rate
Rover Descent: Learning to optimize by learning to navigate on prototypical loss surfaces
Understanding and correcting pathologies in the training of learned optimizers
Online Batch Selection for Faster Training of Neural Networks
Neural Data Filter for Bootstrapping Stochastic Gradient Descent
ScreenerNet: Learning Self-Paced Curriculum for Deep Neural Networks
Differentiable Learning-to-Normalize via Switchable Normalization
Bayesian Active Learning for Classification and Preference Learning
Active Learning for High Dimensional Inputs Using Bayesian Convolutional Neural Networks
Teaching Machines to Describe Images via Natural Language Feedback
Active Learning for Convolutional Neural Networks: A Core-Set Approach
Classification with Costly Features using Deep Reinforcement Learning
Meta-Learning Transferable Active Learning Policies by Deep Reinforcement Learning
Brief Summary of the Panel Discussion at DL Workshop @ICML 2015
Machine Teaching for Bayesian Learners in the Exponential Family
"Less Than One"-Shot Learning: Learning n Classes From M < N Samples
Designing Neural Network Architectures using Reinforcement Learning
Learning to Learn without Gradient Descent by Gradient Descent
RL2: Fast Reinforcement Learning via Slow Reinforcement Learning
Approximate Bayes Optimal Policy Search Using Neural Networks
PathNet: Evolution Channels Gradient Descent in Super Neural Networks
DeepArchitect: Automatically Designing and Training Deep Architectures
Learning to Reason: End-to-End Module Networks for Visual Question Answering
Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks
Learning Transferable Architectures for Scalable Image Recognition
SMASH: One-Shot Model Architecture Search through HyperNetworks
N2N Learning: Network to Network Compression via Policy Gradient Reinforcement Learning
Finding Competitive Network Architectures Within a Day Using UCT
A Flexible Approach to Automated RNN Architecture Generation
Regularized Evolution for Image Classifier Architecture Search
Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions
Efficient Multi-objective Neural Architecture Search via Lamarckian Evolution
Towards Automated Deep Learning: Efficient Joint Neural Architecture and Hyperparameter Search
MnasNet: Platform-Aware Neural Architecture Search for Mobile
Searching for Efficient Multi-Scale Architectures for Dense Image Prediction
Searching Toward Pareto-Optimal Device-Aware Neural Architectures
IRLAS: Inverse Reinforcement Learning for Architecture Search
ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
EfficientNet-EdgeTPU: Creating Accelerator-Optimized Neural Networks With AutoML
https://www.reddit.com/r/ControlProblem/comments/5jlkgi/why_tool_ais_want_to_be_agent_ais/
Sutton & Barto Book: Reinforcement Learning: An Introduction
https://bair.berkeley.edu/blog/2017/07/18/learning-to-learn/
2018-07-26-synced-googleaichiefjeffdeansmlsystemarchitectureblueprint.html
Solving the Mystery of Link Imbalance: A Metastable Failure State at Scale
https://www.fhi.ox.ac.uk/wp-content/uploads/Reframing_Superintelligence_FHI-TR-2019-1.1-1.pdf
There’s plenty of room at the Top: What will drive computer performance after Moore’s law?
Wikipedia Bibliography:
Waze :