Bibliography (6):
‘end-to-end’ directory
Playing Atari with Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Wikipedia Bibliography:
Reinforcement learning
Differentiable function
Softmax function