Bibliography (6):
‘end-to-end’ directory
Microsoft COCO: Common Objects in Context
Simple statistical gradient-following algorithms for connectionist reinforcement learning
Wikipedia Bibliography:
Reinforcement learning
Differentiable function
Variance