Bibliography (3):
https://www.latent.space/p/fastai#%C2%A7replacing-fine-tuning-with-continued-pre-training
https://nlp.fast.ai/category/classification.html
Pointer Sentinel Mixture Models