Bibliography (5):

  1. https://github.com/google-research/google-research/tree/master/fwl

  2. Recurrent Neural Network Based Language Model ยง Dynamic Evaluation

  3. Pointer Sentinel Mixture Models

  4. Reconsidering the Past: Optimizing Hidden States in Language Models

  5. GPT-3: Language Models are Few-Shot Learners