Bibliography (5):

  1. Recurrent Neural Network Based Language Model ยง Dynamic Evaluation

  2. Dynamic Evaluation of Neural Sequence Models

  3. Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

  4. Language Models are Unsupervised Multitask Learners

  5. Pointer Sentinel Mixture Models