Recurrent Neural Network Based Language Model § Dynamic Evaluation
Dynamic Evaluation of Neural Sequence Models
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Language Models are Unsupervised Multitask Learners
Pointer Sentinel Mixture Models