-
Recurrent Neural Network Based Language Model ยง Dynamic Evaluation
-
Dynamic Evaluation of Neural Sequence Models
-
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
-
Language Models are Unsupervised Multitask Learners
-
Pointer Sentinel Mixture Models