-
Mind the Gap: Assessing Temporal Generalization in Neural Language Models ยง Scaling
-
Recurrent Neural Network Based Language Model ยง Dynamic Evaluation
-
Recurrent Neural Network Based Language Model
-
Generating Sequences With Recurrent Neural Networks
-
Dynamic Evaluation of Transformer Language Models
-
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models