https://machinetranslate.org/wmt
Mind the Gap: Assessing Temporal Generalization in Neural Language Models § Dynamic Evaluation
Recurrent Neural Network Based Language Model § Dynamic Evaluation
Recurrent Neural Network Based Language Model
Generating Sequences With Recurrent Neural Networks
Dynamic Evaluation of Transformer Language Models
https://arxiv.org/pdf/2102.01951.pdf#page=18&org=deepmind
https://arxiv.org/pdf/2102.01951#page=5&org=deepmind
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models