Bibliography (15):

  1. https://machinetranslate.org/wmt

  2. Mind the Gap: Assessing Temporal Generalization in Neural Language Models § Dynamic Evaluation

  3. Recurrent Neural Network Based Language Model § Dynamic Evaluation

  4. Recurrent Neural Network Based Language Model

  5. Generating Sequences With Recurrent Neural Networks

  6. Dynamic Evaluation of Transformer Language Models

  7. https://arxiv.org/pdf/2102.01951.pdf#page=18&org=deepmind

  8. https://arxiv.org/pdf/2102.01951#page=5&org=deepmind

  9. BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models