Bibliography (10):

  1. Building a Large Annotated Corpus of English: The Penn Treebank

  2. https://blog.salesforceairesearch.com/the-wikitext-long-term-dependency-language-modeling-dataset/

  3. Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

  4. Recurrent Neural Network Based Language Model ยง Dynamic Evaluation

  5. Attention Is All You Need