Bibliography (8):
Longformer: The Long-Document Transformer
BigBird: Transformers for Longer Sequences
WaveNet: A Generative Model for Raw Audio
Attention Is All You Need
Long Range Arena (LRA): A Benchmark for Efficient Transformers
Wikipedia Bibliography:
Language model: https://en.wikipedia.org/wiki/Language_model
Computational complexity theory
Transformer (deep learning architecture)