“Transformer-XL—Combining Transformers and RNNs Into a State-Of-The-Art Language Model”, Rani Horev, 2019 (recurrent Transformers)