“Transformer-XL—Combining Transformers and RNNs Into a State-Of-The-Art Language Model”, Rani Horev2019 (; backlinks)