Bibliography (4):
https://github.com/google/BIG-bench
https://openai.com/
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
Wikipedia Bibliography:
Transformer (deep learning architecture)