Bibliography (8):

  1. https://huggingface.co/datasets/cerebras/SlimPajama-627B

  2. https://huggingface.co/datasets

  3. Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention

  4. https://arxiv.org/abs/2002.05202

  5. https://www.cerebras.net/product-system/

  6. https://huggingface.co/MBZUAI-LLM