-
https://huggingface.co/datasets/cerebras/SlimPajama-627B
-
https://huggingface.co/datasets
-
Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention
-
https://arxiv.org/abs/2002.05202
-
https://www.cerebras.net/product-system/
-
https://huggingface.co/MBZUAI-LLM
-