-
GPT-3: Language Models are Few-Shot Learners
-
OPT: Open Pre-trained Transformer Language Models
-
Bigscience/bloom
-
ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
-
https://github.com/THUDM/GLM-130B/blob/main/docs/quantization.md
-
https://github.com/THUDM/GLM-130B
-
https://x.com/alexjc/status/1617152800571416577
-