Bibliography (8):
OPT: Open Pre-trained Transformer Language Models
Bigscience/bloom
GLM-130B: An Open Bilingual Pre-trained Model
https://github.com/NVIDIA/FasterTransformer
https://github.com/mit-han-lab/smoothquant
Wikipedia Bibliography:
Quantization (signal processing)
https://en.wikipedia.org/wiki/Integer_(computer_science)#Common_integral_data_types :
https://en.wikipedia.org/wiki/Integer_(computer_science)#Common_integral_data_types
https://en.wikipedia.org/wiki/Basic_Linear_Algebra_Subprograms#Level_3 :
https://en.wikipedia.org/wiki/Basic_Linear_Algebra_Subprograms#Level_3