Bibliography (16):

  1. Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, the World’s Largest and Most Powerful Generative Language Model

  2. Microsoft announces new supercomputer, lays out vision for future AI work

  3. ZeRO-2 & DeepSpeed: Shattering barriers of deep learning speed & scale

  4. Naver unveils first ‘hyperscale’ AI platform

  5. Chinese AI lab challenges Google, OpenAI with a model of 1.75 trillion parameters