“NVIDIA/Megatron-LM: Ongoing Research Training Transformer Models at Scale” (; backlinks)