Bibliography (6):

  1. https://github.com/NVIDIA/FasterTransformer

  2. https://huggingface.co/

  3. https://github.com/FMInference/DejaVu