Bibliography (4):

  1. Attention Is All You Need

  2. A domain-specific supercomputer for training deep neural networks

  3. TensorStore for High-Performance, Scalable Array Storage. https://research.google/blog/tensorstore-for-high-performance-scalable-array-storage/

  4. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models