Bibliography (3):

  1. T5: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

  2. PaLM: Scaling Language Modeling with Pathways

  3. https://github.com/google-research/distilling-step-by-step