Bibliography (3):
T5: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
PaLM: Scaling Language Modeling with Pathways
https://github.com/google-research/distilling-step-by-step