Bibliography (8):

  1. https://valle-demo.github.io/

  2. GPT-3: Language Models are Few-Shot Learners

  3. https://danielpovey.com/files/2015_icassp_librispeech.pdf

  4. AudioLM: a Language Modeling Approach to Audio Generation