https://github.com/nomic-ai/gpt4all (ML dataset, knowledge distillation, GPT-4 nonfiction)
https://github.com/nomic-ai/gpt4all