https://github.com/sanjeevanahilan/nanoChatGPT (GPT, model-based RL, preference learning)