We love good ol’ completion models at Replit.
We just replaced GPT-4 on a backend task with the new gpt-3.5-turbo-instruct with no accuracy hit. Faster + cheaper.
Just ran an internal eval on a production task with gpt-3.5-turbo-instruct and it performs at GPT-4 level 👀