We love good ol’ completion models at Replit. We just replaced GPT-4 on a backend task with the new gpt-3.5-turbo with no accuracy hit. Faster + cheaper.
Just ran an internal eval on a production task with gpt-3.5-turbo-instruct and it performs at GPT-4 level 👀

Sep 20, 2023 · 2:34 AM UTC

Replying to @amasad
This is interesting. Would you use gpt-3.5-turbo for coding as well?
Testing the new instruct rn
Replying to @amasad
What was the task? Did it involve code gen?
Classification kinda. No
Replying to @amasad
Do you mean turbo or instruct?
Replying to @amasad
Kind of shocking tbh. Fingers crossed we get gpt-4 instruct soon.
Replying to @amasad
RLHF really did a number on these models. Been testing instruct 3.5 as well for a variety of tasks, and it's really good.
Replying to @amasad
How did you define accuracy? Or was this a fairly simple deterministic operation?
Replying to @amasad
Any chance you could share some examples of the kinds of prompts you're using here? I'm still trying to figure out what to use instruct for as opposed to regular turbo
Replying to @amasad
Will cover in #thursdai, definitely interesting from OAI