Update: GPT-4 now solves this Advent of Code problem perfectly on the first try, whereas ChatGPT required many back-and-forth iterations...
Excited to find out where the new boundary is
Had a fun time getting ChatGPT to solve today's Advent of Code puzzle
I'd describe its performance in human terms as "nervous interview candidate who drank too much coffee": pretty smart, makes careless mistakes, responds well to feedback, works very fast. 1/
Mar 14, 2023 · 9:39 PM UTC
Prompt and output below for this example -- confirmed that GPT-3.5 fails on first attempt w/ the exact same prompt...
On Advent of Code 2022 Day 5 (adventofcode.com/2022/day/5), it didn't get it right on the first try... 😅
...but it did succeed after 2 iterations of reading and responding to errors, with no manual hints!
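For anyone curious what "reading and responding to errors" could look like if automated, here is a minimal sketch. It is not the setup used in this thread (the thread does not say how the errors were passed back). It assumes a hypothetical `ask_model` callable that takes the conversation so far and returns Python source defining `solve(puzzle_input)`, and it assumes the expected answer for a sample input is known.

```python
import traceback
from typing import Callable, List, Optional

# Hypothetical model interface (an assumption, not a real API):
# takes the messages so far and returns Python source code as a string.
AskModel = Callable[[List[str]], str]


def try_candidate(source: str, puzzle_input: str, expected: str) -> Optional[str]:
    """Run one candidate. Return None on success, else feedback text for the model."""
    namespace: dict = {}
    try:
        # exec is only acceptable here for trusted or sandboxed code.
        exec(source, namespace)
        answer = str(namespace["solve"](puzzle_input))
    except Exception:
        # Syntax errors, missing `solve`, and runtime errors all land here.
        return traceback.format_exc(limit=1)
    if answer != expected:
        return f"Wrong answer: got {answer!r}, expected {expected!r}"
    return None


def solve_with_feedback(
    ask_model: AskModel,
    prompt: str,
    puzzle_input: str,
    expected: str,
    max_attempts: int = 5,
) -> Optional[int]:
    """Return the attempt number that succeeded, or None if every attempt failed."""
    messages = [prompt]
    for attempt in range(1, max_attempts + 1):
        source = ask_model(messages)
        feedback = try_candidate(source, puzzle_input, expected)
        if feedback is None:
            return attempt
        # No manual hints: only the failed code and its raw error are sent back.
        messages += [source, feedback]
    return None
```

With a model that needs two rounds of error feedback, `solve_with_feedback` would return 3. A model that never converges within `max_attempts` yields `None`.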
I previously found Day 8 was beyond GPT-3's capabilities... nitter.net/geoffreylitt/sta…
GPT-4 looked like it was in danger of reverting to nonsensical revisions after 2 incorrect iterations, but somehow succeeded on its 3rd attempt
On Day 9 (adventofcode.com/2022/day/9) it flailed for 5 attempts and didn't get it right. Similar failure modes to GPT-3, making up nonsensical explanations about why the code was wrong
Interestingly, when I previously worked on this Day 9 problem, I found that GPT was useful for generating a runtime visualization that helped me code a solution, although it couldn't solve the problem by itself
I guess Human-AI collab is still a thing for now!