I’ve been playing with using GPT-3 to control a browser the last couple days. Here’s a quick demo. As you can see it's pretty neat! But also quite flakey.
Will publish the source code shortly for others to try and improve.
Sep 29, 2022 · 11:38 PM UTC
This was inspired by Sharif's hack from a while back: nitter.net/sharifshameem/st…
And of course by the impressive work Adept is doing.
But this is just a toy, not very durable (yet?).
The hardest part was using the chrome debugger protocol and playwright to suck the DOM out of Chrome and turn it into something GPT-3 can read in a single prompt. A very smart person I found on Upwork helped me a ton with that (Thanks Alex!)