WebGPT reproduced using advanced prompting only.
Dust-based web-search assistant demo answers questions by searching the web, summarizing content and compiling a final answer with references:
dust.tt/spolu/a/41770fd3d9
Oct 31, 2022 · 3:27 PM UTC
Steps 1/, 2/ and 3/ replace browsing, clicking, scrolling and selecting with a more direct API-based approach. Less shiny but more resilient.
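For readers who want the shape of this in code, here is a rough sketch of an API-driven flow (search, fetch, summarize, then aggregate) rather than simulated browsing. It is not the Dust app's actual implementation: the mapping of steps to functions is an assumption based on the description above, and every name (`answer_pipeline`, `Source`, the four callables) is hypothetical. The search, fetch and model calls are injected so no particular API is implied.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Source:
    """One retrieved page, reduced to what the final step needs."""
    url: str
    summary: str


def answer_pipeline(
    question: str,
    search: Callable[[str], List[str]],               # hypothetical: question -> result URLs
    fetch: Callable[[str], str],                      # hypothetical: URL -> page text
    summarize: Callable[[str, str], str],             # hypothetical: (question, text) -> summary
    aggregate: Callable[[str, List[Source]], str],    # hypothetical: final answer with references
    max_results: int = 3,
) -> str:
    """Run the steps as plain function calls instead of click/scroll actions."""
    urls = search(question)[:max_results]
    sources = []
    for url in urls:
        text = fetch(url)
        sources.append(Source(url=url, summary=summarize(question, text)))
    return aggregate(question, sources)


if __name__ == "__main__":
    # Stub callables standing in for a search API, a headless browser and a model.
    print(
        answer_pipeline(
            "example question",
            search=lambda q: ["https://example.com/a", "https://example.com/b"],
            fetch=lambda url: f"text of {url}",
            summarize=lambda q, text: text[:40],
            aggregate=lambda q, srcs: "; ".join(f"[{i}] {s.url}" for i, s in enumerate(srcs, 1)),
        )
    )
```

Keeping each step behind a plain callable is what makes this more resilient than replaying browser actions: a step can be swapped or retried without touching the others.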
Note that text-davinci-002 is really good at step 3/ out of the box without few-shot examples. Hypothesis: Instruct-series training set probably includes a lot of summarization tasks.
Step 4/ required few-shot prompting to teach the model how to aggregate content and generate references. Two examples were sufficient! It fails completely without them.
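A minimal sketch of what a two-shot aggregation prompt could look like. The prompt layout (`QUESTION` / `SOURCES` / `ANSWER`), the `[n]` reference style and the function names are all assumptions for illustration, not the prompt used in the Dust app, and the examples passed in would be hand-written demonstrations.

```python
from typing import List, Tuple

# (url, summary) pairs produced by the earlier steps.
SourceList = List[Tuple[str, str]]
# A worked demonstration: question, its sources, and a reference-annotated answer.
Example = Tuple[str, SourceList, str]


def format_sources(sources: SourceList) -> str:
    """Number the sources so the answer can cite them as [1], [2], ..."""
    return "\n".join(
        f"[{i}] {url}: {summary}" for i, (url, summary) in enumerate(sources, start=1)
    )


def build_aggregation_prompt(
    question: str, sources: SourceList, examples: List[Example]
) -> str:
    """Prepend the worked examples, then leave the final ANSWER open for the model."""
    parts = []
    for ex_question, ex_sources, ex_answer in examples:
        parts.append(
            f"QUESTION: {ex_question}\n"
            f"SOURCES:\n{format_sources(ex_sources)}\n"
            f"ANSWER: {ex_answer}"
        )
    parts.append(
        f"QUESTION: {question}\nSOURCES:\n{format_sources(sources)}\nANSWER:"
    )
    return "\n\n".join(parts)
```

The returned string would be sent to a completion model such as text-davinci-002. With `examples=[]` this degrades to the zero-shot prompt, which makes it easy to compare against the two-example version.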
Original WebGPT used 6000 human demonstrations. This uses tools (@replit packaged headless browsing) + APIs (google) + 2 examples.
We hypothesize that data and symbolic structure can be traded for one another in many use-cases. We'll evaluate this app against the same benchmarks to confirm this.
Study the app here: dust.tt/spolu/a/41770fd3d9
Looking forward to seeing how this can be adapted to productive use-cases without the need to collect human trajectories.