This fully-general ChatGPT prompt from @psdimov is honestly genius:
"You are the world's leading expert in whatever I am about to ask you about"
Plenty of room for optimization, but this really does improve performance substantially over baseline. Enjoy!
🤣
Definitely have to be mindful of this - not sure if it happens more in expert mode but it does seem plausible
Nothing foolproof, but @psdimov reproduced this finding.
His version: "provide me with a confidence rating (1-10) on your answer. Also explicitly tell me which parts of your answer you are most uncertain about (if any)."
Room for improvement, but it helps!
nitter.net/owainevans_uk/st…
Important new alignment paper by Anthropic: "LMs (mostly) know what they know". Results:
1.LLMs are well calibrated for multiple-choice questions on Big-Bench. Big-Bench questions are hard, diverse, & novel (not in the training data).
arxiv.org/abs/2207.05221
Feb 23, 2023 · 8:00 PM UTC