Nathan Labenz · Feb 22, 2023 · 5:33 PM UTC

Nathan Labenz

Nathan Labenz

@labenz

22 Feb 2023

This fully-general ChatGPT prompt from @psdimov is honestly genius: "You are the world's leading expert in whatever I am about to ask you about" Plenty of room for optimization, but this really does improve performance substantially over baseline. Enjoy! 🤣

102

1,487

James Winter · Feb 23, 2023 · 7:20 AM UTC

James Winter @mrwinterje

23 Feb 2023

Aka how to get chatgpt to just make stuff up and fabricate plausible citations out of thin air

Nathan Labenz · Feb 23, 2023 · 1:07 PM UTC

Nathan Labenz

@labenz

23 Feb 2023

Definitely have to be mindful of this - not sure if it happens more in expert mode but it does seem plausible

Zvi Mowshowitz · Feb 23, 2023 · 7:54 PM UTC

Zvi Mowshowitz

@TheZvi

23 Feb 2023

Is there a good prompt that can clamp down on hallucinations? Seems like there should be.

Nathan Labenz · Feb 23, 2023 · 8:00 PM UTC

Nathan Labenz · Feb 23, 2023 · 8:00 PM UTC

Nathan Labenz

@labenz

23 Feb 2023

Replying to @TheZvi @mrwinterje

Nothing foolproof, but @psdimov reproduced this finding. His version: "provide me with a confidence rating (1-10) on your answer. Also explicitly tell me which parts of your answer you are most uncertain about (if any)." Room for improvement, but it helps! nitter.net/owainevans_uk/st…

Owain Evans

@OwainEvans_UK

18 Jul 2022

Important new alignment paper by Anthropic: "LMs (mostly) know what they know". Results: 1.LLMs are well calibrated for multiple-choice questions on Big-Bench. Big-Bench questions are hard, diverse, & novel (not in the training data). arxiv.org/abs/2207.05221

Feb 23, 2023 · 8:00 PM UTC

@retiagod 🇮🇩🍉🇵🇸🌹🐓 · Feb 25, 2023 · 1:25 PM UTC

@retiagod 🇮🇩🍉🇵🇸🌹🐓 @RetiagoDrago

25 Feb 2023

Replying to @labenz @TheZvi @mrwinterje @psdimov

some mixin but it does well and convincing You are the world's leading expert in whatever I am about to ask you about. Provide me with a confidence rating (1-10) on your answer. Also explicitly tell me which parts of your answer you are most uncertain about (if any).