Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps
MuSiQue: Multi-hop Questions via Single-hop Question Composition
InstructGPT: Training language models to follow instructions with human feedback
Self-Consistency Improves Chain-of-Thought Reasoning in Language Models
Boosting Theory-of-Mind Performance in Large Language Models via Prompting
Ask Me Anything (AMA): A simple strategy for prompting language models
What Can a Generative Language Model Answer About a Passage?
WebGPT: Improving the factual accuracy of language models through web browsing
Wikipedia Bibliography:
Platoββ: