Bibliography (24):

  1. GPT-3: Language Models are Few-Shot Learners

  2. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

  3. Measuring and Narrowing the Compositionality Gap in Language Models (https://arxiv.org/pdf/2210.03350#page=17)

  4. Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps

  5. MuSiQue: Multi-hop Questions via Single-hop Question Composition

  6. Large Language Models are Zero-Shot Reasoners

  7. InstructGPT: Training language models to follow instructions with human feedback

  8. Measuring and Narrowing the Compositionality Gap in Language Models (https://arxiv.org/pdf/2210.03350#page=24)

  9. RealTime QA: What’s the Answer Right Now?

  10. SerpApi (https://serpapi.com/)

  11. Measuring and Narrowing the Compositionality Gap in Language Models (https://arxiv.org/pdf/2210.03350#page=24)

  12. Large Language Models Can Self-Improve

  13. Language Model Cascades

  14. Self-Consistency Improves Chain-of-Thought Reasoning in Language Models

  15. Language Models (Mostly) Know What They Know

  16. Boosting Theory-of-Mind Performance in Large Language Models via Prompting

  17. Ask Me Anything (AMA): A simple strategy for prompting language models

  18. What Can a Generative Language Model Answer About a Passage?

  19. Language Models are Multilingual Chain-of-Thought Reasoners

  20. Predictability and Surprise in Large Generative Models

  21. WebGPT: Improving the factual accuracy of language models through web browsing