Wikipedia Bibliography: