Bibliography (7):

  1. https://github.com/vlievin/medical-reasoning

  2. GPT-3: Language Models are Few-Shot Learners

  3. InstructGPT: Training language models to follow instructions with human feedback

  4. PubMedQA: A Dataset for Biomedical Research Question Answering

  5. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

  6. Language Models (Mostly) Know What They Know