“GPT-4, an Artificial Intelligence Large Language Model, Exhibits High Levels of Accuracy on Dermatology Specialty Certificate Exam Questions”, Meghna Shetty, Michael Ettlinger, Magnus Lynch2023-07-14 (, )⁠:

Artificial Intelligence (AI) has shown considerable potential within medical fields including dermatology. In recent years a new form of AI, large language models, has shown impressive performance in complex textual reasoning across a wide range of domains including standardized medical licensing exam questions.

Here, we compare the performance of different models within the GPT family (GPT-3, GPT-3.5, and GPT-4) on 89 publicly available sample questions from the Dermatology specialty certificate examination.

We find that despite no specific training on dermatological text, GPT-4, the most advanced large language model, exhibits remarkable accuracy—answering in excess of 85% of questions correctly, at a level that would likely be sufficient to pass the SCE exam.