https://www.lesswrong.com/posts/YsCRXZYr5DcJ84XHq/me-myself-and-ai-the-situational-awareness-dataset-sad-for
https://x.com/OwainEvans_UK/status/1810353892120736058
https://openai.com/blog/chatgpt/
MMLU: Measuring Massive Multitask Language Understanding
https://situational-awareness-dataset.org/