Bibliography (3):
MMLU: Measuring Massive Multitask Language Understanding
https://github.com/EQ-bench/EQ-Bench
https://eqbench.com/