CTRL: A Conditional Transformer Language Model for Controllable Generation
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws
https://factquizmaster.com/
https://arxiv.org/pdf/2501.01956#page=8
TTT-NN: Test-Time Training on Nearest Neighbors for Large Language Models