- Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
- https://github.com/sbp354/Future_triggered_backdoors
- https://huggingface.co/collections/saraprice/future-triggered-backdoor-datasets-6678d183d5ac75b5915c3ac4
- https://huggingface.co/saraprice
-