Bibliography (3):
Discovering Language Model Behaviors with Model-Written Evaluations
PaLM: Scaling Language Modeling with Pathways
https://github.com/google/sycophancy-intervention