See Also
Links
“Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs”, Laine et al 2024
“Designing a Dashboard for Transparency and Control of Conversational AI”, Chen et al 2024
“LLM Evaluators Recognize and Favor Their Own Generations”, Panickssery et al 2024
“Beyond Memorization: Violating Privacy Via Inference With Large Language Models”, Staab et al 2023
“Taken out of Context: On Measuring Situational Awareness in LLMs”, Berglund et al 2023
“Truesight”
“Situational Awareness in Large Language Models”
“Me, Myself, and AI: the Situational Awareness Dataset (SAD) for LLMs”
“Language Models Model Us”
“The Case for More Ambitious Language Model Evals”
“Early Situational Awareness and Its Implications, a Story”
Miscellaneous
Link Bibliography
- https://arxiv.org/abs/2406.07882: “Designing a Dashboard for Transparency and Control of Conversational AI”, Chen et al 2024
- https://arxiv.org/abs/2309.00667: “Taken out of Context: On Measuring Situational Awareness in LLMs”, Berglund et al 2023