LLM Wardens: Mitigating Adversarial Persuasion with Third-Party Conversational Oversight
Lennart Wachowiak, Scott Blain, David Williams-King, Samuele Marro