Building safe GenAI applications: An end-to-end overview of red teaming for Large Language Models

A survey covering attack methods, evaluation, metrics, and tools for identifying and mitigating GenAI application vulnerabilities.

NAACL
April 30, 2025

Latest publications