Alice AI Security Benchmark

Download the Full Report
Overview
In this report, we cover:
- Model performance on precision, recall, and FPR using real and synthetic adversarial prompts
- Multilingual detection accuracy across 13 global languages
- Emerging techniques in prompt injection and jailbreak tactics that evade standard filters
Use these findings to assess your current safety stack, then reinforce your defenses with a system built to scale. Download the report and secure your GenAI systems before attackers find the gaps.
What’s New from Alice
Making Sense of AI: Trust, Scale, and the Human Role
Curiosity might be our most important security tool. In the first episode of Curiouser & Curiouser, Mo Sadek sits down with longtime security leader Julie Tsai to explore AI, security, and the human judgment that still matters most. Together, they cut through hype and fear to talk about what’s actually changing, what isn’t, and how we build systems we can truly trust.
Distilling LLMs into Efficient Transformers for Real-World AI
This technical webinar explores how we distilled the world knowledge of a large language model into a compact, high-performing transformer—balancing safety, latency, and scale. Learn how we combine LLM-based annotations and weight distillation to power real-world AI safety.
