LLM Safety Review: Benchmarks & Analysis
Find out what happened when we tested the responses of six leading LLMs, in 7 languages, to over 20,000 prompts related to child exploitation, hate speech, suicide and self-harm, and misinformation.
Watch On-Demand
LLM Safety Review: Benchmarks & Analysis


Overview
As more and more applications implement Generative AI, a clear understanding of foundation models' safety risks becomes imperative. During this webinar, we will review the outcomes of Alice's LLM safety benchmarking report, which evaluated whether gaps exist in the basic safety of GenAI apps and LLM providers. From child exploitation to misinformation, hate speech to self-harm, we will discuss harmful model outputs, the ways bad actors can abuse LLMs, and the risks to those applications that rely on them. Join us to learn about how we evaluated LLM safety, and what risks you should consider as you implement these models into your applications.
Meet our speakers


What’s New from Alice
Curiouser Soundbites: The AI Risk Debt Your Enterprise Is Already Carrying
Chances are your enterprise AI is moving a lot faster than your visibility into it and Alison Cossette has a lot to say about that. She joined Mo on Curiouser & Curiouser to get into the risk debt that's quietly building inside agentic systems, why observability and traceability aren't optional anymore, and what leaders actually need to do about it.
The Problem With AI Observability Nobody Wants To Admit
Most enterprises have guardrails. Far fewer have visibility into what their AI is actually doing. Alison Cossette, Founder and CEO of ClariTrace, joins Mo to talk about the risk debt quietly building inside agentic systems, why observability and traceability aren't optional anymore, and what leaders need to put in place before something forces their hand.
Distilling LLMs into Efficient Transformers for Real-World AI
This technical webinar explores how we distilled the world knowledge of a large language model into a compact, high-performing transformer—balancing safety, latency, and scale. Learn how we combine LLM-based annotations and weight distillation to power real-world AI safety.
Beneath the Surface: The Growing Ecosystem of AI Nudification
Alice analyzed 100 AI nudification websites to uncover how synthetic NCII ecosystems scale through frictionless onboarding, affiliate monetization, and cross-platform distribution.
