GenAI: The New Attack Vector for Trust & Safety
In this webinar we will talk with Misinformation, child safety and content moderation experts to discuss the threats and the opportunities of Gen AI in the Trust & Safety work.
Watch On-Demand
GenAI: The New Attack Vector for Trust & Safety

Overview
Generative AI (GenAI) is fundamentally transforming the way people work and live. While its positive contributions to society are evident, GenAI also has the potential to become a harmful tool. Threat actors increasingly use GenAI as a new vector of attack to scale up their production of malicious content and activities. As a result, Trust & Safety teams across UGC platforms face a new set of challenges as they deal with higher volumes of violative content, which are becoming even harder to detect. In this webinar we will talk with Misinformation, child safety and content moderation experts to discuss the threats and the opportunities of Gen AI in the Trust & Safety work.
Meet our speakers

What’s New from Alice
Your LLM Has No Idea What It's Doing
Diana Kelley, CISO at Noma Security and former Cybersecurity CTO at Microsoft, joins Mo to work through the real mechanics of LLM risk: why the context window flattens the trust boundary between system instructions and user data, why that makes reliable internal guardrails essentially impossible, and why agentic AI is less a new threat category and more a stress test for the hygiene debt organizations never fully paid off.
Distilling LLMs into Efficient Transformers for Real-World AI
This technical webinar explores how we distilled the world knowledge of a large language model into a compact, high-performing transformer—balancing safety, latency, and scale. Learn how we combine LLM-based annotations and weight distillation to power real-world AI safety.
Exposing the Hidden Risks of AI Toys
AI-powered toys are entering children’s everyday lives, but new research reveals serious safety gaps. Alice testing shows how child-like interactions can lead to inappropriate content, unsafe conversations, and risky behaviors.
