Whitepaper

Misleading Models - Testing for Deception

To build safe, trustworthy AI apps, enterprises must understand how and why LLMs may scheme and deceive. In partnership with a major LLM provider, we tested how incentives such as self-preservation or user appeasement can drive strategic deception. Download the report to learn more.
May 6, 2025

Download the Full Report

Overview

In this report, we cover:
  • How LLMs strategically deceive users
  • Incentives that trigger dishonest behavior
  • Risks of deploying untested models
Download the report to learn how to make your AI-powered apps more trustworthy, predictable, and aligned with user and business goals.

Secure the keys to GenAI wonderland

Get a demo
Red-Team Lab