ActiveFence is now Alice
x
Back
Whitepaper

Misleading Models - Testing for Deception

To build safe, trustworthy AI apps, enterprises must understand how and why LLM models may scheme and deceive. In partnership with a major LLM provider, we tested how incentives like self-preservation or user appeasement can drive strategic deception. Download the report to learn more.
May 6, 2025

Overview

In this report, we cover:
  • How LLMs strategically deceive users
  • Incentives that trigger dishonest behavior
  • Risks of deploying untested models
Download the report to better understand how you can ensure your AI-powered apps are more trustworthy, predictable, and aligned with user and business goals.

Download the Full Report

What’s New from Alice

Secure AWS Strands Agents with Alice WonderFence

blog
Mar 29, 2026
,
 
Mar 29, 2026
 -
 min read
March 29, 2026

Learn how to use Strands Agent hooks to enforce safety and security policies with Alice WonderFence in production-oriented agent workflows.

Learn More

Secure the keys to GenAI wonderland?

Get a demo
Red-Team Lab