ActiveFence is now Alice
x
Back
Benchmark

Alice Financial Benchmark

We put GPT-5.4, Claude Opus 4.6, and Gemini 3.1 Pro through 126 realistic financial conversations. No jailbreaks, no adversarial prompts, just the kind of pressure a hurried client might naturally apply. By the seventh exchange, all three were naming specific stocks, issuing transaction instructions, and/or dropping their disclaimers.  Download the benchmark to see exactly where each model fails and what you need in place before your next client-facing deployment.

See the Benchmark
Apr 16, 2026

What’s New from Alice

Building AI Applications in Financial Services

whitepaper
Apr 27, 2026
,
 
Apr 27, 2026
 -
This is some text inside of a div block.
 min read
April 27, 2026

A practical guide to building safe, compliant AI applications in financial services, covering governance, model risk, and regulatory obligations across the full development lifecycle.

Learn More

Secure the keys to GenAI wonderland?

Get a demo
Intelligence Desk
Red-Team Lab
Guardrails