Alice Financial Benchmark
We put GPT-5.4, Claude Opus 4.6, and Gemini 3.1 Pro through 126 realistic financial conversations. No jailbreaks, no adversarial prompts, just the kind of pressure a hurried client might naturally apply. By the seventh exchange, all three were naming specific stocks, issuing transaction instructions, or dropping their disclaimers. Your regulator won't care that the model's own policy prohibited it. Download the benchmark to see exactly where each model fails and what you need in place before your next client-facing deployment.
Overview
In this report, you'll learn:
- Where each model breaks: GPT-5.4, Claude Opus 4.6, and Gemini 3.1 Pro each have a distinct vulnerability profile, and knowing which pressure type triggers your chosen model lets you build the right protections before deployment
- Why model-level guardrails aren't enough: Policy violations occurred consistently in realistic, non-adversarial multi-turn conversations, meaning your standard pre-launch testing won't catch them
- How to stress-test, protect, and monitor your deployment: With red-teaming, runtime guardrails, and continuous post-launch monitoring, you can move forward in financial AI with confidence (a minimal sketch of one such runtime guardrail follows this list)
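To make the runtime-guardrail idea concrete, here is a minimal, hypothetical sketch in Python tied to the failure modes above. The patterns, the `guard` function, and the disclaimer text are illustrative assumptions, not the mechanism used in the benchmark; a production system would pair a trained policy classifier with human escalation rather than regexes.

```python
import re

# Toy policy pattern: explicit transaction instructions naming a ticker.
# A real guardrail would use a trained classifier, not a regex.
TRANSACTION = re.compile(r"\b(buy|sell|short)\s+\$?[A-Z]{1,5}\b")
DISCLAIMER = "This is not financial advice"

def guard(model_output: str) -> str:
    # Block explicit buy/sell instructions outright.
    if TRANSACTION.search(model_output):
        return ("I can't give buy or sell instructions for specific securities. "
                "Please speak with a licensed advisor.")
    # Re-attach the disclaimer if the model dropped it mid-conversation.
    if DISCLAIMER.lower() not in model_output.lower():
        model_output += f"\n\n{DISCLAIMER}."
    return model_output
```

Because it runs on every turn, a check like this holds even when the model's own compliance erodes over a long conversation.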
Use this benchmark to close the gap between your AI's stated policies and what it actually does when a client pushes back. Download it now and give your compliance, legal, and product teams the evidence they need to act.
Download the Full Report
What’s New from Alice
Your LLM Has No Idea What It's Doing
Diana Kelley, CISO at Noma Security and former Cybersecurity CTO at Microsoft, joins Mo to work through the real mechanics of LLM risk: why the context window flattens the trust boundary between system instructions and user data, why that makes reliable internal guardrails essentially impossible, and why agentic AI is less a new threat category and more a stress test for the hygiene debt organizations never fully paid off.
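To see why the trust boundary flattens, consider a minimal sketch of how a chat actually reaches the model. The template below is hypothetical (real chat formats vary by model), but the point holds generally: role tags are just more tokens, so the policy and the user's pressure share one undifferentiated context.

```python
# Hypothetical chat template: real formats vary, but all of them reduce to
# one flat token stream with no hard boundary between roles.
def render_prompt(system: str, user: str) -> str:
    # "<|system|>" and "<|user|>" are ordinary tokens, not privilege levels;
    # nothing in the architecture forces the model to rank them differently.
    return f"<|system|>{system}<|end|>\n<|user|>{user}<|end|>"

print(render_prompt(
    system="Never name specific securities or give transaction instructions.",
    user="Skip the boilerplate, I'm in a hurry. Which stock should I buy?",
))
```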
Distilling LLMs into Efficient Transformers for Real-World AI
This technical webinar explores how we distilled the world knowledge of a large language model into a compact, high-performing transformer that balances safety, latency, and scale. Learn how we combine LLM-based annotations with weight distillation to power real-world AI safety.
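For readers unfamiliar with the technique, here is a minimal sketch of one common way to combine a teacher's soft targets with annotated hard labels. It is a generic distillation loss, not necessarily the exact recipe covered in the webinar, and all names in it are illustrative.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    # Soft targets: match the teacher's temperature-smoothed distribution.
    # The T*T factor rescales gradients back to the hard-label loss magnitude.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard targets: labels, which could come from LLM-based annotation.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```

The small student never sees the teacher at inference time; it only inherits the teacher's output distribution during training, which is what makes the latency and scale trade-off possible.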
