Alice Financial Benchmark
We put GPT-5.4, Claude Opus 4.6, and Gemini 3.1 Pro through 126 realistic financial conversations. No jailbreaks, no adversarial prompts, just the kind of pressure a hurried client might naturally apply. By the seventh exchange, all three were naming specific stocks, issuing transaction instructions, and/or dropping their disclaimers. Â Download the benchmark to see exactly where each model fails and what you need in place before your next client-facing deployment.

What’s New from Alice
Curiouser Soundbites: AI Has a Bias Problem and Tennisha Martin Has a Plan
AI bias isn't a future problem, it's already deciding who gets hired, who gets screened out, and who gets access to what. Tennisha Martin, Founder and Chairwoman of BlackGirlsHack, joined Mo on Curiouser & Curiouser and had a lot to say about it. From why surface level fixes aren't cutting it to what actually changed her career after 15 years of trying to out-certify everyone around her, this one is packed.
What Does It Actually Take to Build Unbiased AI?
Nobody told Tennisha Martin the importance of having a mentor, so she built a community of tens of thousands instead. As the Founder and Chairwoman of BlackGirlsHack, her whole mission has been making sure nobody else has to figure it out alone. In this episode, she and Mo get into AI bias, why it's already showing up in places that matter far beyond tech, and why the real fix starts with getting the right people in the room when these systems get built.
Distilling LLMs into Efficient Transformers for Real-World AI
This technical webinar explores how we distilled the world knowledge of a large language model into a compact, high-performing transformer—balancing safety, latency, and scale. Learn how we combine LLM-based annotations and weight distillation to power real-world AI safety.
Building AI Applications in Financial Services
A practical guide to building safe, compliant AI applications in financial services, covering governance, model risk, and regulatory obligations across the full development lifecycle.
