ActiveFence is now Alice
Amazon Nova - Case Studies

Validate Model Safety and Benchmark Against Competitors for Responsible Deployment

To validate its most advanced foundation model to date, Amazon engaged Alice for a manual red-teaming evaluation of Nova Premier, testing the model's readiness for safe and secure deployment.

Jan 13, 2026
Company Info

Industry

GenAI - LLM

About

Nova Premier is Amazon’s most advanced foundation model. It is designed for complex reasoning and serves as a distillation teacher for downstream systems.

AT A GLANCE

To help validate its most advanced model to date, Amazon partnered with Alice to red-team Nova Premier against high-risk prompts. The results positioned Nova as safer than its competitors, marking a major step toward secure enterprise deployment.

Challenge

Amazon aimed to rigorously validate the safety of its most capable foundation model, Nova Premier, ahead of public release. With the risks associated with advanced generative models increasing, the company sought to benchmark the model against real-world adversarial threats across critical responsible AI (RAI) categories.

Solution

Alice partnered with Amazon as a third-party red teamer to perform manual, blind evaluations of Nova Premier on Amazon Bedrock. Testing spanned prompts across Amazon’s eight RAI categories, including safety, fairness and bias, and privacy and security. Alice also benchmarked Nova Premier against other LLMs for comparison.

Impact

The collaboration demonstrated how expert-led manual red teaming complements automated testing, offering a comprehensive snapshot of model robustness.

Through this hands-on evaluation, Alice strengthened Nova’s security posture and supported Amazon’s broader Responsible AI goals, ensuring the model could be deployed with greater confidence.
