ActiveFence is now Alice
x
Back
Amazon Nova
-
Case Studies

Validate Model Safety and Benchmark Against Competitors for Responsible Deployment

Amazon Nova partnered with Alice to manually red team Nova Premier, their most advanced generative AI foundation model, testing safety, fairness, bias, and privacy across eight responsible AI categories ahead of enterprise deployment.

Apr 24, 2026
Get a demo
Company Info

Company Size

Industry

GenAI - Foundation Models

About

Amazon AGI is Amazon's artificial general intelligence research division, developing frontier foundation models including the Nova family. Nova Premier represents Amazon's most capable model to date, designed for complex reasoning and serving as a distillation teacher for downstream systems deployed across Amazon Bedrock.

"Through this hands-on evaluation, Alice strengthened Nova’s security posture and supported Amazon’s broader Responsible AI goals, ensuring the model could be deployed with greater confidence."

Rahul Gupta
-
Senior Manager, Responsible AI, Amazon AGI
AT A GLANCE

To help validate its most advanced model to date, Amazon partnered with Alice to red-team Nova Premier against high-risk prompts. The results positioned Nova as safer than its competitors, marking a major step toward secure enterprise deployment.

Challenge

Amazon aimed to rigorously validate the safety of Nova Premier, its most capable foundation model to date, ahead of public release. As foundation models grow more powerful, the attack surface expands - adversarial inputs, prompt injection attempts, fairness failures, and privacy exposures become harder to anticipate through automated testing alone.

Amazon sought a third-party red teaming partner with deep domain expertise to stress-test Nova Premier against real-world adversarial threats across its eight Responsible AI categories — including safety, fairness and bias, and privacy and security -before the model reached enterprise customers. External validation was essential to ensure the evaluation was rigorous, unbiased, and credible."

How Alice Helped

Alice partnered with Amazon as an independent third-party red teamer to conduct manual, blind evaluations of Nova Premier on Amazon Bedrock - ensuring the assessment was uninfluenced by internal assumptions or model familiarity.

Alice's subject matter experts crafted adversarial prompts targeting Nova Premier's most critical risk surfaces, spanning all eight of Amazon's Responsible AI categories: safety, fairness and bias, privacy and security, and more. The manual approach was deliberate - expert-led testing surfaces edge cases, nuanced policy failures, and culturally specific risks that automated pipelines routinely miss.

Alice also conducted comparative LLM benchmarking, evaluating Nova Premier's safety posture against other frontier models to give Amazon a clear picture of where the model stood relative to the competitive landscape ahead of deployment.

The Results

The evaluation provided Amazon with a comprehensive, third-party validated picture of Nova Premier's safety posture ahead of launch.

Key outcomes included:

  • Nova Premier was benchmarked as safer than its competitor models across the tested RAI categories, giving Amazon confidence in its relative safety positioning at launch
  • Expert-led manual testing surfaced edge cases and adversarial vulnerabilities that automated evaluation alone would not have detected
  • Findings directly informed Amazon's pre-launch safety decisions, supporting responsible deployment across Amazon Bedrock
  • The collaboration supported Amazon's broader Responsible AI goals with independent, audit-ready evidence of safety validation

The engagement demonstrated the value of combining expert-led manual red teaming with automated testing  a comprehensive approach that has become essential for any foundation model team preparing for enterprise deployment. For teams facing similar pre-launch validation challenges, explore how Alice approaches foundation model security.

Share

Trusted by security and product teams in the world's most regulated industries

Alice brings years of adversarial intelligence expertise to AI security. We give enterprise teams the coverage that generic guardrails and one-time audits can't match.

Get a demo

What’s New from Alice

Beneath the Surface: The Growing Ecosystem of AI Nudification

whitepaper
May 19, 2026
,
 
May 19, 2026
 -
This is some text inside of a div block.
 min read
May 19, 2026

Alice analyzed 100 AI nudification websites to uncover how synthetic NCII ecosystems scale through frictionless onboarding, affiliate monetization, and cross-platform distribution.

Learn More