How Alice Helped One of the Most Recognized Toy Companies Gain Full Control of Its Child-Facing Product
A leading global toy company had built a personalized storytelling experience for children that combined user-generated inputs, AI-generated narrative, and physical product delivery. The application was built on top of a leading large language model (LLM), but when the LLM made an unannounced change to its default content guardrails, it disrupted the company's production system entirely. The client came to Alice with a clear requirement: greater control and the ability to customize their guardrails. Alice delivered a complete custom guardrails solution covering both text and image content, including a novel detection capability built specifically for the product's unique and nuanced format, within two weeks.
How Alice Helped One of the Most Recognized Toy Companies Gain Full Control of Its Child-Facing Product
Company Size
Industry
About
Following a platform-dependency incident that disrupted their entire content moderation system, Alice delivered a full custom guardrails solution covering text and image enforcement within two weeks. Four specialized detection capabilities were built for the client's specific risk profile: competitor logo detection, children-in-risky-context detection, age-bucket classification, and a first-of-its-kind LLM-based context detector designed for the product's historically grounded format. The engagement replaced fragile default dependencies with an enforcement layer the client fully owns.
Challenge
The client had built a personalized storytelling experience for children. Users upload a photo and share their interests; the application generates a custom story featuring them alongside a beloved character, complete with AI-generated imagery, available to purchase as a physical product.
To enforce content policy across user inputs and AI-generated outputs, the client had been relying on the out-of-the-box guardrails that came with the LLM they were building on. When that LLM made an unannounced change to its guardrails, the effect rippled through the client's entire production environment.
Depending on default guardrails meant that enforcement rules could change at any time, for any reason, without warning. The client came to Alice seeking a solution that offered greater control and customization across both text and image generation, within the three-week window they had before going to production.
There was also a challenge specific to their product format that required a high degree of nuance. Characters, settings, costumes, and props are expected to reflect their narrative context accurately, and meeting this requirement demanded contextual reasoning that standard out-of-the-box guardrails are not built to provide.
How Alice Helped
Alice delivered a complete guardrails solution covering text and image enforcement end-to-end, within two weeks.
Four custom detection capabilities were built for the client's deployment:
→ Competitor logo detection, identifying brand marks in user-uploaded and AI-generated images
→ Children in risky contexts, detecting unsafe imagery involving minors
→ Age-bucket classification, with a specific enforcement threshold at 13 and above
→ Narrative context detection, an LLM-based capability that flags contextually inaccurate content in generated images
Because the client had no production data at the start of the engagement, Alice ran sample data through the platform and provided a full statistical analysis comparing out-of-the-box guardrail performance against Alice's custom models. The result gave the client a clear, evidence-based view of what the customized solution delivered, and the ability to iterate on their actual content before going live.
The Results
Four custom detection capabilities replaced a generic moderation configuration that had already proven it could change without notice. Enforcement was now built to the client's policies, characters, and child safety requirements, not borrowed from a platform default.
Custom enforcement delivered measurably better precision on the risks that matter for a child-facing product. Product and safety stakeholders had the evidence they needed to move forward. The client launched with a guardrails layer that reflects who they are and what their product demands, and, for the first time, full control over their own safety layer.
Trusted by security and product teams in the world's most regulated industries
Alice brings years of adversarial intelligence expertise to AI security. We give enterprise teams the coverage that generic guardrails and one-time audits can't match.
Get a demoWhat’s New from Alice
Curiouser Soundbites: The AI Risk Debt Your Enterprise Is Already Carrying
Chances are your enterprise AI is moving a lot faster than your visibility into it and Alison Cossette has a lot to say about that. She joined Mo on Curiouser & Curiouser to get into the risk debt that's quietly building inside agentic systems, why observability and traceability aren't optional anymore, and what leaders actually need to do about it.
Afraid AI Will Replace You? Here's the One Skill It Can't
James Villarrubia went from building AI for NASA's drone and aerospace programs to becoming CTO of a travel tech company. In this episode, he and Mo get into why curiosity might be the most important skill in the AI era, what happens to our brains when we stop pushing back on the answers we get, and why the people most resistant to AI might actually be seeing something the rest of us are missing.
It Takes AI to Break AI: The Case for AI Red Teaming
As AI systems gain autonomy, organizations need security approaches built specifically for AI behavior. Learn why AI-driven red teaming is becoming a critical defense layer.
Evaluation of Instagram Teen Accounts
This report evaluates default and opt-in content protections under real-world and adversarial conditions. The study examines safeguard effectiveness, resilience against attempts to surface inappropriate content, and platform improvements made following testing.
