Protects the LGBTQ+ Community with Proactive Moderation Efforts
TrevorSpace needed to move beyond manual moderation and user flags to protect its community at scale. Alice deployed contextual AI detection tuned to the LGBTQ+ community's language, with automated, policy-aligned enforcement across thousands of forums.

Lifesaving Support for LGBTQ Youth

Company Size
Industry
About

“As the leader of our peer-to-peer networks, it’s crucial to ensure that we are creating a safe online space for all LGBTQ+ young people. With Alice, we’ve found a partner from day one to help safeguard our community, while strengthening our real-time moderation efforts. ”
The Trevor Project is the leading suicide prevention and crisis intervention organization for LGBTQ+ young people. With the rise in online harassment against the LQBTQ+ community, The Trevor Project is a critical place for young LGBTQ+ folks to have as a safe resource to reach out to and feel protected. With its goal to prevent LGBTQ+ youth suicide and self-harm, it was vital for The Trevor Project to implement Trust & Safety mechanisms to proactively moderate messages that could be toxic or harmful in order to increase the protection of their community. By moving to a proactive approach, they were able to quickly take action on harmful content and spend more time building a thriving community focused on peer-to-peer engagement.
Challenge
When a young person reaches out on TrevorSpace, they may be in crisis. In the case of suicide and self-harm, it's crucial to ensure the team takes action quickly to help a user access life-saving care in a timely manner. That's the environment The Trevor Project operates in every day.
Given the volume of content posted to TrevorSpace on a daily basis, it's impossible to monitor every interaction on the platform. Prior to Alice, the team relied on manual moderation, historical systems, and user flags to catch content that violated their policies. As a non-profit with limited resources, this approach couldn't scale with the growth of the community.
When launching TrevorSpace, the team understood the importance of implementing safety by design. Yet after launch, the popularity of the site meant they needed a vendor to help reduce reliance on user flags and automate content moderation for specific abuse areas, taking action on harmful content in a more operationally efficient way.
How Alice Helped
Detection
The Trevor Project needed a content moderation vendor that could cover the violations critical to their community: harassment and bullying, hate speech, child solicitation, suicide and self-harm. Coverage alone wasn't enough. The quality of the models was crucial. In an effort to reduce undetected content, they turned to ActiveScore, Alice's contextual AI automated detection capabilities.
A key challenge was ensuring detection models understood the difference between harmful content and the language LGBTQ+ youth naturally use in their community. Alice customized its hate speech models to identify relevant keyword lists that would remove words commonly used among the LGBTQ+ youth community from being flagged, aligning detection to The Trevor Project's specific policy rather than applying a generic model.
Enforcement
To strike a balance between providing a safe space and allowing the necessary freedom for community expansion, The Trevor Project needed a partner to implement their warnings and penalties guidelines quickly and effectively on TrevorSpace.
When a user violates a guideline and the moderation team becomes aware of it, users are issued warning points. TrevorSpace leverages no-code workflows to automatically implement these policies. Anyone with 0-5 points automatically receives a warning. Anyone with 6-7 points receives a two-week suspension. Those with over 8 points are permanently banned. They also use moderation queue management to manually moderate community messages with greater efficiency.
The Results
By using Alice, The Trevor Project is able to ensure greater protection against the most egregious harms facing their community on the TrevorSpace platform. Detection models are tuned to their specific community rather than working against it. Enforcement is automated, consistent, and aligned to their policies without requiring manual intervention at every step.
By incorporating a proactive approach to moderation, they have moderated thousands of forums on the platform and ensured that their users have a safe space to discuss the issues that matter most to them.
Trusted by security and product teams in the world's most regulated industries
Alice brings years of adversarial intelligence expertise to AI security. We give enterprise teams the coverage that generic guardrails and one-time audits can't match.
Get a demoWhat’s New from Alice
Curiouser Soundbites: The AI Risk Debt Your Enterprise Is Already Carrying
Chances are your enterprise AI is moving a lot faster than your visibility into it and Alison Cossette has a lot to say about that. She joined Mo on Curiouser & Curiouser to get into the risk debt that's quietly building inside agentic systems, why observability and traceability aren't optional anymore, and what leaders actually need to do about it.
The Problem With AI Observability Nobody Wants To Admit
Most enterprises have guardrails. Far fewer have visibility into what their AI is actually doing. Alison Cossette, Founder and CEO of ClariTrace, joins Mo to talk about the risk debt quietly building inside agentic systems, why observability and traceability aren't optional anymore, and what leaders need to put in place before something forces their hand.
Distilling LLMs into Efficient Transformers for Real-World AI
This technical webinar explores how we distilled the world knowledge of a large language model into a compact, high-performing transformer—balancing safety, latency, and scale. Learn how we combine LLM-based annotations and weight distillation to power real-world AI safety.
Beneath the Surface: The Growing Ecosystem of AI Nudification
Alice analyzed 100 AI nudification websites to uncover how synthetic NCII ecosystems scale through frictionless onboarding, affiliate monetization, and cross-platform distribution.
