
Meet with Alice
About the Event
A focused look at the next wave of AI Agents, the ones you can find in any organization, behind any workflow.
Builders, researchers, and product leaders come together to unpack what it really means to deploy autonomous systems in the real world.
We’ll be in New York, where things move fast - and so does AI.
If you're thinking about deploying agentic AI, we should definitely meet.

Evaluating autonomous agents: Bridging the gap between testing and real-world performance
Evaluating autonomous agents is harder than evaluating static models or prompt-based systems. Their behavior unfolds over sequences of steps, interacts with tools and environments, and can shift in live conditions in ways offline tests miss. In this panel, engineers share how they measure agent behavior, analyze full trajectories rather than single outputs, and identify the signals that matter in real-world contexts, offering candid insights on what works, what doesn’t, and which evaluation challenges remain.


