Request to partner

Get your pass

Call to action
Your text goes here. Insert your content, thoughts, or information in this space.
Button

Back to speakers

Mo
Sadek
Technical Director
Alice
Mo is an AI security practitioner with over a decade of experience in application and product security, spanning hands-on engineering, program building, and cross-functional leadership. At Alice, he focuses on advancing how organizations understand, test, and manage real-world risks in AI systems, helping bridge security research, product, and emerging AI governance needs.
Button
04 June 2026 12:00 - 12:30
Panel | Evaluating autonomous agents: Bridging the gap between testing and real-world performance
Evaluating autonomous agents is harder than static models or prompt-based systems. Their behavior unfolds over sequences, interacts with tools and environments, and can shift in live conditions in ways offline tests miss. In this panel, engineers share how they measure agent behavior, analyze trajectories over single outputs, and identify the signals that matter in real-world contexts, offering candid insights on what works, what doesn’t, and remaining evaluation challenges.