27 August 2025 09:50 - 10:10
Evaluation in autonomous driving
How do you evaluate safety, performance, and edge-case handling in fully autonomous systems?
This session dives into the LLM use cases, methodologies, and metrics Waymo uses to rigorously test autonomous driving models at scale across simulation, real-world deployment, and long-tail scenarios.