02 December 2025 16:30 - 17:00
Panel | Designing robust LLM evaluation frameworks for performance and alignment
In this session, we’ll explore how to build robust evaluation frameworks that go beyond standard benchmarks to capture safety, bias, hallucination, and task-specific performance.
We’ll cover practical methods for stress-testing LLMs in enterprise settings, defining custom evaluation metrics, and building scalable pipelines to validate model behaviour over time.
→ Key techniques for measuring alignment, reliability, and safety.
→ How to operationalise evaluation workflows across teams and use cases.