Sign In
Register

Partnership opportunities

Secure your seat

Call to action
Your text goes here. Insert your content, thoughts, or information in this space.
Button

Back to speakers

Mahmoud
Fahmy
Lead AI Engineer
Mastercard
Mahmoud Fahmy is the Lead AI Engineer at Mastercard, specializing in machine learning deployment and scalable solutions for seamless LLM integration intoenterprise environments. With a strong focus on reliability and efficiency, he has authored several influential publications, including: → Optimizing the Giants: Navigating Technical Challenges and Debt in LLM Deployment. → Deep Learning Pipelines with TensorFlow. His ongoing research explores the complexities of deploying LLMs on-premises, offering technical strategies and governance frameworks tailored for enterprise adoption. Additionally, he is actively researching Agentic AI, focusing on autonomous decision-making systems and their implications for enterprise AI applications. As a recognized thought leader, Mahmoud bridges the gap between cutting-edge AI research and real-world engineering, helping organizations harness the transformative power of LLMs responsibly and effectively.
Button
02 December 2025 16:30 - 17:00
Panel | Designing robust LLM evaluation frameworks for performance and alignment
In this session, we’ll explore how to build robust evaluation frameworks that go beyond benchmarks to capture safety, bias, hallucination, and task-specific performance. We’ll cover practical methods for stress-testing LLMs in enterprise settings, defining custom evaluation metrics, and building scalable pipelines to validate model behavior over time. → Key techniques for measuring alignment, reliability, and safety. → How to operationalise evaluation workflows across teams and use cases.