Partnership opportunities

Register now

Call to action
Your text goes here. Insert your content, thoughts, or information in this space.
Button

Back to speakers

Zishan
Ahmed Shaikh
Senior Data Scientist
TomTom
A multi-patented data scientist and AI expert, Zishan has more than a decade of experience in AI using Computer Vision for multiple domains. His expertise is reflected in multiple granted US patents for solutions created during his tenure as Data Scientist at top MNCs. His expertise includes geo-spatial solutions.
12 September 2024 15:00 - 15:30
Automating computer vision workflows with foundation models
Recent advancements in foundation models like GPT- 4o, Florence, SAM2 are revolutionising computer vision. Traditional computer vision tasks often required custom training for specific applications, consuming significant time and computational resources. Foundation models, however, are large-scale, pre-trained models that serve as a universal base for a wide range of tasks. These models can be fine-tuned with minimal data, enabling faster deployment. The talk will highlight how foundation models streamline the automation of complex workflows, reducing manual intervention and enabling scalable solutions. We discuss real-world applications and understand the future potential of combining vision-based models with broader LLM systems to automate and enhance visual data processing at scale.