19 October 2023 14:30 - 15:00
Run, tune, and scale generative models that power AI applications
Thanks to ChatGPT and Stable Diffusion, generative AI is no longer the domain of specialized engineers; it’s an essential part of any application developer’s toolkit.
But getting a generative AI app to production is fraught with challenges: model selection, integration, price/performance optimization, monitoring, scale, and AI infrastructure management, just to name a few.
App builders are trailblazing a new path because there is no clear GenAI playbook…yet.
In this talk, Luis Ceze will share the easiest way to run, tune, and scale the models that power generative AI applications, including the latest open source and custom models, without having to become a machine learning or AI infrastructure expert.
Luis will offer best practices and guidance for teams beginning their generative AI journey, as well as more advanced use cases for models like Llama2, SDXL, WhisperX and more.