ModCon 2023 Breakout Session: MAX Development to Production in the Cloud

Ғылым және технология

In this talk Modular engineers Alex Nikitin and Navroop Bath discuss how to take Modular AI Engine performance optimized models to production. They show loading a model, optimizing it for performance using the Modular AI Engine, containerizing the model and a serving framework and using container orchestration systems like Kubernetes to host the model as a service.

Пікірлер