Session

Optimize Cost and User Value Through Model Routing AI Agent

Overview

ExperienceIn Person
TypeBreakout
TrackArtificial Intelligence
IndustryHealth and Life Sciences, Professional Services, Financial Services
TechnologiesMLFlow, Llama, Mosaic AI
Skill LevelAdvanced

Each LLM has unique strengths and weaknesses, and there is no one-size-fits-all solution. Companies strive to balance cost reduction with maximizing the value of their use cases by considering various factors such as latency, multi-modality, API costs, user need, and prompt complexity. Model routing helps in optimizing performance and cost along with enhanced scalability and user satisfaction. 

 

Overview of cost-effective models training using AI gateway logs, user feedback, prompt, and model features to design an intelligent model-routing AI agent. Covers different strategies for model routing, deployment in Mosaic AI, re-training, and evaluation through A/B testing and end-to-end Databricks workflows. Additionally, it will delve into the details of training data collection, feature engineering, prompt formatting, custom loss functions, architectural modifications, addressing cold-start problems, query embedding generation and clustering through VectorDB, and RL policy-based exploration.

Session Speakers

Aditya Gautam

/Senior Machine Learning Engineer
Meta