Session

Composing High-Accuracy AI Systems With SLMs and Mini-Agents

Overview

Experience	In Person
Type	Breakout
Track	Artificial Intelligence
Industry	Enterprise Technology, Health and Life Sciences, Financial Services
Technologies	AI/BI, Llama
Skill Level	Beginner
Duration	40 min

For most companies, building compound AI systems remains aspirational. LLMs are powerful, but imperfect, and their non-deterministic nature makes steering them to high accuracy a challenge. In this session, we’ll demonstrate how to build compound AI systems using SLMs and highly accurate mini-agents that can be integrated into agentic workflows. You'll learn about breakthrough techniques, including: memory RAG, an embedding algorithm that reduces hallucinations using embed-time compute to generate contextual embeddings, improving indexing and retrieval, and memory tuning, a finetuning algorithm that reduces hallucinations using a Mixture of Memory Experts (MoME) to specialize models with proprietary data. We’ll also share real-world examples (text-to-SQL, factual reasoning, function calling, code analysis and more) across various industries. With these building blocks, we’ll demonstrate how to create high accuracy mini-agents that can be composed into larger AI systems.

Session Speakers

Sharon Zhou

/CEO & Cofounder
Lamini