Self-Improving Agents and Agent Evaluation With Arize & Databricks MLflow
Overview
Experience | In Person
---|---
Type | Breakout
Track | Artificial Intelligence
Industry | Enterprise Technology
Technologies | MLflow, Mosaic AI
Skill Level | Intermediate
Duration | 40 min
As autonomous agents become increasingly sophisticated and widely deployed, their ability to evaluate their own performance and continuously self-improve is essential. However, this growing complexity also amplifies risk, including exposure to malicious inputs and generation of undesirable outputs. In this talk, we'll explore how to build resilient, self-improving agents. To drive self-improvement effectively, the agent and its evaluation techniques must improve together in a continuously iterating feedback loop. Drawing on extensive real-world experience across numerous production use cases, we will demonstrate practical strategies for combining tools from Arize, Databricks MLflow and Mosaic AI to evaluate and improve high-performing agents.
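The co-evolving feedback loop described above can be sketched in a few lines of Python. The `Agent` and `Evaluator` classes below are hypothetical stand-ins (not Arize or MLflow APIs), shown only to illustrate the pattern: each round, the evaluator surfaces failures, the agent improves on them, and the evaluation set is hardened so it keeps pace with the agent.

```python
from dataclasses import dataclass, field

@dataclass
class Agent:
    # Toy proxy for agent capability: the agent handles any case
    # whose difficulty is at or below its current skill level.
    skill: int = 0

    def answer(self, case: int) -> bool:
        return case <= self.skill

    def improve(self, failures: list[int]) -> None:
        # Self-improvement step: learn from the hardest failing case
        # (in practice: prompt refinement, fine-tuning, tool fixes).
        if failures:
            self.skill = max(self.skill, max(failures))

@dataclass
class Evaluator:
    cases: list[int] = field(default_factory=lambda: [1, 2, 3])

    def run(self, agent: Agent) -> list[int]:
        # Return the cases the agent currently fails.
        return [c for c in self.cases if not agent.answer(c)]

    def harden(self) -> None:
        # The evaluation improves alongside the agent:
        # add a harder case each round so evals never go stale.
        self.cases.append(max(self.cases) + 1)

def feedback_loop(rounds: int = 3) -> Agent:
    agent, evaluator = Agent(), Evaluator()
    for _ in range(rounds):
        failures = evaluator.run(agent)   # evaluate the agent
        agent.improve(failures)           # agent self-improves on failures
        evaluator.harden()                # evaluator keeps pace
    return agent
```

In a production setting, the evaluate step would be backed by tracing and eval tooling (e.g. Arize or MLflow evaluation runs) rather than this toy difficulty model; the point is the loop structure, where neither the agent nor the evals are allowed to stand still.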
Session Speakers
Aparna Dhinakaran
Co-Founder and Chief Product Officer
Arize