Calibrating the Mosaic Evaluation GauntletApril 30, 2024 by Tessa Barton in Mosaic Research A good benchmark is one that clearly shows which models are better and which are worse. The Databricks Mosaic Research team is dedicated...