Beyond the Leaderboard: Unpacking Function Calling Evaluation

August 16, 2024 by Kartik Sreenivasan, Jeffrey Chen, Pallavi Koppol, Eitan Turok, Bay Foley-Cox, Asfandyar Qureshi and Sam Havens in Mosaic Research

1. Introduction

The research and engineering community at large has been continuously iterating on Large Language Models (LLMs) in order to make them...