Calling All Functions
Image created by author using Dall-EBenchmarking OpenAI function calling and explanationsThanks to Roger Yang for his contributions to this pieceObservability in third-party large language models (LLMs) is largely approached with benchmarking and evaluations since models like Anthropic’s Claude, OpenAI’s GPT models, and Google’s PaLM 2 are proprietary. In this blog post, we benchmark OpenAI’s GPT models with function calling and explanations against various performance metrics. We are specifically interested in how the…