LLM Reference
BFCL v3 | superseded | Agents | Tool use

Berkeley Function Calling Leaderboard v3

Metric: Function Calling Accuracy (higher is better)

Introduced: 2023

Superseded by: bfcl

About

Version 3 of the Berkeley Function Calling Leaderboard, which evaluates model accuracy on function and API calls. The slice collected in April 2026 has limited coverage of frontier models and is superseded by the current BFCL leaderboard.