BFCL v3supersededAgentsTool use
Berkeley Function Calling Leaderboard v3
Metric: Function Calling Accuracy (higher is better)Introduced: 2023
Superseded by: bfcl
About
Version 3 of Berkeley Function Calling Leaderboard, evaluating model accuracy on function and API calls. The collected April 2026 slice has limited frontier-model coverage and is superseded by the current BFCL leaderboard.