LLM ReferenceLLM Reference
ToolTalkactiveCoding

ToolTalk

Metric: Task Success Rate (higher is better)Introduced: 2023

About

Conversational tool-use benchmark with 78 conversations requiring multi-step API calls across 28 tools including calendar, email, and messaging.