RealTalk

About

RealTalk is a real-world dialogue benchmark that evaluates an agent's memory persistence and dialogue management in realistic conversational scenarios.