GPQA
Reasoning
Graduate-Level Google-Proof Q&A
About
Evaluates graduate-level reasoning capabilities.
Resources
GitHub
arXiv Paper
HuggingFace
Papers With Code