CRUXEval

About

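CRUXEval (Code Reasoning, Understanding, and eXecution Evaluation) is a benchmark of 800 short Python functions, each paired with an input and its output. It tests code reasoning and execution through two tasks: predicting a function's output for a given input, and predicting an input that produces a given output.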
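
As an illustration of what a code reasoning item can look like, the sketch below scores a predicted output by running a small function and comparing the result. The function `sample_function`, the helper `check_output_prediction`, and the example values are hypothetical; they are not taken from the benchmark's data or its evaluation harness.

```python
# Hypothetical sketch of scoring one code reasoning item.
# Not the benchmark's official harness; all names here are illustrative.

def sample_function(text: str) -> str:
    """Made-up task function: upper-case every character at an even index."""
    result = []
    for i, ch in enumerate(text):
        result.append(ch.upper() if i % 2 == 0 else ch)
    return "".join(result)


def check_output_prediction(func, task_input, predicted_output) -> bool:
    """Return True if the predicted output equals what the function returns."""
    return func(task_input) == predicted_output


if __name__ == "__main__":
    # A model would be shown the function and the input, then asked for the output.
    print(check_output_prediction(sample_function, "abcd", "AbCd"))
    print(check_output_prediction(sample_function, "abcd", "abcd"))
```

Scoring by execution rather than by string-matching the model's explanation keeps the check objective: a prediction counts only if it matches the function's real behavior.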