LLM Reference
Code editing across 8 programming languages
Coding

Aider Polyglot

About

A real-world code-editing benchmark that measures a model's ability to apply changes to existing codebases across 8 programming languages (Python, JavaScript, TypeScript, Go, Rust, Java, C++, C). It tests whole-file editing by asking models to solve exercises from the Exercism platform. The score is the percentage of exercises completed correctly.
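
As a rough illustration of the scoring arithmetic, the sketch below computes a percentage from per-exercise pass/fail outcomes. The function name `pass_rate`, the result layout, and the exercise names are hypothetical and chosen only for this example. They are not taken from the benchmark's own tooling, and the numbers are made up.

```python
def pass_rate(results):
    """Return the percentage of exercises marked as solved.

    `results` maps an exercise identifier to True (solved) or False.
    This layout is an assumption made for illustration only.
    """
    if not results:
        raise ValueError("no exercise results to score")
    solved = sum(1 for passed in results.values() if passed)
    return 100.0 * solved / len(results)


# Hypothetical outcomes, not real benchmark data.
example_results = {
    "python/exercise-a": True,
    "go/exercise-b": False,
    "rust/exercise-c": True,
    "java/exercise-d": True,
}

score = pass_rate(example_results)
print(f"{score:.1f}%")  # 3 of 4 solved -> 75.0%
```

Under this reading, every exercise counts equally regardless of language, so a model that is strong in one language and weak in another is summarized by a single number.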

Resources

Website
GitHub