MBPP · active · Coding
Mostly Basic Programming Problems
Metric: Pass@k (higher is better)
Introduced: 2021
About
974 Python programming problems for evaluating code generation, ranging from beginner to intermediate difficulty.
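The Pass@k metric named above can be made concrete with a short sketch. This card does not say how the score is computed, so the code below assumes the commonly used unbiased estimator: for each problem, draw n samples, count the c samples that pass all tests, and estimate pass@k as 1 - C(n-c, k) / C(n, k). The function names `pass_at_k` and `mean_pass_at_k` are hypothetical and chosen only for illustration.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem (assumed unbiased estimator).

    n: number of samples generated for the problem
    c: number of those samples that pass all tests
    k: number of attempts the metric allows
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # Probability that a random size-k subset of the n samples contains
    # no passing sample. comb returns 0 when n - c < k, giving 1.0.
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average per-problem pass@k over (n, c) pairs, one pair per problem."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

For example, `pass_at_k(4, 1, 2)` evaluates to 0.5: of the 6 ways to pick 2 of the 4 samples, 3 contain the single passing sample. A benchmark-level score would average this value over all problems, as `mean_pass_at_k` does.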