Reasoning
DynaBench
About
Dynamic benchmark for natural language understanding with adversarially-collected data to prevent data contamination and encourage robust model development.
Dynamic benchmark for natural language understanding with adversarially-collected data to prevent data contamination and encourage robust model development.