NIH/Multi-needleactiveLong context
NIH/Multi-needle
Metric: Retrieval Accuracy (higher is better)Introduced: 2024
About
Multi-needle retrieval benchmark extending the 'needle in a haystack' test to require simultaneous retrieval of multiple facts embedded in long contexts.