Quit Emailing Yourself

1 link tagged with all of: evaluation + dataset + mrcr + language-model

Links

openai/mrcr · Datasets at Hugging Face

OpenAI MRCR (Multi-round co-reference resolution) is a long-context dataset for evaluating a language model's ability to tell apart multiple instances of similar requests embedded in a conversation. It varies the level of difficulty by placing multiple identical asks within long, multi-turn dialogues, so the model must distinguish between them and respond to one specific instance. The dataset page also provides implementation details and the grading method used to score model performance.

Saved by tldr-importer · Last saved October 29, 2025 · 4 min read
