1 link tagged with all of: reasoning + large-language-models + text-based-games
Links
TextQuests introduces a benchmark for evaluating Large Language Models (LLMs) on classic text-based video games, focusing on their ability to perform long-context reasoning and to learn through exploration. The evaluation measures agents' game progress and ethical behavior across a range of interactive fiction titles, revealing challenges such as hallucination and inefficient dynamic reasoning. The aim is to help researchers better understand LLM capabilities in complex, exploratory environments.