The Anthropic interpretability team shares preliminary research on cross-modal features in language models, particularly the models' ability to recognize and generate visual concepts in text-based formats such as ASCII art and SVG. They demonstrate how specific features activate based on context and how steering those features can alter the generated visuals, yielding insights into the models' internal workings and pointing to future research directions.
interpretability ✓
language-models ✓
cross-modal ✓
+ svg
+ ascii