Quit Emailing Yourself

How much do language models memorize?

2 min read | Saved October 29, 2025 | Copied!

language-models 🤖 memorization 🤖 generalization 🤖 scaling-laws 🤖 data-analysis 🤖

Do you care about this?

A new method for estimating the memorization capacity of language models is proposed, distinguishing between unintended memorization and generalization. The study finds that GPT-style models have an estimated capacity of 3.6 bits per parameter, revealing that models memorize data until their capacity is reached, after which generalization begins to take precedence.

If you do, here's more

Click "Generate Summary" to create a detailed 2-4 paragraph summary of this article.

Questions about this article

No questions yet.