4 min read
Saved February 14, 2026
Do you care about this?
This article discusses TinyLoRA, a method developed by researchers at Meta that improves a large language model's math reasoning by updating only 13 parameters. The findings suggest that minimal updates can yield significant improvements, though the results may not generalize to other domains. It also looks at the quality of various GGUF builds for coding tasks.
If you do, here's more
TinyLoRA aims to improve large language models (LLMs) by updating just 13 parameters instead of retraining millions. Researchers from Meta's AI unit, led by John X. Morris, report that this approach can improve math reasoning in models like Qwen without the heavy compute cost typically associated with reinforcement learning (RL). The method trains a compact vector that is projected through a fixed tensor into a full weight update, cutting the number of trainable parameters dramatically. In tests, updating just 13 parameters lifted Qwen2.5-7B-Instruct's accuracy on GSM8K from about 88% to 91%.
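The core mechanism can be sketched in a few lines: a tiny trainable vector is expanded into a full weight update through a fixed, untrained projection, so only the vector receives gradients. The names, shapes, and the use of a random projection below are illustrative assumptions, not the paper's actual code.

```python
import numpy as np

rng = np.random.default_rng(0)

d_out, d_in = 64, 64   # toy layer dimensions (assumed for illustration)
k = 13                 # the only trainable parameters

W = rng.standard_normal((d_out, d_in))      # frozen base weight
P = rng.standard_normal((d_out * d_in, k))  # fixed projection, never trained
v = np.zeros(k)                             # trainable vector, initialized to zero

def effective_weight(v):
    """Frozen base weight plus the tiny update expanded through P."""
    return W + (P @ v).reshape(d_out, d_in)

# During RL only v would receive gradients; W and P stay frozen,
# so the optimizer state and checkpoint delta are 13 numbers.
x = rng.standard_normal(d_in)
y = effective_weight(v) @ x
```

With `v` at zero the layer behaves exactly like the base model, which is why such a scheme can start from the pretrained weights and move away from them only as far as 13 numbers allow.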
The findings indicate that RL is significantly more effective than supervised fine-tuning (SFT) at these tiny parameter counts. While the math-reasoning results are promising, the authors caution that the claims apply mainly to math problems with clear reward signals, not to harder-to-score domains like science or creative writing. They also note an architectural advantage for Qwen models over Llama: Qwen needs fewer trainable parameters to reach similar performance. The reasons for this remain unclear, leaving a gap in understanding why Qwen is so amenable to math-reasoning fine-tuning.
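What "clear reward signal" means in practice is that a math answer can be checked exactly, giving RL an unambiguous 0/1 reward. The sketch below illustrates the idea; the answer-extraction regex is an assumption for illustration, not GSM8K's official parser.

```python
import re

def math_reward(completion: str, gold_answer: str) -> float:
    """Return 1.0 if the last number in the completion matches the gold answer.

    A hypothetical exact-match reward: math problems admit this kind of
    unambiguous check, which is what makes RL tractable here. Creative
    writing or open-ended science questions have no such exact check.
    """
    numbers = re.findall(r"-?\d+(?:\.\d+)?", completion.replace(",", ""))
    return 1.0 if numbers and numbers[-1] == gold_answer else 0.0
```

For example, `math_reward("After simplifying, the answer is 42", "42")` returns `1.0`, while a completion ending in any other number scores `0.0`; there is no equivalent binary check for an essay, which is why the authors hedge about transfer to those domains.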
On the coding front, there's growing interest in GGUF builds of models like Qwen3-Coder-Next. Many providers publish GGUF variants, but assessing their quality remains hard. The general consensus is that Q4 quantization is acceptable, while Q3 and below show a sharp decline in accuracy. Benchmark tests showed the Unsloth build outperforming the official Qwen build at Q4, with accuracy dropping significantly below that level. The lack of efficient benchmarking tools for GGUF models complicates evaluation, leaving real-world performance uncertain.
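The sharp falloff below Q4 has a simple arithmetic intuition: with b bits per weight there are only 2^b representable levels per block, so dropping a bit roughly doubles the rounding error. The toy blockwise round-to-nearest scheme below illustrates this; it is a simplified sketch, not the actual GGUF quantization format.

```python
import numpy as np

def quantize(w, bits, block=32):
    """Blockwise symmetric round-to-nearest quantization (toy version).

    Each block of `block` weights is scaled so its largest magnitude maps
    to the largest integer level, rounded, then scaled back.
    """
    w = w.reshape(-1, block)
    levels = 2 ** (bits - 1) - 1          # e.g. 7 levels at 4 bits, 3 at 3 bits
    scale = np.abs(w).max(axis=1, keepdims=True) / levels
    q = np.round(w / scale)
    return (q * scale).ravel()

rng = np.random.default_rng(0)
w = rng.standard_normal(4096)             # stand-in for a weight tensor
err4 = np.abs(quantize(w, 4) - w).mean()  # mean absolute error at 4 bits
err3 = np.abs(quantize(w, 3) - w).mean()  # mean absolute error at 3 bits
```

In this toy setting `err3` comes out more than double `err4`, mirroring the reported cliff between Q4 and Q3, though real GGUF formats use more sophisticated block layouts that shift exactly where the cliff lands.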