Quit Emailing Yourself

# deep-learning → efficiency

3 links tagged with all of: deep-learning + efficiency

Links

Deep Think with Confidence

Deep Think with Confidence (DeepConf) is a method that improves the reasoning efficiency and performance of large language models by using the model's internal confidence signals to filter out low-quality reasoning traces. It requires no additional training or tuning and can be integrated into existing systems. Evaluations on a range of reasoning tasks report higher accuracy with fewer generated tokens.

Saved by markshervey · Last saved January 12, 2026 · 1 min read

Tags: machine-learning, large-language-models, efficiency, reasoning, deep-learning

Deep Think with Confidence

Deep Think with Confidence (DeepConf) is a parallel thinking method that improves the reasoning performance and efficiency of large language models (LLMs) by using internal confidence signals to filter out low-quality reasoning traces. It can be integrated into existing frameworks without additional training or tuning, and it reaches up to 99.9% accuracy on the AIME 2025 benchmark while substantially reducing the number of generated tokens. A real-time demo runs the Qwen3-8B model with parallel thinking on the HMMT'25 dataset.

Saved by tldr-importer · Last saved October 29, 2025 · 1 min read

Tags: deep-learning, llm, reasoning, efficiency, parallel-thinking
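
To make the idea in the two DeepConf summaries above concrete, here is a minimal sketch of confidence-filtered voting over parallel reasoning traces. It is an illustration under stated assumptions, not the authors' implementation: the mean token log-probability is used as a stand-in confidence signal, and the names `trace_confidence`, `confidence_filtered_vote`, and `keep_fraction` are hypothetical.

```python
from collections import defaultdict


def trace_confidence(token_logprobs):
    """Stand-in confidence score (assumption): mean token log-probability.

    Higher (closer to zero) means the model was more confident in the trace.
    """
    return sum(token_logprobs) / len(token_logprobs)


def confidence_filtered_vote(traces, keep_fraction=0.5):
    """Keep the most confident traces, then majority-vote over their answers.

    traces: list of (answer, token_logprobs) pairs, one per reasoning trace.
    keep_fraction: hypothetical knob for the share of traces that survive.
    """
    scored = [(trace_confidence(logprobs), answer) for answer, logprobs in traces]
    scored.sort(key=lambda pair: pair[0], reverse=True)

    # Always keep at least one trace so a vote is possible.
    keep = max(1, int(len(scored) * keep_fraction))

    votes = defaultdict(int)
    for _, answer in scored[:keep]:
        votes[answer] += 1
    return max(votes, key=votes.get)


# Three low-confidence traces agree on "17"; two confident traces agree on "42".
traces = [
    ("42", [-0.1, -0.2]),
    ("42", [-0.3, -0.1]),
    ("17", [-2.0, -1.5]),
    ("17", [-1.8, -2.2]),
    ("17", [-2.5, -1.9]),
]
filtered_answer = confidence_filtered_vote(traces, keep_fraction=0.5)
unfiltered_answer = confidence_filtered_vote(traces, keep_fraction=1.0)
```

With `keep_fraction=1.0` this reduces to plain majority voting, so the low-confidence majority wins. Filtering first lets the two confident traces decide the answer. The token savings the summaries mention would come from stopping low-confidence traces early, which this offline sketch does not model.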

GitHub - visresearch/LLaVA-STF: The official implementation of "Learning Compact Vision Tokens for Efficient Large Multimodal Models"

The repository implements the method from "Learning Compact Vision Tokens for Efficient Large Multimodal Models," which improves inference efficiency by fusing spatially adjacent vision tokens and introducing a Multi-Block Token Fusion module. Experimental results show that the approach achieves competitive performance on various vision-language benchmarks while using only 25% of the baseline vision tokens.

Saved by tldr-importer · Last saved October 29, 2025 · 3 min read

Tags: multimodal, vision-tokens, inference, efficiency, deep-learning
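
As a rough illustration of how fusing spatially adjacent vision tokens can cut the token count to 25%, the sketch below merges each 2x2 neighborhood of a token grid into one token by concatenating the four feature vectors. The 2x2 window and the concatenation step are assumptions chosen to match the 25% figure; this is not the repository's code, the function name `fuse_adjacent_tokens` is hypothetical, and the Multi-Block Token Fusion module is not modeled.

```python
def fuse_adjacent_tokens(grid, window=2):
    """Merge each window x window block of tokens into a single token.

    grid: H x W nested list whose cells are feature vectors (lists of numbers).
    Returns an (H // window) x (W // window) grid; each fused vector is the
    concatenation of the vectors in its block, in row-major order.
    """
    height, width = len(grid), len(grid[0])
    if height % window or width % window:
        raise ValueError("grid dimensions must be divisible by window")

    fused = []
    for top in range(0, height, window):
        row = []
        for left in range(0, width, window):
            merged = []
            for dy in range(window):
                for dx in range(window):
                    merged.extend(grid[top + dy][left + dx])
            row.append(merged)
        fused.append(row)
    return fused


# A toy 4x4 grid of 2-dimensional tokens; each token stores its (row, col).
grid = [[[r, c] for c in range(4)] for r in range(4)]
fused = fuse_adjacent_tokens(grid)

tokens_before = len(grid) * len(grid[0])
tokens_after = len(fused) * len(fused[0])
```

The fused grid holds a quarter as many tokens, each four times as wide. A real model would presumably project the wider vectors back to the language model's hidden size, a step left out here.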