Quit Emailing Yourself

# programming → cost-efficiency → evaluation

1 link tagged with all of: programming + cost-efficiency + evaluation

Click any tag below to further narrow down your results

Links

Evaluating LLMs for my personal use case

The author evaluates various large language models (LLMs) for personal use, focusing on practical tasks related to programming and sysadmin queries. By using real prompts from their bash history, they assess models based on cost, speed, and quality of responses, revealing insights about the effectiveness of open versus closed models and the role of reasoning in generating answers.

Saved by tldr-importer · Last saved October 29, 2025 · 6 min read

+ llms evaluation ✓ programming ✓ + sysadmin cost-efficiency ✓