The author evaluates various large language models (LLMs) for personal use, focusing on practical tasks related to programming and sysadmin queries. By using real prompts from their bash history, they assess models based on cost, speed, and quality of responses, revealing insights about the effectiveness of open versus closed models and the role of reasoning in generating answers.