When evaluating local models for tool calling in GenAI applications, the testing revealed significant variability in performance among different models. The Qwen 3 models emerged as top contenders, particularly for their balance of accuracy and speed, while OpenAI's GPT-4 set a high benchmark for tool selection. The study emphasizes the importance of model choice in achieving effective tool integration in AI applications.