Lemonade is a tool designed to help users efficiently run local large language models (LLMs) by configuring advanced inference engines for their hardware, including NPUs and GPUs. It supports both GGUF and ONNX models, offers a user-friendly interface for model management, and is utilized by various organizations, from startups to large companies like AMD. The platform also provides an API and CLI for Python application integration, alongside extensive hardware support and community collaboration opportunities.
lemonade ✓
llm ✓
gpu ✓
open-source ✓
+ amd