oLLM is a lightweight Python library for large-context LLM inference that runs large models on consumer-grade GPUs without quantization. The latest update adds support for more models, improved VRAM management, an AutoInference helper, and multimodal capabilities, making it suitable for workloads that involve large datasets and long inputs.
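As a rough illustration of how a library like this is typically driven, here is a minimal sketch built around the AutoInference helper named above. The import path, constructor arguments, and `generate` method are assumptions for illustration only, not oLLM's confirmed API; consult the project README for the actual interface.

```python
# Hypothetical usage sketch for oLLM's AutoInference helper.
# All names and signatures below are assumptions based on the
# description above, not a verified API.
from ollm import AutoInference  # assumed import path

# Load a model on the GPU. oLLM avoids quantization, so weights stay
# in full/half precision and VRAM pressure is handled by the library.
llm = AutoInference("llama3-8B-chat", device="cuda:0")  # assumed signature

# Feed a long document as the prompt to exercise large-context inference.
with open("report.txt") as f:
    prompt = f.read()

answer = llm.generate(prompt, max_new_tokens=256)  # assumed method
print(answer)
```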
Topics: ollm, llm, inference, python, gpu