Quit Emailing Yourself

# optimization → api

2 links tagged with all of: optimization + api

Click any tag below to further narrow down your results

Links

Codex Prompting Guide

This article provides guidance on optimizing the Codex model for coding tasks using the API. It covers recommended practices for prompting, tool usage, and code implementation to enhance performance and ensure efficient task completion.

Saved by tldr-importer · Last saved February 14, 2026 · 6 min read

+ codex api ✓ optimization ✓ + coding + tools

GitHub - shuzhangzhong/HybriMoE-Preview

KTransformers is a Python-based framework designed for optimizing large language model (LLM) inference with an easy-to-use interface and extensibility, allowing users to inject optimized modules effortlessly. It supports various features such as multi-GPU setups, advanced quantization techniques, and integrates with existing APIs for seamless deployment. The framework aims to enhance performance for local deployments, particularly in resource-constrained environments, while fostering community contributions and ongoing development.

Saved by tldr-importer · Last saved October 29, 2025 · 4 min read

+ ktransformers optimization ✓ + llm + gpu api ✓