This article explains how Large Language Models (LLMs) process prompts from tokenization to response generation. It covers the transformer architecture, including self-attention and feed-forward networks, and details the importance of the KV cache in optimizing performance.
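To make the KV-cache idea concrete, here is a minimal pure-Python sketch of single-query attention during autoregressive decoding: each step appends the new token's key and value vectors to a cache rather than recomputing them for the whole prefix. The projection values and dimensions below are illustrative stand-ins, not the article's actual implementation.

```python
import math

def attention(q, keys, values):
    """Scaled dot-product attention for one query over cached keys/values."""
    d = len(q)
    scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
    m = max(scores)  # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # weighted sum of the cached value vectors
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(d)]

# At each decoding step, append the new token's key/value to the cache
# instead of recomputing K and V for the entire prefix (stand-in numbers).
K_cache, V_cache = [], []
for step in range(3):
    k = [float(step + i) for i in range(4)]
    v = [float(step * i) for i in range(4)]
    q = [1.0, 0.0, 0.0, 0.0]
    K_cache.append(k)
    V_cache.append(v)
    out = attention(q, K_cache, V_cache)
```

Without the cache, step *t* would recompute keys and values for all *t* previous tokens, turning generation quadratic in sequence length; with it, each step only processes the newest token.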
TOON is a compact format designed for encoding JSON data, making it easier for large language models to process. It combines YAML's structure with a CSV-like layout to reduce token usage while maintaining accuracy. While effective for uniform arrays, it's less suitable for deeply nested data.
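As a rough illustration of the token savings, the sketch below encodes a uniform array of flat objects in a TOON-like tabular form (a header declaring the length and field names, then one comma-separated row per object) and compares its length with the equivalent JSON. This is a simplified toy encoder, not the full TOON specification, which also covers nesting, quoting, and alternative delimiters.

```python
import json

def toon_encode(key, rows):
    """Encode a uniform array of flat dicts in a TOON-like tabular layout.

    Illustrative sketch only: assumes every row has the same flat fields.
    """
    fields = list(rows[0].keys())
    # header: key[length]{field1,field2,...}:
    lines = [f"{key}[{len(rows)}]{{{','.join(fields)}}}:"]
    for row in rows:
        lines.append("  " + ",".join(str(row[f]) for f in fields))
    return "\n".join(lines)

users = [{"id": 1, "name": "Alice"}, {"id": 2, "name": "Bob"}]
toon = toon_encode("users", users)
print(toon)
print(len(toon), "chars vs", len(json.dumps(users)), "chars as JSON")
```

Because field names appear once in the header instead of being repeated in every object, the savings grow with the number of rows; for deeply nested or non-uniform data this tabular layout no longer applies, which is why the format is a poor fit there.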