1 link tagged with all of: python + transformers + micro-gpt
Links
This article breaks down Andrej Karpathy’s zero-dependency, 243-line GPT implementation in plain Python. It explains how each component (tokenizer, autograd engine, embeddings, attention mechanism, residual connections, and MLP) mirrors its counterpart in a full-scale transformer, trained on a tiny dataset of baby names.