- Efficiency
- MoE
- Inference
- Python
- CUDA
- Systems
- external-services
Understanding Transformers — The Architecture Behind Modern LLMs
A deep dive into the Transformer architecture — self-attention, multi-head attention, positional encodings, and scaling laws.