IV — What Makes an LLM → Chapter 17
FROM SYSTEMS TO FRONTIER ML

Mixture-of-Experts

Routing, sparse activation, and why capacity ≠ compute. Load-bearing in most 2025 frontier models.

§1 Routing + sparse activation
§2 Capacity vs compute: the asymmetric scaling
§3 Expert parallelism + load balancing