2026-03-12
10 min readMoE at the Edge: Making Sparse Activation Work on Your Phone
Exploring Mixture of Experts for sub-3B models targeting mobile deployment. How I converted Qwen3-0.6B to a sparse MoE with 4 experts, achieving comparable quality with a 19% parameter increase.