The Distributed MoE Landscape: A Practical Survey of What Works and What Doesn't
Before building BlockZero, we surveyed the literature on distributed Mixture-of-Experts (MoE) training. This is a practical synthesis of what we found: which methods exist, which problems they solve, where they fall short, and what gap BlockZero fills.