StructMoE: augmenting MoEs with hierarchically routed low rank experts

StructMoE introduces hierarchical routing and low-rank experts to improve the efficiency and performance of mixture-of-experts (MoE) models.

NeurIPS
December 14, 2024
