does this only work for vLLM or is generally applicable?
The algorithm works for MoE load balancing in general
The algorithm works for MoE load balancing in general