Does this only work for vLLM, or is it generally applicable?

The algorithm works for MoE load balancing in general, not just in vLLM.