Even if multilinguality isn't solved, building a benchmark, testing each model on it, and publishing the results may be a cheaper way to accelerate models' competence in the language.