I read that as "languages under-represented in the training set".