Being unchanged for decades means that the training data should provide great results even for the smaller models.

It means there's also plenty of bad examples in the training data to learn the wrong lessons from though.