Between two models the one with the shorter Minimum Description Length (MDL) will more likely generalize better