My take: we have no clue how this works, and the performance could just as well drop tomorrow.

My hypothesis: the prompt got shorter but still carried the same amount of information.