I find the quality ebbs and flows even on the same model. My guess it is something to do with GPU availability but only guessing.
I find the quality ebbs and flows even on the same model. My guess it is something to do with GPU availability but only guessing.
Unless you're systematically repeating the exact same task, the most parsimonious explanation is that you're seeing natural variation based on different tasks, random sampling of tokens, etc.