Did you ever consider using something like https://github.com/jenissimo/unfake.js/ in your process, to make it more proper-pixel-art?

Maybe to process the Nano-Banana generated dataset before fine-tuning, and then also to fix the generated Qwen output?