If you're using opencode or similar, you can temporarily switch models, in the same session, to something that has vision and have it look at your image. Then switch back.
Or create an agent or subagent that just looks at images, and specify a vision model for that agent.
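Roughly what the same-session switch amounts to, as a toy Python sketch. This is not opencode's actual API: `Session`, `ask`, `fake_ask`, and the model names are all made up for illustration. The assumption is that the session history is shared across model switches, so the vision model's text description is still there when the coding model takes over again.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Session:
    """Hypothetical chat session: an active model name plus a shared history."""
    model: str
    history: List[str] = field(default_factory=list)


def describe_image_in_session(
    session: Session,
    image_path: str,
    vision_model: str,
    ask: Callable[[str, str], str],
) -> str:
    """Switch to a vision model, record its description, then switch back."""
    coding_model = session.model
    session.model = vision_model
    try:
        # `ask` is a stand-in for whatever actually calls the model.
        description = ask(session.model, f"Describe the image at {image_path}")
        # The description is plain text, so it stays in the shared history.
        session.history.append(f"[image description] {description}")
    finally:
        # Always restore the coding model, even if the vision call fails.
        session.model = coding_model
    return description


def fake_ask(model: str, prompt: str) -> str:
    """Placeholder for a real model call; returns a canned description."""
    return f"({model}) a login form with two input fields"


session = Session(model="coding-model")
describe_image_in_session(session, "screenshot.png", "vision-model", fake_ask)
```

After the call, `session.model` is the coding model again and `session.history` holds the text description, which is the only thing the coding model needs to read.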
I don't see how that helps. I would still need to somehow get the image into the coding model's context.