Ok, maybe I should have said "framebuffers" instead of images.

Because that Kitty protocol seems limited in that interaction was not one of their goals.

What's the point? If you want a framebuffer, ask the windowing system for one. Why go through a terminal emulator?

Because this one works through screen/tmux, and works over ssh without messing with DISPLAY variables and such.

First example on the Kitty image protocol sent above looks pretty interactive:

https://github.com/chase/awrit