As everybody knows, key strokes and mouse movements are the things that solve problems, definitely the data worth capturing for AI training.

See: https://si.inc/posts/fdm1/

If they captured display output as well, it could be a very useful dataset for generalized computer use.

They used to say the same thing about text, it turned out that after all training the best thing they could achieve is the `ccc` compiler.