I don't understand the use case here.. We've had this kind of automation for years now without needing a heavy GPU and without risk of going rouge. The worst that might happen is an interface changes once every year or two and you need to update your scripts.

Microsoft so hell bent on throwing all of their AI-SH*T and seeing what sticks.

The point is that you can direct it at any of the 1bn+ websites without having to write any scripts.

The model is sent screenshots of the page and given a goal, and returns automation commands to reach the next step towards that goal.

Hmm.. Sounds like a solution looking for a problem to me.

If I could fine tune it to fill my work time sheets, I would count it as a big win!

[deleted]

if you think about it for more than 5 seconds you'll see a lot of applications, it's not that hard cmon.