I also feel like it's just a matter of time until someone cracks the nut of making agents better at understanding GUIs and more adept at using them.

Is there progress happening in that trajectory?

> Is there progress happening in that trajectory?

There was a recent Hacker News post describing a novel approach to making agents interact with GUIs (computer use):

The First Fully General Computer Action Model: https://si.inc/posts/fdm1/ (HN discussion: https://news.ycombinator.com/item?id=47125014)

Hope this helps