I also feel like it's just a matter of time until someone cracks the nut of making agents better understand GUI's and more adept at using them.
Is there progress happening in that trajectory?
I also feel like it's just a matter of time until someone cracks the nut of making agents better understand GUI's and more adept at using them.
Is there progress happening in that trajectory?
> Is there progress happening in that trajectory?
There was a recent Hackernews post which had a novel approach about making agents interact with GUI/computer-use
https://news.ycombinator.com/item?id=47125014: The First Fully General Computer Action Model : https://si.inc/posts/fdm1/
Hope this helps