I am just a dilettante, but I imagined that eventually agents will be making API calls directly via browser extension, or headless browser.

I assumed everyone making these UI agents will create a library of each URL's API specification, trained by users.

Does that seem workable?