I find it kind of hilarious that a 7 billion parameter AI model is necessary to automate the clicking of webpages. I mean, how broken is the software stack if we can't script things? We jumped the shark, clearly.

Yesterday I watched a video showing off 'My New Agent Coding Workflow'. The beginning involved prompting the IDE with a URL to download some additional prompt text files. I really didn't understand why you wouldn't just download the files. Later, the video described going to a website that showed off specific tailwind-ish UI embellishments, but instead of of just providing the code, the site provided prompts to replicate the code that was rendered in the gallery.

I felt like the author getting a cut of viewer token sales.

It's kinda out of control. Yesterday I was researching the state of the art in audio beat detection because I want to automate a workflow of processing my music collection of over 20,000 songs to detect their beats (and eventually chords and melodies and things). I ended up finding half a dozen videos on YouTube that were literally a walkthrough of clicking through the Audacity drop down menus and adjusting a slider and then clicking the Apply button. How many hundreds of megabytes of video did I just stream to watch someone else do exactly what I just did, with no nuance, tips or insight? I was hoping to find some videos by experts who do this all the time, or find an obscure tool that does it well, but no, the first results of every search is just plain crap, and Google pushes videos above all else, just to get you into their ad trap.

Because its not a software issue, it's a human social cooperation issue.

Companies don't want to support useful APIs for interoperability so its just easier to have an LLM bruteforce problems using the same interface that humans use.

half of GDP generated by all software and finance companies (ai and non ai) are artificial moats based in overengineering for the sake of selling something at a higher price than it would be if it was simpler.