I use LLM’s daily and agents occasionally. They are useful, but there is no need to move any goal posts; they easily do shit work still in 2026.

All my coworkers use agents extensively in the backend and the amount of shit code, bad tests and bugs has skyrocketed.

Couple that with a domain (medicine) where our customer in some cases needs to validate the application’s behaviour extensively and it’s a fucking disaster —- very expensive iteration instead of doing it well upfront.

I think we have some pretty good power tools now, but using them appropriately is a skill issue, and some people are learning to use them in a very expensive way.

  > appropriately is a skill issue
or maybe its a ux issue?

maybe chatbot style interfaces are just an artifact of the medium?

people talk about setting up harnesses and feedback loops etc, but a lot of the ux is a frankly mess...