This is a great recipe for going out of business.

If the competitive risk is real, then are choosing between supplier risk (AI model access) and competitive risk.

When there isn't a zero-risk option, the question becomes which risk is smaller.

> If the competitive risk is real

Yes.

If.

Man I hope this tech FOMO eventually stops.

Companies generally fail because either their product doesn't meet a market need, or the market doesn't exist in the first place (possible because of bad timing), and not because they simply outran their competitors.

These aren't things fixed by using a frontier model to vibe code faster in lieu of one 5 months behind.

You can compete by being smart and using less-than-sota models and build a more solid business around them

I use whatever model is SOTA. I switch between them in order to avoid lock in.

>I use whatever model is SOTA. I switch between them in order to avoid lock in.

What's your competitive edge here? Shaving off an hour of a feature delivery? Not having to see the code that is produced?

Not sure about OP, I usually make Opus 4.8 on Extra thinking level implement features for me on a specific project, while I'm busy with other stuff.

For a change, I let DeepSeek V4 Pro implement it on Max thinking level. Nothing too out there - some DB migrations, some Django back end changes and Vue SPA front end changes.

Implementation time in total including tests was a few hours, so nothing too egregious. However, one of the migrations would break with pre-existing data, one of the column references in the entity was wrong, the API endpoint wasn't made consistently with the others in adjacent code (e.g. permission checks) and the front end had a Pinia state related issue and submitting one of the forms didn't work.

Tooling was run: ruff, ty, Oxfmt, Oxlint, also Docker build was green across the board, but the overall feature just didn't work. In both cases, sub-agents with clear context would review the code for serious/critical issues, at least three in parallel and do review loops until they spot nothing. The harnesses both has LSP integration.

Opus spent another hour fixing it, needed a few iterations, because I couldn't be bothered there.

> What's your competitive edge here? Shaving off an hour of a feature delivery? Not having to see the code that is produced?

The difference largely was not needing to waste time in fixing all sorts of subtle bugs that sub-optimal models will produce, worse yet if it was some sort of a serious project and those wouldn't have been spotted but instead that slop would have gotten shipped.

That said, Opus isn't ideal either and messed up a whole bunch when I was training some neural nets and try to process a bunch of satellite data and configure Garage to store them so that tiles can be served from a slow HDD and stuff like that. Obviously, it also needs a lot of babysitting in regards to UI looks, but it's better at the rest of development.

I think that DeepSeek V4 Pro and GLM 5.2 are cool though, it's just that you want as many checks and tests as you can throw at any given problem, or use languages that make shipping completely broken code increasingly likely.

Any competitive business will accept this risk if it gives them any type of edge no matter the duration of that edge. This is no different that using an exotic raw material.

Every big business in the world biases towards risk reduction and cost reduction over getting an edge.

Different businesses have different biases.

Eh, this isn’t really how businesses operate. How many businesses refuse to give devs large-spec machines? That’s very clear positive ROI.

I think it’s excessively charitable to assume businesses are uber-competent ROI-chasers. The expense people are eventually going to win on AI too, this blip of unrestricted AI budgets will be gone soon.

And thus, capitalism continues to roll on. Businesses are suppose to go out of business, its a feature.

they’re not supposed to, they’re just able to

Nearly spit out my coffee, thanks for the chuckle.

It’s ok to be amused, absent exaggeration. Spit takes happen in sitcoms.

They do happen in real life.

They are overused in sitcoms because it’s easy for actors to mimic on demand unlike several other reactions.