Yeah, this is the classic silicon valley strategy of selling at a loss and then once they have captured the market inflate prices.

See Uber, Netflix, etc.

I don't see them capturing anything at this point. If inference was profitable then they could compete on price/model and capture the market. Then increase price and pay back the model training.

Feels like they are just pulling in as much as they can whilst competing on capabilities instead. At which point its a case of who can last the longest.

Doesn't feel like Uber/Netflix.

They're trying to do it more like a cartel where all major providers raise prices in unison. The intention is (probably) less specific entrapment and more getting people addicted to a fast LLM. From there, they all play with pricing to give a semblance of choice, without actually overly undercutting each other. At least, in the west.

This is all done to help valuations. The main revenue source are the investor dollars at the prospect that this industry will very soon actually be sustainable and highly profitable. It won't be, but if very soon stays around the corner consistently, the investor dollars keep coming.

This is a constantly repeated conspiracy theory and is not true at all. The api costs do increase but aggregate costs per task decrease. The question is: do people need lower intelligence models at all? The answer is a resounding NO!

How many people do you see using haiku or sonnet? I see very few and most people default to the latest model and just play with thinking effort. I think three layers are good enough and supporting more is not a good UX.

Do I need the most intelligent model to generate boilerplate code, which is my main usage for AI? Resounding No.

For my use case a model from a year ago is good enough

Are you only considering coding use cases?

Many enterprise use cases, such as simple data extraction, are well served by cheaper models.

I... use them all the time: plan with a more advanced model, build with a cheaper one. Anthropic literally packages a metamodel (opusplan) for that pattern.

Also: calling the SV blitzscaling strategy of using VC money to fund loss leader products with the goal of building a monopoly via dumping a conspiracy is quite the position given there's entire books written in the topic...