These are chips that become e-waste the second a better a model comes out, and nvidia is already limited by TSMC capacity.

This is a ridiculous mindset. Llama 3.1 8B can do lots of things today and it'll still be able to do those things tomorrow.

If you baked one of these into a smart speaker that could call tools to control lights and play music, it will still be able to do that when Llama 4 or 5 or 6 comes out.

If you pay $1,500 for a Mistral ASIC that is beaten by a $15 Qwen ASIC that comes out six months later, you'd be feeling pretty dang ridiculous.

I'm equally capable of making up numbers to support my perspective but I don't see the point.

The point is that the GP's mindset is not very ridiculous if you value things by a price/utility ratio. Software and hardware advancements will lead to buyer's remorse faster than people get an ROI from local inference.

SW and HW advancements will bring this topic in the "good enough for vast majority" field, thus making GP point moot. You don't care if your LLM ASIC chip is not the latest one because it works for the use you purchased it for. The highly dynamical nature of LLM itself will make part of the advantage of upgradable software not that interesting anymorw. [1]

[1] although security might be a big enough reason for upgrades to still be required

They'll be perfect for an appliance like the Rick and Morty butter robot.

these aren’t made for general chatbot use

Only in VC backed funding land.

In the real world, theres talking refrigerators who dont need to know how to recite shakespeare.

On the upside, Shakespeare isn't going to change soon.

So you're saying we should burn Shakespeare onto a chip? /s