Hacker News

The main thing here is, there are doing it really cheap!

I heavily using Deepseek V4 Pro for a personal project because I cannot afford Opus, and spent ~1B token last two weeks for just $40 which would've costed ~$1300 using Opus 4.8. Realistically Opus cost will be lower assuming more "intelligent" model would've produced less code with fewer conversation but I doubt it'll be cheaper than ~$500.

I'm curious to know how they can they offer at such a cheap price. Some say it's electricity surplus in China and/or government subsidy. It'll be a very interesting read if there's an extensive study on their economics.

   1.1B (cache reads) * $0.5 = ~576
   39M (ache miss) * $5 = ~199
   21M (output) * $25 = ~529
   Opus 4.8 = 1304

   1.1B (cache reads) * $0.003625 = ~4.17
   39M (ache miss) * $0.435 = ~17.3
   21M (output) * $0.87 = ~18.4
   Deepseek V4 Pro = ~40

toraway 11 minutes ago [ - ]

Nice comparison, I've been super impressed by both Deepseek V4 models, particularly Flash given the crazy value for price vs. performance.

It can definitely do "stupid" things and get off track at times but I've found it can easily handle routine web dev tasks like 9/10 times, and using Pro to handle any large refactors/tricky bugs/etc.

The only really negatives are both models (but particularly Flash V4) occasionally have a strange issue parsing instructions, almost like a "language barrier" where a clear instruction gets bizarrely misinterpreted in a subtle but very problematic way. It feels a bit like a SOTA model a year ago where they'd occasionally just miss the plot entirely while still being technically competent but misdirected.

Also not really a negative, but I can't handle watching the reasoning output on Pro anymore haha. It like actually started stressing me out and giving me heartburn watching it get something right on the first or second idea... and then spend like 5 minutes looping through a dozen extremely dumb guesses with "But wait.... Or... Unless..." lol.

Even if I knew it would (usually) end up where it should I just couldn't stand seeing it consider, like, deleting my prod DB and recreating tables manually/ripping out some critical dependency/etc without interupting it to say "Holy shit you had it right the first time, for the love of god just start doing the thing now and move on".