Ironically, one of the few providers not scamming you on cache reads is DeepSeek.
Everyone else charges a ridiculous amount, but DeepSeek's API is $0.003625 / M tok for cache hits.
I'm surprised no one talks about this given how significant it is. GPT 5.5, for example, costs a ridiculous $0.50 / M tok cached. DeepSeek is almost 140 times cheaper (0.50 / 0.003625 ≈ 138), which matters a lot for tool calls.
It's a temporary promo; DeepSeek will go back to being only about 10x cheaper afterwards.
Yes, DeepSeek V4 Pro is currently on discount.
> The deepseek-v4-pro model is currently offered at a 75% discount, extended until 2026/05/31 15:59 UTC.
However, even when the discount ends it's still very cheap. It will go back to $0.0145 / M tok for cache hits. That's still about 34x cheaper than GPT 5.5.
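For anyone who wants to check the numbers, here is a quick sketch of the arithmetic. The prices are just the figures quoted in this thread, not pulled from any pricing page, and the variable and function names are made up for illustration.

```python
# Prices in USD per million cached input tokens, as quoted in this thread.
# These are assumptions taken from the posts above, not official figures.
GPT_CACHED = 0.50          # GPT 5.5 cached input
DEEPSEEK_PROMO = 0.003625  # deepseek-v4-pro cache hit during the promo
DEEPSEEK_LIST = 0.0145     # deepseek-v4-pro cache hit after the promo
PROMO_DISCOUNT = 0.75      # the quoted 75% discount


def times_cheaper(reference: float, price: float) -> float:
    """How many times cheaper `price` is than `reference`."""
    return reference / price


promo_ratio = times_cheaper(GPT_CACHED, DEEPSEEK_PROMO)
list_ratio = times_cheaper(GPT_CACHED, DEEPSEEK_LIST)

# Sanity check: the post-promo price with 75% off should match the promo price.
implied_promo = DEEPSEEK_LIST * (1 - PROMO_DISCOUNT)

print(f"promo: {promo_ratio:.1f}x cheaper")   # promo: 137.9x cheaper
print(f"list: {list_ratio:.1f}x cheaper")     # list: 34.5x cheaper
print(f"list price with 75% off: ${implied_promo:.6f} / M tok")
```

So the "almost 140x" and "34x" figures both hold up, and the promo price is exactly the post-promo price with 75% off.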
Doesn't matter when subscriptions get cache reads for free. It's only really worth it if it's 340x cheaper; otherwise I'd be paying $120 a day, with 90% of the cost being cache reads, for any top-level open-source model.