I am getting a 98.6% cache hit ratio on deepseek-v4-flash with opencode.
That’s impressive!
On sheer performance, is it comparable to Opus?
Here are my stats (from DeepSeek directly, computed with a script I wrote). The prices are what the equivalent Sonnet usage would have cost; the actual amount I paid was $10. On performance, DeepSeek V4 Pro is comparable to Sonnet for me.
./cost.py amount-2026-5.csv 0.3 3.75 15
input_cache_hit_tokens: 472,971,520 tokens -> $141.8915
input_cache_miss_tokens: 13,299,013 tokens -> $49.8713
output_tokens: 3,334,962 tokens -> $50.0244
cache hit rate: 97.27% (472,971,520/486,270,533)
cache miss rate: 2.73% (13,299,013/486,270,533)
total: $241.7872
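For anyone who wants to check the arithmetic, here is a minimal sketch that reproduces the figures above. It is not the original cost.py: the CSV parsing is left out because the file format isn't shown, the token totals are copied straight from the output, and the three command-line numbers are assumed to be USD per million tokens for cache hits, cache misses, and output, in that order.

```python
# Sketch only: reproduces the arithmetic in the output above, not the original cost.py.
# Assumption: the three CLI arguments are USD per million tokens
# (cache hit, cache miss, output), in that order.
PRICES_PER_MTOK = {
    "input_cache_hit_tokens": 0.3,
    "input_cache_miss_tokens": 3.75,
    "output_tokens": 15.0,
}

# Token totals copied from the output above.
USAGE = {
    "input_cache_hit_tokens": 472_971_520,
    "input_cache_miss_tokens": 13_299_013,
    "output_tokens": 3_334_962,
}


def cost_breakdown(usage, prices):
    """Return per-category cost in USD: tokens / 1e6 * price per million."""
    return {name: tokens / 1_000_000 * prices[name] for name, tokens in usage.items()}


def cache_hit_rate(usage):
    """Cache hits as a fraction of all input tokens (hits + misses)."""
    hits = usage["input_cache_hit_tokens"]
    misses = usage["input_cache_miss_tokens"]
    return hits / (hits + misses)


costs = cost_breakdown(USAGE, PRICES_PER_MTOK)
for name, tokens in USAGE.items():
    print(f"{name}: {tokens:,} tokens -> ${costs[name]:.4f}")
print(f"cache hit rate: {cache_hit_rate(USAGE):.2%}")
print(f"total: ${sum(costs.values()):.4f}")
```

Note that the hit rate here is hits / (hits + misses), so output tokens don't enter into it.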
Out of curiosity, how do you measure cache hit rate in opencode?
opencode stats
So the calculation is:
Total input tokens = input + cache read + cache write
Cache hit rate = cache read / total input tokens
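The same calculation as a small sketch, in case it helps. The function name and the token counts are made up for illustration; they are not real `opencode stats` output, so plug in the three numbers your own stats report.

```python
def hit_rate_from_stats(input_tokens, cache_read, cache_write):
    """Cache hit rate = cache read / (input + cache read + cache write)."""
    total_input = input_tokens + cache_read + cache_write
    if total_input == 0:
        return 0.0  # no input tokens recorded yet, so avoid dividing by zero
    return cache_read / total_input


# Hypothetical token counts, chosen only to illustrate the formula.
rate = hit_rate_from_stats(input_tokens=200_000, cache_read=710_000, cache_write=90_000)
print(f"cache hit rate: {rate:.0%}")
```

Cache writes count toward the denominator here, which is why this rate can come out lower than a plain hit / (hit + miss) figure for the same session.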
That is 71% in my very limited use of opencode.
The first
All of this usage was with an OpenCode subagent exclusively.