Not in many tasks. I use DeepSeek as a fallback in https://phrasing.app and it's always very apparent when it happens (due to mistakes/a clear performance drop-off)
Interesting - which models specifically? I'd be interested in using Mistral over DeepSeek if it were competitive (guess I need to go benchmark)
I use small, large, and medium-3.5 depending on the task