I really need this as an API.
Turns out, to use Claude Agents SDK, you need to have a vision enabled API. If Deepseek API could see, it can fully drive Claude Code and Claude Agents SDK. A project I'm working on relies on a Claude-in-CloudflareWorker setup and I've been relying on Qwen and gemini flash lite, both more expensive than Deepseek.
Can't wait to have it available on deepseek.
Xiaomi Mimo v2.5 is my favorite alternative. Matches DS v4 Flash (official) pricing exactly and supports image/audio/video input.
same here. I am using Gemini 2.5 Flash as VSCode "vision proivder" for Deepseek V4 Pro, but it is expensive and not accurate. can't wait for native Deepseek vision.
Have you looked at MiniMax or MiMo? Available today via OpenRouter, and it’ll make the path to porting to DeepSeek a line change https://openrouter.ai/collections/vision-models