I tried it with OpenCode and it is borderline incapable of using tool calls, so that might be why it is doing so bad on your test.

I just did the same. Absolutely awful. I assume OpenCode's heavy context is a problem, and it's probably better to use Liquid's own OpenCode alternative for this.

Where can I find that agent harness? A look at their Docs and asking Gemini yielded no results.

Edit: Is it this? https://github.com/Liquid4All/cookbook/tree/main/examples/lo...

FYI: Opencode is very well tuned for Qwen models, but I haven’t found it that rare for niche models to perform badly in it.