Hacker News

composer is competitive with around opus 4.5 in feeling?

largely lags behind opus4.7/gpt5.4, but is respectable, and generally outperforms the glm/qwen equivalents anecdotally despite benchmarks.

fails to follow instructions more often, and is less code critical, but performs okay if you can decompose the task to smaller problem spaces. i.e. only do manual review, only do typechecking, only do specific component. etc

https://artificialanalysis.ai/agents/coding-agents?coding-ag...