It's a little bit old, but challenge you opinions about what matters for LLM agentic coding:

https://github.com/Tencent-Hunyuan/AutoCodeBenchmark/blob/ma...