It's a little bit old, but challenge you opinions about what matters for LLM agentic coding:
https://github.com/Tencent-Hunyuan/AutoCodeBenchmark/blob/ma...
It's a little bit old, but challenge you opinions about what matters for LLM agentic coding:
https://github.com/Tencent-Hunyuan/AutoCodeBenchmark/blob/ma...