Hacker News

codezero a day ago [ - ]

the performance degradation I've seen isn't quality/completion but duration, I get good results but much less quickly than I did before 4.6. Still, it's just anecdata, but a lot of folks seem to feel the same.

refulgentis 21 hours ago [ - ]

Been reading posts like these for 3 years now. There’s multiple sites with #s. I’m willing to buy “I’m paying rent on someone’s agent harness and god knows what’s in the system prompt rn”, but in the face of numbers, gotta discount the anecdotal.

codezero 4 hours ago [ - ]

You're probably right. It's probably more likely that for some period of time I forgot that I switched to the large context Opus vs Sonnet and it was not needed for the level of complexity of my work.

coldtea 12 hours ago [ - ]

Yeah, why trust your actual experience over numbers? Nothing surer than synthetic benchmarks

refulgentis 11 hours ago [ - ]

Strawman, and, synthetic benchmark? :)