Hacker News

jackxlau 12 hours ago [ - ]

In my own testing I have seen peak performance happen usually within 15-20% of the intended context limit, albeit there are a few optimizations depending on the task quality.