The other part of that is that while price per token may be going down, tokens per task are going up.
For ~equivalent tasks/results, or because we’re expecting more or better from tasks?
The real measure should be cost per ~equivalent task result, not cost per token or tokens per task.
For better performance on ~equivalent tasks. That's what all the harness tooling people are using does: it (often) increases output quality by significantly increasing token counts.
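To make the cost-per-task point concrete, here is a rough sketch of the arithmetic: cost per task is just price per token times tokens per task, so a falling price can still mean a rising bill. All numbers below are made up for illustration, and the function name `cost_per_task` is hypothetical, not taken from any real pricing API.

```python
def cost_per_task(price_per_million_tokens: float, tokens_per_task: int) -> float:
    """Return the cost of one task, given a price quoted per million tokens."""
    return price_per_million_tokens * tokens_per_task / 1_000_000


# Hypothetical "before": higher price, fewer tokens per task.
before = cost_per_task(price_per_million_tokens=10.0, tokens_per_task=20_000)

# Hypothetical "after": price halves, but a harness triples the tokens used.
after = cost_per_task(price_per_million_tokens=5.0, tokens_per_task=60_000)

print(f"before: ${before:.2f} per task")
print(f"after:  ${after:.2f} per task")
print(f"change: {after / before:.2f}x")
```

With these assumed inputs the price per token drops by half while the cost per task goes up by half. This only compares like with like if the two runs produce ~equivalent results, which is the harder thing to measure.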