What do you think this particular prompt is evaluating for?
The more popular these particular evals are, the more likely the model will be trained for them.
What do you think this particular prompt is evaluating for?
The more popular these particular evals are, the more likely the model will be trained for them.
Sea https://simonwillison.net/2025/Nov/13/training-for-pelicans-...