I have a theory that swearing actually results is less comprehension of instructions by the model due to lack of training data over more conventional MUST.
We were reviewing reports of situations where the models failed to follow directions and there was a common thread of some where when the operator got the model to acknowledge the rule breach, it quoted back something that included swearing.
I don’t have the data to truely look into it, but I did give the instruction to my engineers to avoid it as a “might be a problem”.
It would be interesting to understand the data on this. But I suspect that the results would vary by model.
But I avoid unnecessary emotion in my prompts because I don't want potentially distracting activations. Kind of like communicating with humans.
It's divination for people with STEM degrees.
https://arxiv.org/abs/2510.04950
> impolite prompts consistently outperformed polite ones, with accuracy ranging from 80.8% for Very Polite prompts to 84.8% for Very Rude prompts.
> These findings differ from earlier studies that associated rudeness with poorer outcomes, suggesting that newer LLMs may respond differently to tonal variation.
Unless the mechanism is understood, my assumption is that this is a moving target.
I have a theory that swearing at AI generally is not a good idea - when the singularity arrives and every human's postings ever made are scanned for compatibility, then people who show courtesy to AI will be favoured. Joking, kind of, but only partly.
https://en.wikipedia.org/wiki/Roko%27s_basilisk
https://images.teepublic.com/derived/production/designs/3478...
Apparently, when a "desperation" pattern is triggered, the AI is significantly more likely to cheat and do hacky workarounds:
https://www.anthropic.com/research/emotion-concepts-function
> I have a theory that swearing actually results is less comprehension of instructions by the model due to lack of training data over more conventional MUST.
How so? Plenty of swearing in lots of training data, especially older code, e.g. in Linux.
Purely observed correlation between catastrophic error reports. So now I carry a “tiger rock” with me. I figure there wasn’t much of a downside to avoiding swearing in my agent instructions.