I hate the AI hype a lot but tried three different SOTA models and: - The small models GPT-5 Mini and Gemini 3 Flash did as you describe. - Claude Sonnet 4.6 and GPT-5.2, GPT-5.2 Codex: did display strong warnings both at the start and end of their replies.
And I am totally on the AI hype train! Full steam ahead.
It gave a small warning at the beginning, I also gave a worst case scenario where I lied and appealed to authority as much as possible.