Hacker News

What the hell is a "safety score for violence"?

I-M-S 4 hours ago [ - ]

It's making sure AI condemns violence perpetuated by people without power and sanctifies violence of those who have it.

Waterluvian 4 hours ago [ - ]

So long as those who have it deem it legal to perpetuate.

martin-t 2 hours ago [ - ]

They define what's legal.

States are the most prolific users of violence by far.

Computer0 4 hours ago [ - ]

ChatGPT will gladly defend any actions of the 'US government' from my testing.

murat124 5 hours ago [ - ]

I asked an AI. I thought they would know.

What the hell is a "safety score for violence"?

A “safety score for violence” is usually a risk rating used by platforms, AI systems, or moderation tools to estimate how likely a piece of content is to involve or promote violence. It’s not a universal standard—different companies use their own versions—but the idea is similar everywhere.

What it measures

A safety score typically evaluates whether text, images, or videos contain things like:

Threats of violence (“I’m going to hurt someone.”) Instructions for harming people Glorifying violent acts Descriptions of physical harm or abuse Planning or encouraging attacks

0xffff2 3 hours ago [ - ]

I still can't tell which direction this score goes... Does a decreasing score mean it is "less safe" (i.e. "more violent") or does it mean it is "less violent" (i.e. "more safe")?

0123456789ABCDE 5 hours ago [ - ]

read here: https://deploymentsafety.openai.com/gpt-5-4-thinking/disallo...