Hacker News

catigula 4 hours ago [ - ]

>“The behavior described in the paper cannot meaningfully be fixed, and any attempt would only weaken the model for defense,” said Moussouris, who criticized the export control directive as hasty, heavy-handed, and misguided.

This literally means the models are too dangerous to release, and yet he and they reached the opposite conclusion.

A lot of people have been saying this repeatedly for a long time.

switchbak 4 hours ago [ - ]

Or perhaps: we don't want our adversaries fixing all the security holes we rely on.

Or even: this is a good chance to stick it back to Anthropic.

ceejayoz 4 hours ago [ - ]

> This literally means the models are too dangerous to release…

Unless you believe Anthropic has an irreplacable wizard or genie or fairy chained up somewhere that other providers can't replicate, someone is going to release such a thing, and that someone might be a lot more cavalier about the safety of it.

catigula 2 hours ago [ - ]

Yes, this is the flawed logic Anthropic is using to do dangerous things; it's not lost on anyone.

ceejayoz an hour ago [ - ]

What's flawed about the logic?

Are we gonna drone strike China's datacenters when they release a similar model?

kylemaxwell 4 hours ago [ - ]

Mousssouris is not a "he".

catigula 3 hours ago [ - ]