> some evil freaks to use ablated offline model for some nasty acts

If this is a serious concern, why hasn't some red teaming effort demonstrated this possibility already? The fact of the matter is that ablation can't give a model world knowledge it doesn't have as part of training, it can only make the model confabulate. The "nasty" areas of concern are most notable for their world-knowledge requirements, which is where local models are at their weakest anyway.

> why hasn't some red teaming effort demonstrated this possibility already?

I'm sure they have but as usual we are a reactive society than proactive. Only when incident has occurred then we have momentum to act.