The first rule of AI alignment is don't talk about AI alignment (in any medium that could end up in a training corpus).

If your AI alignment strategy is so fickle that it breaks if people simply discuss potential problems with the strategy then you didn't really have an alignment strategy to begin with.

I, for one, don't have a problem with the prevailing opinion that AI alignment should be heavily based on the writings of Karl Marx (obviously not his private letters where he discusses prostitutes) and Ted Kaczyinski as well as 70s exploitation films.

Personally I'd prefer it solely trained on Rothbard's works.

ok, but alignment cuts both ways. Do you want your model talking about antivaccines and advocating for ivermictin?