Has anyone taken these open weight models from China and stripped the CCP out of them? I do not mean that snarkily, I mean review them thoroughly using techniques for weight introspection (concept activations) in response to things that one might expect would trigger deceptive/malicious behavior if the CCP had actually tried to implant context-specific behaviors (e.g. the accusation of generating vulnerable code if being used in American government applications, which I don't know if it was ever proven).
Just in case there are those who'd reflexively down vote this post, I'd just like to say that in a time of great national geopolitical rivalries, this kind of question is not unreasonable one to ask. Indeed, its applicable question whichever nation you live in.
> Has anyone taken these open weight models from China and stripped the CCP out of them?
The CCP is not influencing my Rust code quality that much. Though I did notice all my lifetimes are now 'static because nothing is ever allowed to leave the party's ownership, unsafe blocks require approval from a central committee.
Honestly the scariest part is that shared mutable state is forbidden unless the state is doing the sharing.
Otherwise it is pretty ok.
Check out TNG on huggingface
They are a consultancy in Germany, but I watched a presentation on them tuning and removing bias from Deepseek models. It was quite interesting.
https://www.tngtech.com/en/about-us/news/release-of-deepseek...
(I upvoted your question as I agree)
Its not just code we need to worry about, its also subliminal messaging and other things.
Sounds like something that heretic or similar might be useful for?
https://github.com/p-e-w/heretic
Eh even corporate created LLMs are suspect to corporate biases. Nothing is safe.
Everything is the same is not a serious argument because they are not the same.