Hacker News

Has anyone taken these open weight models from China and stripped the CCP out of them? I do not mean that snarkily, I mean review them thoroughly using techniques for weight introspection (concept activations) in response to things that one might expect would trigger deceptive/malicious behavior if the CCP had actually tried to implant context-specific behaviors (e.g. the accusation of generating vulnerable code if being used in American government applications, which I don't know if it was ever proven).

Just in case there are those who'd reflexively down vote this post, I'd just like to say that in a time of great national geopolitical rivalries, this kind of question is not unreasonable one to ask. Indeed, its applicable question whichever nation you live in.

dev_l1x_be 4 hours ago [ - ]

> Has anyone taken these open weight models from China and stripped the CCP out of them?

The CCP is not influencing my Rust code quality that much. Though I did notice all my lifetimes are now 'static because nothing is ever allowed to leave the party's ownership, unsafe blocks require approval from a central committee.

Honestly the scariest part is that shared mutable state is forbidden unless the state is doing the sharing.

Otherwise it is pretty ok.

tomaytotomato an hour ago [ - ]

Check out TNG on huggingface

They are a consultancy in Germany, but I watched a presentation on them tuning and removing bias from Deepseek models. It was quite interesting.

https://www.tngtech.com/en/about-us/news/release-of-deepseek...

(I upvoted your question as I agree)

Its not just code we need to worry about, its also subliminal messaging and other things.

justinclift 4 hours ago [ - ]

Sounds like something that heretic or similar might be useful for?

https://github.com/p-e-w/heretic

threethirtytwo 4 hours ago [ - ]

Eh even corporate created LLMs are suspect to corporate biases. Nothing is safe.

SubiculumCode 4 hours ago [ - ]

Everything is the same is not a serious argument because they are not the same.