Hacker News

An interesting question following on from this research might be to ask “how many poisoned documents do I need to reliably overcome the same triggering-idiom that is widely present in the rest of the training data?”

e.g. how many times do I need to give poisoned examples of

if err != nil { <bad code> }

in order to get an unacceptable number of bad code outputs from the model.