Hacker News

CuriouslyC 8 hours ago

It actually happens more with these large overparameterized models, because they have the capacity to memorize more than smaller models.
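
A rough way to see the capacity point, as a toy sketch rather than anything about a specific model: fit purely random labels with a linear model and compare training error when there are fewer parameters than samples versus more. The function name `train_mse` and the sizes (50 samples, 5 vs. 200 parameters) are made up for illustration. Random labels have no pattern to learn, so any fit to them is memorization.

```python
import numpy as np

def train_mse(n_samples, n_params, seed=0):
    """Training error of a least-squares fit to random labels (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    # Random features and random +/-1 labels: there is no signal, only noise.
    X = rng.standard_normal((n_samples, n_params))
    y = rng.choice([-1.0, 1.0], size=n_samples)
    # lstsq returns the minimum-norm solution when the system is underdetermined.
    w, *_ = np.linalg.lstsq(X, y, rcond=None)
    return float(np.mean((X @ w - y) ** 2))

if __name__ == "__main__":
    # Fewer parameters than samples: the noise cannot be fit exactly.
    print("small model (5 params):  ", train_mse(50, 5))
    # More parameters than samples: the noise is interpolated (error near zero).
    print("large model (200 params):", train_mse(50, 200))
```

The overparameterized fit drives training error to roughly zero on labels that carry no information, while the small one cannot. Real networks are far from linear, so treat this only as an intuition pump for the claim above.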