Hacker News

Y

Hacker News

new | ask | show | jobs

qcnguy 2 days ago [ - ]

Anthropic have done some great work on neural interpretability that gets at the core of this problem.