Hacker News

nl 2 days ago

This ignores that reinforcement learning radically changes the training objective.