Hacker News

Y

Hacker News

new | ask | show | jobs

WithinReason 4 days ago [ - ]

Unless you train them with RL in the right task specifically