Hacker News

mxwsn 6 hours ago

No, LLMs are trained on more tokens than they have parameters. That puts them in the classical, underparameterized regime: the first descent of the double-descent curve.
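For a rough sense of scale, here is a minimal sketch of the tokens-versus-parameters comparison. The model figures are approximate, publicly reported numbers that I'm assuming for illustration; they are not from the comment, and the helper name `tokens_per_param` is made up.

```python
# Approximate, publicly reported figures (assumptions for illustration only).
MODELS = {
    "GPT-3": {"params": 175e9, "tokens": 300e9},
    "Chinchilla": {"params": 70e9, "tokens": 1.4e12},
}


def tokens_per_param(params: float, tokens: float) -> float:
    """Return how many training tokens were seen per model parameter."""
    return tokens / params


ratios = {name: tokens_per_param(**m) for name, m in MODELS.items()}

for name, ratio in ratios.items():
    # A ratio above 1 means more training tokens than parameters.
    side = "more tokens than parameters" if ratio > 1 else "more parameters than tokens"
    print(f"{name}: {ratio:.1f} tokens per parameter ({side})")
```

Counting each token as one training example is itself a simplification, so treat the ratio as a back-of-the-envelope check rather than a precise statement about which regime a model is in.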