Hacker News

Y

Hacker News

new | ask | show | jobs

ac29 19 hours ago [ - ]

The inference providers are running batch sizes much larger than 10