Hacker News

kevinlu1248 2 months ago [ - ]

Honestly I think we can improve our training throughput drastically via a few more optimizations but we've been spending most of our time on model quality improvements instead.