Hacker News
byzantinegene
5 hours ago
We're already doing that. It's called distillation, and it's how models like DeepSeek are trained.