Hacker News

aetherspawn a day ago

Can you use the smaller Gemma 4B model as the draft model for speculative decoding with the larger 31B model?

Why/why not?
