Hacker News

Does anyone use the super tiny models for anything ? Like in the 2billion or lower parameter level?

Speculative decoding[1]?