Hacker News

Y

Hacker News

new | ask | show | jobs

3abiton 5 hours ago [ - ]

To be fair, llama.cpp had this feature for over a year now. It just applies to GGUF.