Hacker News

Y

Hacker News

new | ask | show | jobs

127 4 hours ago [ - ]

I get 150t/s peak, 120t/s avg with Qwen3.6 27B Q4 with a 4090 on Linux. Now that MTP has landed into llama.cpp.