Hacker News

new | ask | show | jobs

agenticup 14 hours ago [ - ]

qwen 3.6 27b and qen35b a3b work like magic, if we get dpark speculative decoding versions of these models it will further improve the throughput