They don’t have the memory bandwidth
256 GB/s memory bandwidth is low but it still does around 40 tokens per sec with gpt-oss, good enough for local apps
256 GB/s memory bandwidth is low but it still does around 40 tokens per sec with gpt-oss, good enough for local apps