Yes, it's better on the Spark but the M5 is a lot closer than before with neural accelrators. After prompt processing, token generation speed on the M5 Max is 2.3x faster.
No Apple markup but you get the Nvidia market up instead. Prior to the recent Apple price increase due to RAM shortage, an M5 Max 128GB was a bargain if you want to run local LLMs.
DGX Spark is one, but really depends on how much you want to spend
273GB/s bandwidth vs 614 GB/s of the M5 Max. And you're getting a whole laptop.
$5k for DGX Spark as well.
Prompt processing time is better on the spark, which aligns more with coding (more reading than writing).
I spent less than $4k, OEM are better boxes for cooling, no apple markup, I get a real Linux system for stuff like k3s.
Yes, it's better on the Spark but the M5 is a lot closer than before with neural accelrators. After prompt processing, token generation speed on the M5 Max is 2.3x faster.
No Apple markup but you get the Nvidia market up instead. Prior to the recent Apple price increase due to RAM shortage, an M5 Max 128GB was a bargain if you want to run local LLMs.
I can get 2.5 spark for the price of the M5, will have better throughput and access to bigger models (more vram when running tensor parallel)