Just chiming in - the claims above are real, I have very similar numbers in a cluster of 2x GX10 I have access to.
Instructions to reproduce, and benchmarks here: https://forums.developer.nvidia.com/t/deepseek-v4-flash-offi...
Just chiming in - the claims above are real, I have very similar numbers in a cluster of 2x GX10 I have access to.
Instructions to reproduce, and benchmarks here: https://forums.developer.nvidia.com/t/deepseek-v4-flash-offi...