But you're using a third-party quant of unknown quality. Nvidia only provides the weights in BF16 and FP8.