Theoretical FP32 performance of the AMD EPYC 9965 is about double that of the A100: 41.2 TFLOP/s vs 19.5 TFLOP/s

Isn't that because the A100 is optimized for memory bandwidth per TFLOP rather than for raw FP32 throughput?
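
For anyone who wants to put rough numbers on that question, here is a quick back-of-the-envelope sketch. The FP32 peaks are the ones quoted above; the memory bandwidths are my own assumptions and not from this thread (12 channels of DDR5-6000 per EPYC socket, and the 40 GB A100's HBM2 figure), so swap in whatever configuration you actually care about.

```python
# Bandwidth available per unit of peak FP32 compute.
# FP32 peaks: figures quoted in the thread.
# Memory bandwidths: ASSUMPTIONS, not from the thread.
PEAK_TFLOPS = {"EPYC 9965": 41.2, "A100": 19.5}

MEM_BW_GBS = {
    # assumed: 12 DDR5 channels x 6000 MT/s x 8 bytes per transfer
    "EPYC 9965": 12 * 6000e6 * 8 / 1e9,
    # assumed: 40 GB A100 (HBM2)
    "A100": 1555.0,
}


def gb_per_s_per_tflop(name):
    """Theoretical memory bandwidth (GB/s) per TFLOP/s of peak FP32."""
    return MEM_BW_GBS[name] / PEAK_TFLOPS[name]


for name in PEAK_TFLOPS:
    print(f"{name}: {gb_per_s_per_tflop(name):.1f} GB/s per TFLOP/s")
```

Under those assumed figures the A100 ends up with considerably more bandwidth per TFLOP/s than the EPYC, which is at least consistent with the question. These are theoretical peaks on both sides, so treat the result as a sanity check rather than a benchmark.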