"AI Systems Performance Engineering" might deserve a mention, even though it's not strictly CUDA.