Regarding the section on Python and high-level CUDA, anyone interested should maybe first take a peek at Warp, which I’m guessing is too new to have a book yet. Warp lets you write CUDA kernels directly in Python, and it’s a breeze to get started. https://github.com/nvidia/warp
It's a bit confusing now with Numba Cuda also being officially maintained by Nvidia. Also Cuda Python, which looks older.
Which of these - warp, numba, cp, is the best bet for a beginner?
https://nvidia.github.io/numba-cuda/
https://developer.nvidia.com/cuda/python