This is LiDAR-style 3D output (multimodal) generated from 2D images.

LiDAR is the technology used for spatial capture. Its output is just point clouds of surfaces, so they’re generating surface point clouds from video.
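
To make "point cloud of surfaces" concrete, here is a rough sketch of one common way to get 3D points out of a 2D frame. It assumes you already have a per-pixel depth estimate and a pinhole camera model, which is my assumption and not necessarily how they do it. The function name `backproject` and the intrinsics `fx`, `fy`, `cx`, `cy` are placeholders for illustration.

```python
def backproject(depth, fx, fy, cx, cy):
    """Turn a depth map into a list of (x, y, z) surface points.

    depth  : list of rows, each a list of depths along the camera z-axis
    fx, fy : focal lengths in pixels (assumed known)
    cx, cy : principal point in pixels (assumed known)
    """
    points = []
    for v, row in enumerate(depth):        # v = pixel row
        for u, z in enumerate(row):        # u = pixel column
            if z <= 0:                     # no valid depth at this pixel
                continue
            # Pinhole model: scale the pixel offset from the principal
            # point by depth to recover camera-space x and y.
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z))
    return points


# Toy 2x2 "frame": three pixels with depth, one without.
depth = [[2.0, 2.0],
         [0.0, 4.0]]
cloud = backproject(depth, fx=1.0, fy=1.0, cx=0.5, cy=0.5)
```

Each entry in `cloud` is one point on a visible surface, which is the same kind of data a LiDAR scan gives you. Doing this per video frame and merging the results would give a denser cloud, but that also needs camera poses, which this sketch leaves out.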