Sorting the gaussians is the compute heavy part in gaussian splatting. So, Im guessing this will give only marginal improvement in terms rendering speed.

I'm not sure it does a sort. Each group of threads only handles a select number of gaussians

Yea, I think avoiding sorting is kinda the whole point here