Yes, non-uniform workloads are supported! See `for_n_dynamic`.
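For readers wondering what dynamic scheduling buys you here: below is a rough, generic sketch of the idea, not the library's actual `for_n_dynamic` code or API. The helper name `dynamic_for_n` and its signature are made up for illustration. Each thread claims the next unprocessed index from a shared atomic counter, so threads that land on cheap items simply come back for more.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <thread>
#include <vector>

// Hypothetical helper, NOT the fork_union API: calls `task(i)` once for every
// i in [0, n), with `threads` workers pulling indices from a shared counter.
template <typename task_type>
void dynamic_for_n(std::size_t n, std::size_t threads, task_type task) {
    assert(threads > 0 && "need at least one worker");
    std::atomic<std::size_t> next{0};

    auto worker = [&] {
        // fetch_add hands out every index exactly once; a thread that
        // finishes a cheap item early just claims the next one.
        for (std::size_t i = next.fetch_add(1, std::memory_order_relaxed); i < n;
             i = next.fetch_add(1, std::memory_order_relaxed))
            task(i);
    };

    std::vector<std::thread> helpers;
    for (std::size_t t = 1; t < threads; ++t) helpers.emplace_back(worker);
    worker(); // the calling thread takes part as well
    for (auto &helper : helpers) helper.join();
}
```

Unlike this sketch, which spawns and joins threads on every call, a real pool keeps its threads alive between calls. Only the index-claiming loop is the point here.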

The threads “busy-wait” by running an infinite loop in a lower energy state on modern CPUs.
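To make that concrete, here is a minimal sketch of such a loop. It assumes the "lower energy state" refers to spin-loop hint instructions such as `PAUSE` on x86 and `YIELD` on 64-bit Arm. The helper names `cpu_relax` and `spin_until_set` are made up for illustration and are not taken from the repository.

```cpp
#include <atomic>
#include <thread>
#if defined(__x86_64__) || defined(__i386__) || defined(_M_X64) || defined(_M_IX86)
#include <immintrin.h>
#endif

// Hypothetical helper: tells the core we are inside a spin loop.
inline void cpu_relax() noexcept {
#if defined(__x86_64__) || defined(__i386__) || defined(_M_X64) || defined(_M_IX86)
    _mm_pause(); // x86 PAUSE: spin-wait hint, stays in user space
#elif defined(__aarch64__)
    __asm__ __volatile__("yield"); // Arm YIELD: similar hint
#else
    std::this_thread::yield(); // portable fallback: goes through the OS scheduler
#endif
}

// Hypothetical helper: spins until another thread sets `flag` to true.
// The thread never blocks in the kernel, it just keeps polling.
inline void spin_until_set(std::atomic<bool> const &flag) noexcept {
    while (!flag.load(std::memory_order_acquire)) cpu_relax();
}
```

A waiting worker would call `spin_until_set(ready)` and the thread publishing work would do `ready.store(true, std::memory_order_release)`.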

And yes, the repository itself has more details on the actual implementation. This section, for example, describes the atomic variables that control all of the logic: https://github.com/ashvardanian/fork_union?tab=readme-ov-fil...

> The threads “busy-wait” by running an infinite loop in a lower energy state on modern CPUs.

Doesn't that still use part of the process's timeslots from the OS scheduler's POV?