And then someone needs to cross FFI border multiple times and gained perf is hurting again.

If what one's doing in scientific computing needs to cross the FFI border multiple times, they're doing it wrong...