Yeah, I've run a local Kokoro instance, and it doesn't work with Firefox. This uses Kokoro under the hood.

The demo clip is static and has the Kokoro output encoded as the audio track. It's not Kokoro running and generating it in your browser in real time.