Hacker News

How does this compare to Piper?

Appears to use a proprietary codec as well.

Piper is a VAE model, which sounds quite robotic. This is a speech language model, which sounds quite realistic.

You can listen to the model in this video: https://www.youtube.com/watch?v=YAB3hCtu5wE

Then may I suggest that this should be edited?

> Audio Codec: NeuCodec - our proprietary neural audio codec that achieves exceptional audio quality at low bitrates using a single codebook

This says it was trained on proprietary data.