
Is Piper currently the best open source TTS model? I occasionally review open models to see if they match ElevenLabs, and so far I have been disappointed. However, Piper sounds better than it did the last time I listened.


Listening to the Piper demos [1] and comparing them to Coqui [2], I'd say Coqui sounds better to me, but I'd love to hear others' opinions. Piper's latest commits were 3 months ago [3] while Coqui's were 8 months ago [4], so they seem similar in recency. In terms of ease of use though, especially with this project, Piper seems way less overwhelming to me.

[1] https://rhasspy.github.io/piper-samples/

[2] https://huggingface.co/spaces/coqui/xtts

[3] https://github.com/rhasspy/piper

[4] https://github.com/coqui-ai/TTS


For anyone who is interested, CoquiTTS (formerly MozillaTTS) was great, but the project isn't maintained anymore, although there's been some confusion about whether or not it's active (see: https://github.com/coqui-ai/TTS/issues/4022).

Looks like there's an effort to keep an actively maintained fork here, though: https://github.com/idiap/coqui-ai-TTS



