Zyphra's ZONOS2 — an expressive multilingual text-to-speech model with high-fidelity voice cloning, trained on 6M+ hours of speech. Upload or record a few seconds of a voice and it will speak your text. Blog · Code