Open text-to-speech model needs just seconds of audio to clone your voice

Github:

Not sure if there are existing easily accessible solutions, but if not, while it could be useful for content creation, it will definitely be abused as well.

2 Likes