afiaka87/tortoise-tts

Generate speech from text and clone voices from mp3 files. From James Betker, AKA "neonbjb".

Input
Configure the inputs for the AI model.

Random seed which can be used to reproduce results.

Text to speak.

Which voice preset to use. See the documentation for more information.

Selects the voice to use for generation. Use `random` to select a random voice. Use `custom_voice` to use a custom voice.

(Optional) Create a new voice by averaging the latents of `voice_a`, `voice_b` and `voice_c`. Use `disabled` to disable voice mixing.

(Optional) Create a new voice by averaging the latents of `voice_a`, `voice_b` and `voice_c`. Use `disabled` to disable voice mixing.
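
The voice mixing described above can be pictured as an element-wise mean over the latents of every voice that is not set to `disabled`. The following is a minimal sketch only: it uses plain lists of floats as stand-ins for the model's real latents, and the function name `mix_voice_latents` and the `latents_by_voice` lookup table are hypothetical, not part of the model's interface.

```python
from typing import Dict, List, Sequence

DISABLED = "disabled"  # sentinel value taken from the input description


def mix_voice_latents(
    latents_by_voice: Dict[str, Sequence[float]],  # hypothetical lookup table
    voice_a: str,
    voice_b: str = DISABLED,
    voice_c: str = DISABLED,
) -> List[float]:
    """Average the latents of voice_a and any enabled voice_b / voice_c."""
    # voice_a is always used; voice_b and voice_c only when not disabled.
    selected = [voice_a] + [v for v in (voice_b, voice_c) if v != DISABLED]
    vectors = [latents_by_voice[name] for name in selected]

    # All latents must have the same length to be averaged element-wise.
    length = len(vectors[0])
    if any(len(vec) != length for vec in vectors):
        raise ValueError("all latents must have the same length")

    # Element-wise mean across the selected voices.
    return [sum(values) / len(vectors) for values in zip(*vectors)]
```

With both optional voices left at `disabled`, the result is simply the latent of `voice_a`, which matches the idea that `disabled` turns mixing off.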

How much the CVVP model should influence the output, from 0 to 1. Increasing this can in some cases reduce the likelihood of multiple speakers. Defaults to 0 (disabled).

(Optional) Create a custom voice based on an mp3 file of a speaker. Audio should be at least 15 seconds long, contain only one speaker, and be in mp3 format. Overrides the `voice_a` input.
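
As a rough illustration of how the inputs above fit together, the sketch below assembles them into a single dictionary and checks the constraints stated in the descriptions. Several things here are assumptions rather than facts from this page: the key names `text`, `seed`, `preset`, `cvvp_amount` and `custom_voice` (only `voice_a`, `voice_b` and `voice_c` are named above), the 0 to 1 range for the CVVP amount (taken from the slider bounds shown on the page), and the choice to model the override by setting `voice_a` to `custom_voice`. The helper `build_inputs` is hypothetical and does not call any service.

```python
from typing import Any, Dict, Optional


def build_inputs(
    text: str,
    voice_a: str = "random",
    voice_b: str = "disabled",
    voice_c: str = "disabled",
    cvvp_amount: float = 0.0,   # assumed key name; 0 means disabled
    seed: int = 0,              # assumed key name
    preset: Optional[str] = None,             # assumed key name, passed through
    custom_voice_path: Optional[str] = None,  # path to an mp3 of one speaker
) -> Dict[str, Any]:
    """Collect and validate the model inputs described above (a sketch)."""
    if not text.strip():
        raise ValueError("text must not be empty")
    if not 0.0 <= cvvp_amount <= 1.0:
        raise ValueError("cvvp_amount must be between 0 and 1")

    inputs: Dict[str, Any] = {
        "text": text,
        "voice_a": voice_a,
        "voice_b": voice_b,
        "voice_c": voice_c,
        "cvvp_amount": cvvp_amount,
        "seed": seed,
    }
    if preset is not None:
        inputs["preset"] = preset

    if custom_voice_path is not None:
        # The description requires mp3 format for the custom voice.
        if not custom_voice_path.lower().endswith(".mp3"):
            raise ValueError("custom voice must be an mp3 file")
        # The custom voice overrides voice_a (modelled here as an assumption).
        inputs["voice_a"] = "custom_voice"
        inputs["custom_voice"] = custom_voice_path

    return inputs
```

The function cannot verify the 15-second minimum or the single-speaker requirement, since both depend on the audio content rather than the file name.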

Output
The generated output will appear here.
