afiaka87/tortoise-tts

Generate speech from text, clone voices from mp3 files. From James Betker AKA "neonbjb".

Input

Configure the inputs for the AI model.

Seed

Random seed which can be used to reproduce results.

Text

Text to speak.

preset

Which voice preset to use. See the documentation for more information.

voice_a

Selects the voice to use for generation. Use `random` to select a random voice. Use `custom_voice` to use a custom voice.

voice_b

(Optional) Create new voice from averaging the latents for `voice_a`, `voice_b` and `voice_c`. Use `disabled` to disable voice mixing.

voice_c

(Optional) Create new voice from averaging the latents for `voice_a`, `voice_b` and `voice_c`. Use `disabled` to disable voice mixing.

Cvvp Amount

How much the CVVP model should influence the output. Increasing this can in some cases reduce the likelyhood of multiple speakers. Defaults to 0 (disabled)

Custom Voice

(Optional) Create a custom voice based on an mp3 file of a speaker. Audio should be at least 15 seconds, only contain one speaker, and be in mp3 format. Overrides the `voice_a` input.

Output

The generated output will appear here.

No output yet

Click "Generate" to create an output.