adirik/styletts2

Generates speech from text

Input
Configure the inputs for the AI model.
0
1

Only used for long text inputs or in case of reference speaker, determines the prosody of the speaker. Use lower values to sample style based on previous or reference speech instead of text.

Seed for reproducibility

Text to convert to speech

0
1

Only used for long text inputs or in case of reference speaker, determines the timbre of the speaker. Use lower values to sample style based on previous or reference speech instead of text.

Replicate weights url for inference with model that is fine-tuned on new speakers. If provided, a reference speech must also be provided. If not provided, the default model will be used.

Reference speech to copy style from

0
50

Number of diffusion steps

0
5

Embedding scale, use higher values for pronounced emotion

Output
The generated output will appear here.

No output yet

Click "Generate" to create an output.

styletts2 - ikalos.ai