ⓍTTS trained on Roxy's voice

How to use

Init from XttsConfig and load checkpoint:

import torchaudio
from TTS.tts.configs.xtts_config import XttsConfig
from TTS.tts.models.xtts import Xtts

SPEAKER_REFERENCE = "reference.wav"

config = XttsConfig()
config.load_json("config.json")
model = Xtts.init_from_config(config)
model.load_checkpoint(config, checkpoint_path="best_model_180.pth", vocab_path="vocab.json", use_deepspeed=False)
model.cuda()

gpt_cond_latent, speaker_embedding = model.get_conditioning_latents(audio_path=[SPEAKER_REFERENCE]) # Use sample of Roxy's voice here

out = model.inference(
    "It took me quite a long time to develop a voice and now that I have it I am not going to be silent.",
    "en",
    gpt_cond_latent,
    speaker_embedding,
    temperature=0.7, # Add custom parameters here
)
torchaudio.save("output.wav", torch.tensor(out["wav"]).unsqueeze(0), 24000)
Downloads last month
4
Inference API
Unable to determine this model's library. Check the docs .

Model tree for Anexdeus/Roxy_Migurdia_coqui_XTTS

Base model

coqui/XTTS-v2
Finetuned
(30)
this model