metadata
license: mit
pipeline_tag: audio-to-audio
FAcodec trained on 50k hours speech data, with more timbre diversity and better at reconstructing speakers from podcasts, videos, games or animations.
See main repository for example usages.
license: mit
pipeline_tag: audio-to-audio
FAcodec trained on 50k hours speech data, with more timbre diversity and better at reconstructing speakers from podcasts, videos, games or animations.
See main repository for example usages.