We are thrilled to present the improved "ClearerVoice-Studio", an open-source platform designed to make speech processing easy to use for everyone! Whether you're working on speech enhancement, speech separation, speech super-resolution, or target speaker extraction, this unified platform has you covered.
**Why Choose ClearerVoice-Studio?**
- Pre-Trained Models: Includes cutting-edge pre-trained models, fine-tuned on extensive, high-quality datasets. No need to start from scratch!
- Ease of Use: Designed for seamless integration with your projects, offering a simple yet flexible interface for inference and training.
- Enhance noisy speech recordings to achieve crystal-clear quality.
- Separate speech from complex audio mixtures with ease.
- Transform low-resolution audio into high-resolution audio. A fully upscaled LJSpeech-1.1-48kHz dataset can be downloaded from alibabasglab/LJSpeech-1.1-48kHz.
- Extract target speaker voices with precision using audio-visual models.
**Join Us in Growing ClearerVoice-Studio!**
We believe in the power of open-source collaboration. By starring our GitHub repository and sharing ClearerVoice-Studio with your network, you can help us grow this community-driven platform.
**Support us by:**
- Starring the repository on GitHub.
- Exploring and contributing to our codebase.
- Sharing your feedback and use cases to make the platform even better.
- Joining our community discussions to exchange ideas and innovations.

Together, let's push the boundaries of speech processing! Thank you for your support! :sparkling_heart:
**ClearerVoice-Studio New Feature: Speech Super-Resolution with MossFormer2!**

We're excited to announce that ClearerVoice-Studio now supports speech super-resolution, powered by our latest MossFormer2-based model!

**What's New?**

- Convert Low-Resolution to High-Resolution Audio: Transform low-resolution audio (effective sampling rate ≥ 16 kHz) into crystal-clear, high-resolution audio at 48 kHz.
- Cutting-Edge Technology: Leverages the MossFormer2 model plus HiFi-GAN, optimised for generating high-quality audio with enhanced perceptual clarity.
- Enhanced Listening Experience: Perfect for speech enhancement, content restoration, and high-fidelity audio applications.
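To make the 16 kHz and 48 kHz figures concrete, here is a minimal sketch of the arithmetic behind them. It does not call ClearerVoice-Studio at all. It uses only the Python standard library, and every name in it (`describe_upsampling`, `make_test_tone`, `wav_sample_rate`) is hypothetical and introduced purely for illustration. It also assumes that "effective sampling rate" means twice the highest frequency actually present in the signal, which may differ from the nominal rate stored in a file header.

```python
import io
import math
import struct
import wave

MIN_INPUT_RATE_HZ = 16_000  # lower bound quoted in the announcement
TARGET_RATE_HZ = 48_000     # output rate quoted in the announcement


def describe_upsampling(input_rate_hz: int) -> dict:
    """Summarise the band a 48 kHz super-resolution pass would need to fill in."""
    if input_rate_hz < MIN_INPUT_RATE_HZ:
        raise ValueError(
            f"{input_rate_hz} Hz is below the {MIN_INPUT_RATE_HZ} Hz minimum"
        )
    input_bw = input_rate_hz // 2    # Nyquist limit of the input
    target_bw = TARGET_RATE_HZ // 2  # Nyquist limit of the 48 kHz output
    return {
        "input_bandwidth_hz": input_bw,
        "target_bandwidth_hz": target_bw,
        # Frequencies absent from the input that a model would have to generate.
        "missing_band_hz": (input_bw, target_bw) if input_bw < target_bw else None,
        "rate_ratio": TARGET_RATE_HZ / input_rate_hz,
    }


def make_test_tone(rate_hz: int, seconds: float = 0.1, freq_hz: float = 440.0) -> bytes:
    """Build a short mono 16-bit sine-wave WAV in memory (illustrative helper)."""
    n_frames = int(rate_hz * seconds)
    samples = (
        int(0.5 * 32767 * math.sin(2 * math.pi * freq_hz * i / rate_hz))
        for i in range(n_frames)
    )
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wf:
        wf.setnchannels(1)
        wf.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wf.setframerate(rate_hz)
        wf.writeframes(b"".join(struct.pack("<h", s) for s in samples))
    return buf.getvalue()


def wav_sample_rate(wav_bytes: bytes) -> int:
    """Read the nominal sampling rate from a WAV header."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wf:
        return wf.getframerate()


if __name__ == "__main__":
    rate = wav_sample_rate(make_test_tone(16_000))
    print(describe_upsampling(rate))
```

For a 16 kHz input, the sketch reports that everything between 8 kHz and 24 kHz is missing. Plain resampling cannot restore that band, which is the gap a super-resolution model is meant to fill.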
**Try It Out!**

Upgrade to the latest version of ClearerVoice-Studio (https://github.com/modelscope/ClearerVoice-Studio) to experience this powerful feature. Check out the updated documentation and examples in our repository.
Let us know your thoughts, feedback, or feature requests in the Issues section.