view post Post 1363 Just created a Gradio space for playing with the new OAI realtime voice API! freddyaboulton/openai-realtime-voice See translation š 2 2 + Reply
view post Post 701 Gemini can talk š£ļøCheck out the new multimodal API from Google on @akhaliq 's anychat or my space. It's very fast and smart šhttps://huggingface.co/spaces/freddyaboulton/gemini-voicehttps://huggingface.co/spaces/akhaliq/anychat See translation 1 reply Ā· š 1 1 + Reply
view post Post 1977 Version 0.0.21 of gradio-pdf now properly loads chinese characters! See translation š„ 6 6 š 3 3 š¤ 3 3 + Reply
view post Post 1554 Hello Llama 3.2! š£ļøš¦ Build a Siri-like coding assistant that responds to "Hello Llama" in 100 lines of python! All with Gradio, webRTC š freddyaboulton/hey-llama-code-editor See translation š 8 8 š 4 4 + Reply
view post Post 1111 Just created a cookbook of real time audio/video spaces created using Gradio and WebRTC ā”ļø Use this and the [docs](https://freddyaboulton.github.io/gradio-webrtc/) to get started building the next gen of AI apps! freddyaboulton/gradio-webrtc-cookbook-6758ba7745aeca7b1be7de0f See translation 2 replies Ā· š„ 3 3 š 2 2 š 1 1 + Reply
Parler-TTS: fully open-source high-quality TTS Collection If you want to find out more about how these models were trained and even fine-tune them yourself, check-out the Parler-TTS repository on GitHub. ā¢ 8 items ā¢ Updated Dec 2, 2024 ā¢ 49