Cuiunbo PRO

AI & ML interests

Anything

Recent Activity

replied to hexgrad's post about 20 hours ago

📣 Looking for labeled, high-quality synthetic audio/TTS data 📣

Have you been or are you currently calling API endpoints from OpenAI, ElevenLabs, etc? Do you have labeled audio data sitting around gathering dust? Let's talk! Join https://discord.gg/QuGxSWBfQy or comment down below.

If your data exceeds quantity & quality thresholds and is approved into the next https://hf.co/hexgrad/Kokoro-82M training mix, and you permissively DM me the data under an effective Apache license, then I will DM back the corresponding voicepacks for YOUR data if/when the next Apache-licensed Kokoro base model drops.

What does this mean? If you've been calling closed-source TTS or audio API endpoints to:
- Build voice agents
- Make long-form audio, like audiobooks or podcasts
- Handle customer support, etc

Then YOU can contribute to the training mix and get useful artifacts in return. ❤️

More details at https://hf.co/hexgrad/Kokoro-82M/discussions/21

liked a model about 22 hours ago

openbmb/MiniCPM-o-2_6

updated a model 1 day ago

openbmb/MiniCPM-o-2_6

Posts 1

Introducing GUICourse! 🎉
By leveraging extensive OCR pretraining with grounding ability, we unlock the potential of parsing-free methods for GUIAgent.
📄 Paper: GUICourse: From General Vision Language Models to Versatile GUI Agents (2406.11317)
🌐 GitHub Repo: https://github.com/yiye3/GUICourse
📖 Dataset: yiye2023/GUIAct / yiye2023/GUIChat / yiye2023/GUIEnv
🎯 Model: RhapsodyAI/minicpm-guidance / RhapsodyAI/qwen_vl_guidance

Collections 5

Papers 3

arxiv:2410.10594

arxiv:2408.01800

arxiv:2406.11317

Models

None public yet

Datasets

None public yet