Recent Activity

Zhenru  updated a model about 9 hours ago
Qwen/Qwen2.5-Math-7B-PRM800K
Zhenru  updated a model about 9 hours ago
Qwen/Qwen2.5-Math-PRM-72B
Zhenru  updated a model about 9 hours ago
Qwen/Qwen2.5-Math-PRM-7B

Qwen's activity

AdinaY 
posted an update about 2 hours ago
AdinaY 
posted an update about 19 hours ago
MiniMax, the company behind Hailuo_AI, has joined the open-source community by releasing both the models and demos of MiniMax-Text-01 & MiniMax-VL-01 🔥
- Model
MiniMaxAI/MiniMax-VL-01
MiniMaxAI/MiniMax-Text-01
- Demo
MiniMaxAI/MiniMax-VL-01
MiniMaxAI/MiniMax-Text-01

✨ MiniMax-Text-01:
- 456B total parameters, with 45.9B activated per token
- Combines Lightning Attention, Softmax Attention, and MoE
- Training context up to 1M tokens; inference handles up to 4M tokens

✨ MiniMax-VL-01:
- ViT-MLP-LLM framework (non-transformer 👀)
- Handles image inputs from 336×336 to 2016×2016
- 694M image-caption pairs + 512B tokens processed across 4 stages
AdinaY 
posted an update 1 day ago
MiniCPM-o 2.6 🔥 an end-side multimodal LLM released by OpenBMB, from the Chinese community
Model: openbmb/MiniCPM-o-2_6
✨ Real-time English/Chinese conversation, emotion control and ASR/STT
✨ Real-time video/audio understanding
✨ Processes up to 1.8M pixels, leads OCRBench & supports 30+ languages
Tonic 
posted an update 1 day ago
🙋🏻‍♂️ Hey there folks, Open LLM Europe just released the Lucie 7B-Instruct model, a bilingual instruct model trained on open data! You can check out my unofficial demo here while we wait for the official inference API from the group: Tonic/Lucie-7B. Hope you like it 🚀
KnutJaegersberg 
posted an update 2 days ago