@AdinaY on Hugging Face: "InternLM3-8B-instruct🔥 Trained on just 4T tokens, it outperforms Llama3.1-8B…"

AdinaY

posted an update about 4 hours ago

InternLM3-8B-instruct🔥 Trained on just 4T tokens, it outperforms Llama3.1-8B and Qwen2.5-7B on reasoning tasks, at 75% lower training cost!
internlm/internlm3-67875827c377690c01a9131d

AdinaY Adina Yakefu