Post
348
InternLM3-8B-instruct🔥 Trained on just 4T tokens, it outperforms Llama3.1-8B and Qwen2.5-7B in reasoning tasks, at 75% lower cost!
internlm/internlm3-67875827c377690c01a9131d
internlm/internlm3-67875827c377690c01a9131d