Hugging Face
Ashvini Kumar Jindal
akjindal53244
61 followers
·
4 following
AI & ML interests
NLP
Recent Activity
new
activity
about 1 month ago
akjindal53244/Llama-3.1-Storm-8B:
Adding Evaluation Results
reacted to albertvillanova's post with 👍
3 months ago
🚨 We’ve just released a new tool to compare the performance of models in the 🤗 Open LLM Leaderboard: the Comparator 🎉
https://huggingface.co/spaces/open-llm-leaderboard/comparator

Want to see how two different versions of LLaMA stack up? Let’s walk through a step-by-step comparison of LLaMA-3.1 and LLaMA-3.2. 🦙🧵👇

1/ Load the Models' Results
- Go to the 🤗 Open LLM Leaderboard Comparator: https://huggingface.co/spaces/open-llm-leaderboard/comparator
- Search for "LLaMA-3.1" and "LLaMA-3.2" in the model dropdowns.
- Press the Load button.
Ready to dive into the results!

2/ Compare Metric Results in the Results Tab 📊
- Head over to the Results tab.
- Here, you’ll see the performance metrics for each model, beautifully color-coded using a gradient to highlight performance differences: greener is better! 🌟
- Want to focus on a specific task? Use the Task filter to hone in on comparisons for tasks like BBH or MMLU-Pro.

3/ Check Config Alignment in the Configs Tab ⚙️
- To ensure you’re comparing apples to apples, head to the Configs tab.
- Review both models’ evaluation configurations, such as metrics, datasets, prompts, few-shot configs...
- If something looks off, it’s good to know before drawing conclusions! ✅

4/ Compare Predictions by Sample in the Details Tab 🔍
- Curious about how each model responds to specific inputs? The Details tab is your go-to!
- Select a Task (e.g., MuSR) and then a Subtask (e.g., Murder Mystery) and then press the Load Details button.
- Check out the side-by-side predictions and dive into the nuances of each model’s outputs.

5/ With this tool, it’s never been easier to explore how small changes between model versions affect performance on a wide range of tasks. Whether you’re a researcher or enthusiast, you can instantly visualize improvements and dive into detailed comparisons. 🚀

Try the 🤗 Open LLM Leaderboard Comparator now and take your model evaluations to the next level!
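The config-alignment check described in step 3 of the post can also be sketched in code. This is a minimal illustration, not the Comparator's implementation: it assumes each model's evaluation config is already available as a plain dictionary, and the function name `config_diff`, the keys, and the values are all hypothetical.

```python
def config_diff(config_a, config_b):
    """Return {key: (value_a, value_b)} for every key whose values differ."""
    missing = object()  # sentinel: a key absent on one side counts as a difference
    diff = {}
    for key in sorted(set(config_a) | set(config_b)):
        a = config_a.get(key, missing)
        b = config_b.get(key, missing)
        if a != b:
            # Report an absent key as None so the result stays easy to print.
            diff[key] = (None if a is missing else a, None if b is missing else b)
    return diff


# Hypothetical configs: keys and values are illustrative only,
# not taken from the leaderboard.
config_a = {"task": "bbh", "num_fewshot": 3, "metric": "acc_norm"}
config_b = {"task": "bbh", "num_fewshot": 5, "metric": "acc_norm"}

mismatches = config_diff(config_a, config_b)
print(mismatches)
```

An empty result suggests the two runs used the same settings; any entry is a reason to be cautious before comparing scores.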
new
activity
4 months ago
akjindal53244/Llama-3.1-Storm-8B:
Languages report ?
Articles
Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging
Aug 19, 2024
•
75
akjindal53244's activity
liked
a dataset
4 months ago
HuggingFaceTB/smollm-corpus
Updated
Sep 6, 2024
•
237M
•
9.33k
•
274
liked
a model
4 months ago
mattshumer/Reflection-Llama-3.1-70B
Text Generation
•
Updated
Sep 24, 2024
•
656
•
1.71k
liked
a Space
5 months ago
Runtime error
8
👁
Lama Storm 8b
liked
2 datasets
6 months ago
arcee-ai/agent-data
Updated
Jul 22, 2024
•
486k
•
358
•
51
Magpie-Align/Magpie-Llama-3.1-Pro-300K-Filtered
Updated
Aug 28, 2024
•
300k
•
95
•
12
liked
2 models
6 months ago
meta-llama/Llama-3.1-8B-Instruct
Text Generation
•
Updated
Sep 25, 2024
•
5.4M
•
3.45k
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct
Text Generation
•
Updated
Oct 31, 2024
•
4.24k
•
13
liked
a dataset
6 months ago
arcee-ai/The-Tome
Updated
Aug 15, 2024
•
1.75M
•
380
•
82
liked
a model
7 months ago
AkshitaS/bhasha-embed-v0
Sentence Similarity
•
Updated
Jul 11, 2024
•
259
•
6
liked
3 datasets
7 months ago
nvidia/HelpSteer2
Updated
28 days ago
•
21.4k
•
22.2k
•
394
HuggingFaceFW/fineweb-edu-llama3-annotations
Updated
Jun 3, 2024
•
467k
•
203
•
39
gaia-benchmark/GAIA
Updated
Mar 26, 2024
•
932
•
626
•
174
liked
a model
9 months ago
mistral-community/Mixtral-8x22B-v0.1
Text Generation
•
Updated
Jul 1, 2024
•
2.68k
•
673
liked
a dataset
10 months ago
WizardLMTeam/WizardLM_evol_instruct_V2_196k
Updated
Mar 10, 2024
•
143k
•
202
•
232
liked
a Space
11 months ago
Runtime error
31
🐢
LLM Training Cost Calculator
liked
2 models
about 1 year ago
meta-math/MetaMath-Mistral-7B
Text Generation
•
Updated
Dec 21, 2023
•
2.6k
•
95
mistralai/Mistral-7B-v0.1
Text Generation
•
Updated
Jul 24, 2024
•
3.21M
•
3.53k
liked
a Space
over 1 year ago
Running
on
CPU Upgrade
12.2k
🏆
Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots