12 6 18

Ashvini Kumar Jindal

akjindal53244

AI & ML interests

NLP

Recent Activity

new activity about 1 month ago

akjindal53244/Llama-3.1-Storm-8B:Adding Evaluation Results

reacted to albertvillanova's post with 👍 3 months ago

🚨 We’ve just released a new tool to compare the performance of models in the 🤗 Open LLM Leaderboard: the Comparator 🎉 https://huggingface.co/spaces/open-llm-leaderboard/comparator Want to see how two different versions of LLaMA stack up? Let’s walk through a step-by-step comparison of LLaMA-3.1 and LLaMA-3.2. 🦙🧵👇 1/ Load the Models' Results - Go to the 🤗 Open LLM Leaderboard Comparator: https://huggingface.co/spaces/open-llm-leaderboard/comparator - Search for "LLaMA-3.1" and "LLaMA-3.2" in the model dropdowns. - Press the Load button. Ready to dive into the results! 2/ Compare Metric Results in the Results Tab 📊 - Head over to the Results tab. - Here, you’ll see the performance metrics for each model, beautifully color-coded using a gradient to highlight performance differences: greener is better! 🌟 - Want to focus on a specific task? Use the Task filter to hone in on comparisons for tasks like BBH or MMLU-Pro. 3/ Check Config Alignment in the Configs Tab ⚙️ - To ensure you’re comparing apples to apples, head to the Configs tab. - Review both models’ evaluation configurations, such as metrics, datasets, prompts, few-shot configs... - If something looks off, it’s good to know before drawing conclusions! ✅ 4/ Compare Predictions by Sample in the Details Tab 🔍 - Curious about how each model responds to specific inputs? The Details tab is your go-to! - Select a Task (e.g., MuSR) and then a Subtask (e.g., Murder Mystery) and then press the Load Details button. - Check out the side-by-side predictions and dive into the nuances of each model’s outputs. 5/ With this tool, it’s never been easier to explore how small changes between model versions affect performance on a wide range of tasks. Whether you’re a researcher or enthusiast, you can instantly visualize improvements and dive into detailed comparisons. 🚀 Try the 🤗 Open LLM Leaderboard Comparator now and take your model evaluations to the next level!

new activity 4 months ago

akjindal53244/Llama-3.1-Storm-8B:Languages report ?

View all activity

Articles

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Aug 19, 2024

• 75

Organizations

akjindal53244's activity

liked a dataset 4 months ago

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6, 2024 • 237M • 9.33k • 274

liked a model 4 months ago

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24, 2024 • 656 • 1.71k

liked a Space 5 months ago

Runtime error

👁

Lama Storm 8b

liked 2 datasets 6 months ago

arcee-ai/agent-data

Viewer • Updated Jul 22, 2024 • 486k • 358 • 51

Magpie-Align/Magpie-Llama-3.1-Pro-300K-Filtered

Viewer • Updated Aug 28, 2024 • 300k • 95 • 12

liked 2 models 6 months ago

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 5.4M • • 3.45k

princeton-nlp/Llama-3-8B-ProLong-64k-Instruct

Text Generation • Updated Oct 31, 2024 • 4.24k • 13

liked a dataset 6 months ago

arcee-ai/The-Tome

Viewer • Updated Aug 15, 2024 • 1.75M • 380 • 82

liked a model 7 months ago

AkshitaS/bhasha-embed-v0

liked 3 datasets 7 months ago

liked a model 9 months ago

mistral-community/Mixtral-8x22B-v0.1

Text Generation • Updated Jul 1, 2024 • 2.68k • 673

liked a dataset 10 months ago

WizardLMTeam/WizardLM_evol_instruct_V2_196k

Viewer • Updated Mar 10, 2024 • 143k • 202 • 232

liked a Space 11 months ago

Runtime error

🐢

LLM Training Cost Calculator

liked 2 models about 1 year ago

meta-math/MetaMath-Mistral-7B

Text Generation • Updated Dec 21, 2023 • 2.6k • 95

mistralai/Mistral-7B-v0.1

Text Generation • Updated Jul 24, 2024 • 3.21M • 3.53k

liked a Space over 1 year ago

Running on CPU Upgrade

12.2k

🏆

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots