Spaces:

open-llm-leaderboard
/

open_llm_leaderboard

Running on CPU Upgrade

[Question] Are no models being evaluated due to the lighteval/MATH-Hard dataset being unavailable?

#1071

by T145 - opened 2 days ago

T145

2 days ago

I noticed the backlog, tried to run tests myself, and came across this issue:
https://github.com/huggingface/lighteval/issues/494

NitoM

1 day ago

Is that why deepseek v3 is not in the list? I want to see this 685B monster crushing it.

Open LLM Leaderboard org about 19 hours ago

Models still have access to the dataset for evaluation - cluster is full however, hence evaluations go more slowly :)

clefourrier changed discussion status to closed about 19 hours ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment