MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published about 15 hours ago • 161
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 7 days ago • 218
view article Article Python Is All You Need? Introducing Dria-Agent-α By andthattoo • 5 days ago • 19
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 7 days ago • 75
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper • 2412.19723 • Published 19 days ago • 78
view article Article 🐺🐦⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • 13 days ago • 37
view article Article Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️ By Sri-Vigneshwar-DJ • 10 days ago • 4
A New Approach for Explainable Multiple Organ Annotation with Few Data Paper • 1912.12932 • Published Dec 30, 2019 • 1
view article Article 🇪🇺✍️ EU AI Act: Systemic Risks in the First CoP Draft Comments ✍️🇪🇺 By yjernite • Dec 12, 2024 • 12
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials Paper • 2412.09605 • Published Dec 12, 2024 • 27
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents Paper • 2401.00812 • Published Jan 1, 2024 • 4
Awesome Computer Use Agents Collection https://github.com/ranpox/awesome-computer-use • 25 items • Updated 27 days ago • 7
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published Dec 5, 2024 • 59
view article Article 🐺🐦⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • Dec 4, 2024 • 76
ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper • 2411.17465 • Published Nov 26, 2024 • 78