Search-R1 Collection Preliminary checkpoints with outcome-only RL. • 15 items • Updated Aug 12, 2025 • 17
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B Text Generation • 15B • Updated Feb 24, 2025 • 770k • • 613
unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF Text Generation • 31B • Updated Jan 30 • 181k • 526
Qwen/Qwen3-Coder-480B-A35B-Instruct Text Generation • 480B • Updated Aug 21, 2025 • 71.2k • • 1.31k
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF Text Generation • 480B • Updated Jul 31, 2025 • 3.41k • 173