ZhuofengLi/tool-n1-multi-turn-reason-lora-sft-1180-step Text Generation • 8B • Updated Jul 14, 2025 • 4
ZhuofengLi/pot-r1-grpo-qwen2.5-1.5b-Instruct-wo-warmup Text Generation • 2B • Updated Mar 28, 2025 • 6