ypwang61/One-Shot-RLVR-R1-Distill-1.5B-1.2k-dsr-sub • Text Generation • 2B • Updated Aug 27, 2025 • 2
ypwang61/One-Shot-RLVR-Qwen2.5-Math-7B-1.2k-dsr-sub • Text Generation • 8B • Updated Aug 27, 2025 • 112
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-7.5k-MATH • Text Generation • 2B • Updated Aug 27, 2025 • 55
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-1.2k-dsr-sub • Text Generation • 2B • Updated Aug 27, 2025 • 55
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-pi1_pi13 • Text Generation • 2B • Updated May 19, 2025 • 3
ypwang61/One-Shot-RLVR-Qwen2.5-Math-7B-pi1_pi13 • Text Generation • 8B • Updated May 19, 2025 • 11