DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Paper • 2511.19399 • Published Nov 24, 2025 • 60
faezeb/tulu3_rewritten_400k_rubrics-final-nocode-filtered-ot2 Viewer • Updated Aug 18, 2025 • 44.4k • 11
faezeb/tulu3_rewritten_400k_rubrics-final-nocode-filtered-ot2 Viewer • Updated Aug 18, 2025 • 44.4k • 11
faezeb/tulu3_rewritten_400k_rubrics-single-verifiable-nocode-filetered-ot2 Viewer • Updated Aug 14, 2025 • 215k • 10
faezeb/tulu3_rewritten_400k_rubrics-single-verifiable-nocode-filetered-ot2 Viewer • Updated Aug 14, 2025 • 215k • 10
faezeb/verifiable-reasoning-v3-o4-mini-length-filtered-verified Viewer • Updated Aug 12, 2025 • 236k • 31
faezeb/verifiable-reasoning-v3-o4-mini-length-filtered-verified Viewer • Updated Aug 12, 2025 • 236k • 31