Training & test sets and finetuned models
-
Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training
Paper • 2510.04996 • Published • 15 -
weqweasdas/math500
Viewer • Updated • 500 • 259 -
weqweasdas/aime_hmmt_brumo_cmimc_amc23
Viewer • Updated • 230 • 93 -
weqweasdas/olympiadbench
Viewer • Updated • 675 • 256