ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training
Paper
• 2602.06820 • Published
ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training
CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs