AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts Paper • 2601.20730 • Published 18 days ago • 19