view article Article Is it agentic enough? Benchmarking open models on your own tooling +1 lysandre, SaylorTwift, pcuenq • 3 days ago • 17
view article Article Is it agentic enough? Benchmarking open models on your own tooling +1 lysandre, SaylorTwift, pcuenq • 3 days ago • 17
view article Article The Open Source Community is backing OpenEnv for Agentic RL +16 burtenshaw, spisakjo, lysandre, darktex, willcb, qjoy, pawalt, cwing-nv, danielhanchen, andrewzhou, thegovind, shimmyshimmer, Hamid-Nazeri, Sanyam, zkwentz, emre0, lewtun, sergiopaniego • 13 days ago • 89
ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research Paper • 2606.07591 • Published 24 days ago • 93
view article Article Designing the hf CLI as an agent-optimized way to work with the Hub celinah, Wauplin • 17 days ago • 57