Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills Paper • 2603.25158 • Published 11 days ago • 48
MM-CondChain: A Programmatically Verified Benchmark for Visually Grounded Deep Compositional Reasoning Paper • 2603.12266 • Published 25 days ago • 19
The Trinity of Consistency as a Defining Principle for General World Models Paper • 2602.23152 • Published Feb 26 • 201
Remote Sensing Referring Expression Understanding Collection REU task for RS. • 5 items • Updated Oct 2, 2025 • 1
Remote Sensing Referring Expression Understanding Collection REU task for RS. • 5 items • Updated Oct 2, 2025 • 1