MosaicMem: Hybrid Spatial Memory for Controllable Video World Models Paper • 2603.17117 • Published 4 days ago • 82
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild Paper • 2603.17187 • Published 4 days ago • 115
ESPIRE: A Diagnostic Benchmark for Embodied Spatial Reasoning of Vision-Language Models Paper • 2603.13033 • Published 8 days ago • 13
GradMem: Learning to Write Context into Memory with Test-Time Gradient Descent Paper • 2603.13875 • Published 7 days ago • 29
Learning Latent Proxies for Controllable Single-Image Relighting Paper • 2603.15555 • Published 5 days ago • 8
From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space Paper • 2603.12648 • Published 9 days ago • 12
DVD: Deterministic Video Depth Estimation with Generative Priors Paper • 2603.12250 • Published 9 days ago • 26
Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models Paper • 2602.10224 • Published Feb 10 • 19
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning Paper • 2602.10560 • Published Feb 11 • 30
VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model Paper • 2602.10098 • Published Feb 10 • 19
BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation Paper • 2602.09849 • Published Feb 10 • 16
Olaf-World: Orienting Latent Actions for Video World Modeling Paper • 2602.10104 • Published Feb 10 • 27
LatentMem: Customizing Latent Memory for Multi-Agent Systems Paper • 2602.03036 • Published Feb 3 • 14
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents Paper • 2602.02474 • Published Feb 2 • 60