Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding Paper • 2512.17532 • Published Dec 19, 2025 • 68
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models Paper • 2512.20557 • Published Dec 23, 2025 • 51
V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models Paper • 2511.16668 • Published Nov 20, 2025 • 56
Perflow-Shuai/streaming_vlm_e1_lr2e-5_dt_rebuttal_stage2_ps512_pw512_from_qwen_run2-checkpoint-42-model 8B • Updated Nov 18, 2025 • 3
Perflow-Shuai/streaming_vlm_e1_lr2e-5_dt_rebuttal_stage2_ps512_pw512_from_qwen_run2-checkpoint-42-model 8B • Updated Nov 18, 2025 • 3
LongAI Collection Boost AI's Long ability, while keeping Efficient. Models in this collection includes LongVILA, LongVILA-R1, LongLive. • 8 items • Updated 15 days ago • 2
LongAI Collection Boost AI's Long ability, while keeping Efficient. Models in this collection includes LongVILA, LongVILA-R1, LongLive. • 8 items • Updated 15 days ago • 2
UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback Paper • 2511.01678 • Published Nov 3, 2025 • 38
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations Paper • 2510.23607 • Published Oct 27, 2025 • 181