OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation Paper • 2601.15369 • Published 4 days ago • 16
Towards Automated Kernel Generation in the Era of LLMs Paper • 2601.15727 • Published 3 days ago • 13
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding Paper • 2601.14724 • Published 4 days ago • 62
AgentEHR: Advancing Autonomous Clinical Decision-Making via Retrospective Summarization Paper • 2601.13918 • Published 5 days ago • 5
FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning Paper • 2601.11141 • Published 9 days ago • 18
Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics Paper • 2601.14027 • Published 5 days ago • 11
Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance Paper • 2601.14171 • Published 5 days ago • 44
MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents Paper • 2601.12346 • Published 7 days ago • 46
FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-Language Navigation Paper • 2601.13976 • Published 5 days ago • 20
UniX: Unifying Autoregression and Diffusion for Chest X-Ray Understanding and Generation Paper • 2601.11522 • Published 9 days ago • 17
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs Paper • 2601.13836 • Published 5 days ago • 34
Toward Efficient Agents: Memory, Tool learning, and Planning Paper • 2601.14192 • Published 5 days ago • 49
Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey Paper • 2601.11655 • Published 10 days ago • 59