LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 8 days ago • 59
AffordanceVLA: A Vision-Language-Action Model Empowering Action Generation through Affordance-Aware Understanding Paper • 2606.06155 • Published 8 days ago • 10
Neural Networks Provably Learn Spectral Representations for Group Composition Paper • 2606.02993 • Published 10 days ago • 6
Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling Paper • 2606.03102 • Published 9 days ago • 14
Speculative Pipeline Decoding: Higher-Accuracy and Zero-Bubble Speculation via Pipeline Parallelism Paper • 2605.30852 • Published 14 days ago • 10
ZeroUnlearn: Few-Shot Knowledge Unlearning in Large Language Models Paper • 2605.18879 • Published 23 days ago • 8
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning Paper • 2605.28774 • Published 16 days ago • 90
Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling Paper • 2605.27030 • Published 17 days ago • 31
SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published 21 days ago • 225
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories Paper • 2605.21468 • Published 23 days ago • 50
The Unlearnability Phenomenon in RLVR for Language Models Paper • 2605.16787 • Published 27 days ago • 6
G-Zero: Self-Play for Open-Ended Generation from Zero Data Paper • 2605.09959 • Published May 11 • 17
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published May 8 • 69
Rethinking the Reranker: Boundary-Aware Evidence Selection for Robust Retrieval-Augmented Generation Paper • 2602.03689 • Published Feb 3