Perception-Aware Policy Optimization for Multimodal Reasoning Paper • 2507.06448 • Published Jul 8, 2025 • 47
AR-RAG: Autoregressive Retrieval Augmentation for Image Generation Paper • 2506.06962 • Published Jun 8, 2025 • 28
LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer Paper • 2506.06952 • Published Jun 8, 2025 • 9
Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models Paper • 2504.20157 • Published Apr 28, 2025 • 37