Prompt Relay: Inference-Time Temporal Control for Multi-Event Video Generation Paper • 2604.10030 • Published 5 days ago • 12 • 2
Eliciting Medical Reasoning with Knowledge-enhanced Data Synthesis: A Semi-Supervised Reinforcement Learning Approach Paper • 2604.11547 • Published 3 days ago • 4 • 2
Polyglot Teachers: Evaluating Language Models for Multilingual Synthetic Data Generation Paper • 2604.11290 • Published 3 days ago • 1 • 2
Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models Paper • 2604.02340 • Published 5 days ago • 6 • 2
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published 3 days ago • 23 • 2
Learning Long-term Motion Embeddings for Efficient Kinematics Generation Paper • 2604.11737 • Published 3 days ago • 4 • 2
SuperSuit: An Isomorphic Bimodal Interface for Scalable Mobile Manipulation Paper • 2603.06280 • Published Mar 6 • 1
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published 5 days ago • 67 • 3
Zero-shot World Models Are Developmentally Efficient Learners Paper • 2604.10333 • Published 5 days ago • 6 • 2
IceCache: Memory-efficient KV-cache Management for Long-Sequence LLMs Paper • 2604.10539 • Published 4 days ago • 1 • 2
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published 3 days ago • 62 • 2
Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic Tasks Paper • 2604.11753 • Published 3 days ago • 12 • 2
SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences? Paper • 2604.10718 • Published 4 days ago • 2 • 1