FFAvatar: Few-Shot, Feed-Forward, and Generalizable Avatar Reconstruction Paper • 2605.15320 • Published 5 days ago • 5
Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning Paper • 2605.14386 • Published 5 days ago • 54
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs Paper • 2605.09063 • Published 10 days ago • 77
Soft Anisotropic Diagrams for Differentiable Image Representation Paper • 2604.21984 • Published 22 days ago • 5
Video Analysis and Generation via a Semantic Progress Function Paper • 2604.22554 • Published 25 days ago • 63
Coevolving Representations in Joint Image-Feature Diffusion Paper • 2604.17492 • Published about 1 month ago • 5
StyleID: A Perception-Aware Dataset and Metric for Stylization-Agnostic Facial Identity Recognition Paper • 2604.21689 • Published 26 days ago • 25
Hierarchical SVG Tokenization: Learning Compact Visual Programs for Scalable Vector Graphics Modeling Paper • 2604.05072 • Published Apr 10 • 18
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published Apr 11 • 81
GenLCA: 3D Diffusion for Full-Body Avatars from In-the-Wild Videos Paper • 2604.07273 • Published Apr 8 • 4
PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding Paper • 2604.00886 • Published Apr 1 • 6
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published Mar 23 • 125
AVControl: Efficient Framework for Training Audio-Visual Controls Paper • 2603.24793 • Published Mar 25 • 28