UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions Paper • 2511.03334 • Published Nov 5, 2025 • 52 • 6
Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training Paper • 2510.12586 • Published Oct 14, 2025 • 108 • 3
Differentiable Solver Search for Fast Diffusion Sampling Paper • 2505.21114 • Published May 27, 2025 • 13 • 2
VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models Paper • 2504.13122 • Published Apr 17, 2025 • 20 • 4
Simple and Effective Masked Diffusion Language Models Paper • 2406.07524 • Published Jun 11, 2024 • 12 • 2