JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion Paper • 2601.22143 • Published Jan 29 • 8
AVControl: Efficient Framework for Training Audio-Visual Controls Paper • 2603.24793 • Published 7 days ago • 25
AVControl: Efficient Framework for Training Audio-Visual Controls Paper • 2603.24793 • Published 7 days ago • 25
JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion Paper • 2601.22143 • Published Jan 29 • 8
Gaussian Mixture Generative Adversarial Networks for Diverse Datasets, and the Unsupervised Clustering of Images Paper • 1808.10356 • Published Aug 30, 2018 • 1
AVControl: Efficient Framework for Training Audio-Visual Controls Paper • 2603.24793 • Published 7 days ago • 25
AVControl: Efficient Framework for Training Audio-Visual Controls Paper • 2603.24793 • Published 7 days ago • 25
view post Post 23444 Want to iterate on a Hugging Face Space with an LLM? Now you can easily convert any HF entire repo (Model, Dataset or Space) to a text file and feed it to a language model! multimodalart/repo2txt See translation 1 reply · 🤗 3 3 👍 2 2 🚀 1 1 + Reply
view post Post 18280 Self-Forcing - a real-time video distilled model from Wan 2.1 by @adobe is out, and they open sourced it 🐐I've built a live real time demo on Spaces 📹💨 multimodalart/self-forcing See translation 6 replies · ❤️ 12 12 🔥 6 6 + Reply