Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Building on HF
266.4
TFLOPS
686
1312
4964
Victor Mustar
PRO
victor
Follow
olseth's profile picture
xiaoxiaofeifei's profile picture
shiqimei's profile picture
5,445 followers
·
1,723 following
victormustar
AI & ML interests
Building the UX of this website
Recent Activity
reacted
to
Juanxi
's
post
with 🔥
about 14 hours ago
📢 Awesome Multimodal Modeling We introduce Awesome Multimodal Modeling, a curated repository tracing the architectural evolution of multimodal intelligence—from foundational fusion to native omni-models. 🔹 Taxonomy & Evolution: Traditional Multimodal Learning – Foundational work on representation, fusion, and alignment. Multimodal LLMs (MLLMs) – Architectures connecting vision encoders to LLMs for understanding. Unified Multimodal Models (UMMs) – Models unifying Understanding + Generation via Diffusion, Autoregressive, or Hybrid paradigms. Native Multimodal Models (NMMs) – Models trained from scratch on all modalities; contrasts early vs. late fusion under scaling laws. 💡 Key Distinction: UMMs unify tasks via generation heads; NMMs enforce interleaving through joint pre-training. 🔗 Explore & Contribute: https://github.com/OpenEnvision-Lab/Awesome-Multimodal-Modeling
liked
a model
about 16 hours ago
MiniMaxAI/MiniMax-M2.7
liked
a Space
1 day ago
manasha2006/FoodCrisisEnv
View all activity
Organizations
victor
's buckets
9
victor/snapshots
3.18 MB
victor/qwen35-test-results
648 kB
victor/qwen35-test-scripts
47.8 kB
victor/autotrain-japanese-qwen35-2b
10.8 kB
victor/training-artifacts-v2
5.22 MB
victor/misc
175 kB
victor/test2323
0 Bytes
victor/caca2
0 Bytes
victor/hello
93 Bytes