majentik/Nemotron-3-Nano-30B-A3B-TurboQuant-MLX-4bit Text Generation • 32B • Updated Apr 17 • 178 • 1
Article We’re open-sourcing our text-to-image model and the process behind it Photoroom • Nov 12, 2025 • 99
KTO: Model Alignment as Prospect Theoretic Optimization Paper • 2402.01306 • Published Feb 2, 2024 • 22
Featured The Smol Training Playbook 📚 • 3.18k • The secrets to building world-class LLMs
deployed-models Collection Models that are currently deployed by the hf-inference provider • 1585 items • Updated 4 days ago • 39
🛩️Qwen3-VL Collection the most powerful vision-language model in the Qwen series to date. Available in Dense and MoE architectures • 5 items • Updated Oct 15, 2025
<7B Best of MoE 🧠 Collection Collection of Small size big impact MoE. • 4 items • Updated Oct 10, 2025