ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models Paper • 2603.19466 • Published 12 days ago • 41
Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 124
Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated Dec 23, 2025 • 49
Article Model statistics of the 50 most downloaded entities on Hugging Face Oct 13, 2025 • 39
Article Entropic Instruction Following: Does Semantic Coherence Help LLMs Follow Instructions? Dec 3, 2025 • 1
Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 185
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face +3 Jul 29, 2025 • 220
Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 Jul 1, 2025 • 138
Article Bigger isn't always better: how to choose the most efficient model for context-specific tasks 🌱🧑🏼💻 May 28, 2025 • 22
Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 • 254
Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? +5 May 11, 2025 • 94