Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark Paper • 2510.13759 • Published Oct 15, 2025 • 10
Medical and Scientific Literature Models Collection Models for working with medical and scientific literature. • 15 items • Updated 13 days ago • 9
💉 🛡️ Nano-Guard BERT Collection A collection of 3 nano BERT models fine-tuned for prompt injection detection. Recommended for fast inference and/or edge devices • 3 items • Updated Dec 2, 2025 • 2
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 17 days ago • 93
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 3 days ago • 41
SauerkrautLM-Multilingual-(Reason)-ColBERT Collection SauerkrautLM ColBERT is a suite of Late-Interaction retrieval models built with PyLate’s ColBERT architecture and tuned for seven European languages. • 7 items • Updated Aug 3, 2025 • 20
DocLayout-YOLO Collection Dataset and model for DocLayout-YOLO • 10 items • Updated Jan 14, 2025 • 20
One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation Paper • 2512.07829 • Published 27 days ago • 21
HyenaDNA Models Collection HyenaDNA models usable directly with Hugging Face classes like AutoModel. • 8 items • Updated Nov 14, 2023 • 20
Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated 12 days ago • 41
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Dec 4, 2025 • 184
view article Article CircleGuardBench: New Standard for Evaluating AI Moderation Models May 7, 2025 • 59
view article Article TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval about 1 month ago • 18
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing Paper • 2512.02589 • Published Dec 2, 2025 • 67