Article Welcome Gemma 4: Frontier multimodal intelligence on device 10 days ago • 816
Article Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents 11 days ago • 34
Article Introducing Cohere-transcribe: state-of-the-art speech recognition 16 days ago • 36
MolmoWeb Collection This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 6 items • Updated about 19 hours ago • 24
3DV 2026 Collection Collection of all the 3DV models, datasets and demos • 27 items • Updated 17 days ago • 4
GSFix3D Collection Diffusion model collections for paper "GSFix3D: Diffusion-Guided Repair of Novel Views in Gaussian Splatting" • 4 items • Updated Nov 18, 2025 • 2
MedTech Open Models Collection Open models for physical AI and medical imaging — robot control, surgical simulation, segmentation, reconstruction, generation, and reasoning. • 13 items • Updated 5 days ago • 7
Article ✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use Jan 3, 2025 • 24
Article How NVIDIA AI-Q Reached #1 on DeepResearch Bench I and II about 1 month ago • 30
Article IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST Feb 18 • 18
Article From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output Feb 7 • 22
Article Community Evals: Because we're done trusting black-box leaderboards over the community Feb 4 • 88