view post Post 10935 deepseek-ai/DeepSeek-OCR is out! 🔥 my take ⤵️ > pretty insane it can parse and re-render charts in HTML> it uses CLIP and SAM features concatenated, so better grounding> very efficient per vision tokens/performance ratio> covers 100 languages See translation
Apr 3 Releases netflix/void-model Video-to-Video • Updated about 22 hours ago • 528 arcee-ai/Trinity-Large-Thinking Text Generation • 399B • Updated 5 days ago • 7.74k • • 125 KRAFTON/Raon-VisionEncoder Feature Extraction • Updated 6 days ago • 438 • 18 KRAFTON/Raon-SpeechChat-9B Audio-to-Audio • 10B • Updated about 6 hours ago • 344 • 21
super cool vision language datasets ServiceNow/ui-vision Viewer • Updated May 7, 2025 • 1.46k • 4.96k • 21 xxxllz/Chart2Code-160k Updated Jul 7, 2025 • 234 • 11 ReCAP-Agent/ReCAP-187k-SFT Viewer • Updated 12 days ago • 188k • 38 • 6 allenai/MolmoPoint-GUISyn Viewer • Updated 5 days ago • 37k • 750 • 10
Apr 3 Releases netflix/void-model Video-to-Video • Updated about 22 hours ago • 528 arcee-ai/Trinity-Large-Thinking Text Generation • 399B • Updated 5 days ago • 7.74k • • 125 KRAFTON/Raon-VisionEncoder Feature Extraction • Updated 6 days ago • 438 • 18 KRAFTON/Raon-SpeechChat-9B Audio-to-Audio • 10B • Updated about 6 hours ago • 344 • 21
super cool vision language datasets ServiceNow/ui-vision Viewer • Updated May 7, 2025 • 1.46k • 4.96k • 21 xxxllz/Chart2Code-160k Updated Jul 7, 2025 • 234 • 11 ReCAP-Agent/ReCAP-187k-SFT Viewer • Updated 12 days ago • 188k • 38 • 6 allenai/MolmoPoint-GUISyn Viewer • Updated 5 days ago • 37k • 750 • 10
Running on CPU Upgrade 18 Daggr Image To 3d 👀 Convert images into 3D assets with background removal and enhancement