wo-datacraft's Collections

Text Generation - Vision

updated Feb 2

  • google/gemma-3-27b-it

    Image-Text-to-Text • 27B • Updated Mar 21, 2025 • 1.4M • 1.91k

  • mistralai/Ministral-3-14B-Instruct-2512

    Updated Jan 15 • 210k • 262

  • Qwen/Qwen3-VL-30B-A3B-Instruct

    Image-Text-to-Text • Updated Nov 26, 2025 • 2.46M • 550

  • Qwen/Qwen3-VL-30B-A3B-Thinking

    Image-Text-to-Text • Updated Nov 26, 2025 • 173k • 191

  • moonshotai/Kimi-VL-A3B-Thinking-2506

    Image-Text-to-Text • Updated Jan 30 • 37.1k • 353

  • tencent/HunyuanOCR

    Image-Text-to-Text • Updated Jan 13 • 365k • 555

  • Qwen3 VL Demo 😻

    Running • Featured • 397

    Chat with an AI that understands text, images, and videos


  • Qwen3 VL Demo 😻

    Running • Featured • 109

    Chat with an AI assistant using text and images
