jason500's Collections
mutil big modal image2text
Updated Nov 15, 2024
OpenGVLab/InternVL-14B-224px • Image Feature Extraction • 14B • Updated Dec 9, 2024 • 215 • 35
openbmb/MiniCPM-V-2_6 • Image-Text-to-Text • Updated Jun 13, 2025 • 84.9k • 1.03k
RhapsodyAI/MiniCPM-V-Embedding-preview • Feature Extraction • Updated Aug 20, 2024 • 54 • 51
meta-llama/Llama-3.2-11B-Vision-Instruct • Image-Text-to-Text • Updated Dec 4, 2024 • 147k • 1.56k