Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
vlm
custom_code
AutoTrain Compatible
Inference Endpoints
text-generation-inference
4-bit precision
Misc with no match
Eval Results
Merge
text-embeddings-inference
8-bit precision
Carbon Emissions
Mixture of Experts
Apply filters
Models
32
Full-text search
Edit filters
Sort: Trending
Active filters:
vlm
Clear all
unum-cloud/uform-gen2-qwen-500m
Image-to-Text
•
Updated
Apr 24
•
26.9k
•
71
unum-cloud/uform-gen
Image-to-Text
•
Updated
Dec 31, 2023
•
399
•
42
unum-cloud/uform-gen-chat
Visual Question Answering
•
Updated
Dec 31, 2023
•
84
•
20
4bit/uform-gen
Image-to-Text
•
Updated
Dec 31, 2023
•
17
•
2
unum-cloud/uform-gen2-dpo
Image-to-Text
•
Updated
Apr 24
•
4.06k
•
42
MonolithFoundation/Bumblebee
Text Generation
•
Updated
Apr 28
•
9
•
4
sujet-ai/Lutece-Vision-Base
Image-to-Text
•
Updated
Jul 14
•
26
•
5
TIGER-Lab/Mantis-8B-siglip-llama3
Image-Text-to-Text
•
Updated
7 days ago
•
3.97k
•
31
TIGER-Lab/Mantis-8B-clip-llama3
Image-Text-to-Text
•
Updated
7 days ago
•
1.35k
•
1
TIGER-Lab/Mantis-8B-Fuyu
Text Generation
•
Updated
May 4
•
234
•
4
MischaQI/SNIFFER
Updated
May 15
•
1
TIGER-Lab/Mantis-8B-Idefics2
Image-Text-to-Text
•
Updated
7 days ago
•
534
•
10
hiyouga/PaliGemma-3B-Chat-v0.1
Image-Text-to-Text
•
Updated
Jul 1
•
170
•
11
BUAADreamer/PaliGemma-3B-Chat-v0.2
Image-Text-to-Text
•
Updated
Jun 5
•
67
•
6
JosefAlbers/Phi-3-vision-128k-instruct-mlx
Updated
Jun 16
•
2
•
1
AlanaAI/AlanaVLM
Updated
Jul 4
amitha/mllava-baichuan2-en
Visual Question Answering
•
Updated
Jun 19
•
2
amitha/mllava-baichuan2-zh
Visual Question Answering
•
Updated
Jun 19
•
3
amitha/mllava-baichuan2-en-zh
Visual Question Answering
•
Updated
Jun 19
•
4
amitha/mllava-llama2-en
Visual Question Answering
•
Updated
Jun 19
•
6
amitha/mllava-llama2-zh
Visual Question Answering
•
Updated
Jun 19
•
1
amitha/mllava-llama2-en-zh
Visual Question Answering
•
Updated
Jun 19
•
3
variante/llava-1.5-7b-llara-D-inBC-VIMA-80k
Image-Text-to-Text
•
Updated
Jul 13
•
14
•
1
variante/llava-1.5-7b-llara-D-inBC-Aux-D-VIMA-80k
Image-Text-to-Text
•
Updated
Jul 13
•
12
•
1
variante/llara-maskrcnn
Object Detection
•
Updated
Jul 1
•
1
variante/llava-1.5-7b-llara-D-inBC-Aux-B-VIMA-80k
Image-Text-to-Text
•
Updated
Jul 15
•
233
•
1
sachin/vlm-mobilenetv4-smolLM
Updated
Aug 18
•
1
variante/llava-1.5-7b-llara-D-RT2-Style-VIMA-80k
Image-Text-to-Text
•
Updated
Aug 28
•
2
cyan2k/molmo-7B-O-bnb-4bit
Text Generation
•
Updated
Sep 26
•
1.13k
•
8
impactframes/molmo-7B-O-bnb-4bit
Text Generation
•
Updated
Oct 2
•
12
Previous
1
2
Next