Hugging Face
Models: 2,648
Sort: Trending
Active filters: vision
llava-hf/llava-onevision-qwen2-0.5b-ov-hf • Image-Text-to-Text • Updated 2 days ago • 69.2k • 15
tencent/DepthCrafter • Depth Estimation • Updated Sep 24 • 370k • 67
OpenGVLab/Mono-InternVL-2B • Image-Text-to-Text • Updated 4 days ago • 23.2k • 27
WinKawaks/vit-small-patch16-224 • Image Classification • Updated Mar 18, 2023 • 1.27M • 16
naver-clova-ix/donut-base • Image-to-Text • Updated Aug 13, 2022 • 39.9k • 177
naver-clova-ix/donut-base-finetuned-docvqa • Document Question Answering • Updated Mar 9 • 16.7k • 204
MCG-NJU/videomae-large • Video Classification • Updated Apr 1 • 226k • 18
CIDAS/clipseg-rd64-refined • Image Segmentation • Updated Jan 4, 2023 • 10M • 115
Salesforce/blip2-opt-2.7b • Image-to-Text • Updated Mar 22 • 468k • 316
Salesforce/instructblip-vicuna-7b • Image-to-Text • Updated Apr 12 • 272k • 85
paragon-AI/blip2-image-to-text • Image-to-Text • Updated Jun 24, 2023 • 369 • 21
facebook/dinov2-base • Image Feature Extraction • Updated Jan 17 • 9.19M • 85
facebook/dpt-dinov2-base-kitti • Depth Estimation • Updated Nov 13, 2023 • 1.43k • 2
MahmoodLab/CONCH • Image Feature Extraction • Updated May 5 • 54.9k • 90
llava-hf/llava-v1.6-mistral-7b-hf • Image-Text-to-Text • Updated 2 days ago • 1.15M • 234
Aryn/deformable-detr-DocLayNet • Object Detection • Updated Aug 1 • 33.1k • 35
MahmoodLab/UNI • Image Feature Extraction • Updated May 10 • 77.1k • 191
HuggingFaceM4/idefics2-8b • Image-Text-to-Text • Updated Oct 14 • 25k • 589
microsoft/llava-med-v1.5-mistral-7b • Image-Text-to-Text • Updated May 14 • 16.9k • 49
jinaai/jina-clip-v1 • Feature Extraction • Updated Sep 11 • 65.6k • 227
OpenGVLab/InternVL2-8B • Image-Text-to-Text • Updated about 6 hours ago • 126k • 148
OpenGVLab/InternVL2-1B • Image-Text-to-Text • Updated about 6 hours ago • 104k • 54
llava-hf/llama3-llava-next-8b-hf • Image-Text-to-Text • Updated 2 days ago • 73.1k • 29
multimodalart/Florence-2-large-no-flash-attn • Image-Text-to-Text • Updated Aug 29 • 56.2k • 8
Sense-X/uniformer_image • Image Classification • Updated Feb 9, 2022 • 5
WinKawaks/vit-tiny-patch16-224 • Image Classification • Updated Mar 30, 2023 • 5.03M • 17
facebook/detr-resnet-101 • Object Detection • Updated Dec 14, 2023 • 214k • 113
facebook/detr-resnet-50-dc5 • Object Detection • Updated Sep 7, 2023 • 2.99k • 6
facebook/detr-resnet-50-panoptic • Image Segmentation • Updated Apr 10 • 11.7k • 128
facebook/dino-vitb16 • Image Feature Extraction • Updated May 22, 2023 • 270k • 104