Edit Models filters

Inference status

Misc

Inference Endpoints

AutoTrain Compatible

Carbon Emissions

text-generation-inference

4-bit precision

8-bit precision

Mixture of Experts

Misc with no match

text-embeddings-inference

Models

2,648

Full-text search

Active filters: vision

llava-hf/llava-onevision-qwen2-0.5b-ov-hf

Image-Text-to-Text • Updated 2 days ago • 69.2k • 15

tencent/DepthCrafter

Depth Estimation • Updated Sep 24 • 370k • 67

OpenGVLab/Mono-InternVL-2B

Image-Text-to-Text • Updated 4 days ago • 23.2k • 27

WinKawaks/vit-small-patch16-224

Image Classification • Updated Mar 18, 2023 • 1.27M • 16

naver-clova-ix/donut-base

Image-to-Text • Updated Aug 13, 2022 • 39.9k • 177

naver-clova-ix/donut-base-finetuned-docvqa

Document Question Answering • Updated Mar 9 • 16.7k • 204

MCG-NJU/videomae-large

Video Classification • Updated Apr 1 • 226k • 18

CIDAS/clipseg-rd64-refined

Image Segmentation • Updated Jan 4, 2023 • 10M • 115

Salesforce/blip2-opt-2.7b

Image-to-Text • Updated Mar 22 • 468k • 316

Salesforce/instructblip-vicuna-7b

Image-to-Text • Updated Apr 12 • 272k • 85

paragon-AI/blip2-image-to-text

Image-to-Text • Updated Jun 24, 2023 • 369 • 21

facebook/dinov2-base

Image Feature Extraction • Updated Jan 17 • 9.19M • 85

facebook/dpt-dinov2-base-kitti

Depth Estimation • Updated Nov 13, 2023 • 1.43k • 2

MahmoodLab/CONCH

Image Feature Extraction • Updated May 5 • 54.9k • 90

llava-hf/llava-v1.6-mistral-7b-hf

Image-Text-to-Text • Updated 2 days ago • 1.15M • 234

Aryn/deformable-detr-DocLayNet

Object Detection • Updated Aug 1 • 33.1k • 35

MahmoodLab/UNI

Image Feature Extraction • Updated May 10 • 77.1k • 191

HuggingFaceM4/idefics2-8b

Image-Text-to-Text • Updated Oct 14 • 25k • 589

microsoft/llava-med-v1.5-mistral-7b

Image-Text-to-Text • Updated May 14 • 16.9k • 49

jinaai/jina-clip-v1

Feature Extraction • Updated Sep 11 • 65.6k • 227

OpenGVLab/InternVL2-8B

Image-Text-to-Text • Updated about 6 hours ago • 126k • 148

OpenGVLab/InternVL2-1B

Image-Text-to-Text • Updated about 6 hours ago • 104k • 54

llava-hf/llama3-llava-next-8b-hf

Image-Text-to-Text • Updated 2 days ago • 73.1k • 29

multimodalart/Florence-2-large-no-flash-attn

Image-Text-to-Text • Updated Aug 29 • 56.2k • 8

Sense-X/uniformer_image

Image Classification • Updated Feb 9, 2022 • 5

WinKawaks/vit-tiny-patch16-224

Image Classification • Updated Mar 30, 2023 • 5.03M • 17

facebook/detr-resnet-101

Object Detection • Updated Dec 14, 2023 • 214k • 113

facebook/detr-resnet-50-dc5

Object Detection • Updated Sep 7, 2023 • 2.99k • 6

facebook/detr-resnet-50-panoptic

Image Segmentation • Updated Apr 10 • 11.7k • 128

facebook/dino-vitb16

Image Feature Extraction • Updated May 22, 2023 • 270k • 104