Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ceyda
's Collections
Korean Models
Useful Tools
vid-gen
Clips
VQA (Image captioning,QA)
Color
Nice~
Fashion
Cool names
VQA (Image captioning,QA)
updated
Aug 7
Upvote
-
Runtime error
35
📊
FuseCap
Running
on
T4
414
💻
Kosmos 2
Running
6
🚀
Vilt Nlvr
Build error
125
⚡
Qwen VL
Running
on
T4
373
🔥
LLaVA
Runtime error
308
👁
Fuyu Multimodal
Sleeping
157
🚀
MoE LLaVA
Running
on
Zero
166
🐨
IDEFICS2 Playground
Running
on
Zero
82
🐐
CuMo 7b Zero
Running
on
Zero
274
🐬
Chat with DeepSeek VL 7B
What matters when building vision-language models?
Paper
•
2405.02246
•
Published
May 3
•
98
Running
on
Zero
366
🌔
moondream2
a tiny vision language model
Running
on
Zero
94
📊
Idefics3
Upvote
-
Share collection
View history
Collection guide
Browse collections