Hugging Face
Benjy's Collections
Multimodal
Image-to-Image
Image-to-Text
Speech Recognition
Text-to-Video
OCR
Image Models
Leading Research
Coding LLMs
Text to Image
Small LLMs
Leading LLMs
Multimodal
Updated 6 days ago
NexaAIDev/omnivision-968M
Updated 2 days ago • 5.78k • 338