Collections

Discover the best community collections!

Collections including paper arxiv:2310.03744
multilingual vision models
Some papers I read for understanding vision models and also adding multilingual capabilities to them
Multimodal Papers
Collection by Apr 22
Vision Language Models Papers 🖼️💬📝
Papers about vision-language models, most important ones are on top of the list.
LLaVa-NeXT
LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets.