Improved Baselines with Visual Instruction Tuning
Paper
•
2310.03744
•
Published
•
37
LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets.