Mono-InternVL - an OpenGVLab Collection

Mono-InternVL

Updated Oct 21

A Pioneering Monolithic MLLM

OpenGVLab/Mono-InternVL-2B

Image-Text-to-Text • Updated 4 days ago • 23.2k • 27

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

Paper • 2410.08202 • Published Oct 10 • 3